We show that building very large caches can be an effective way to narrow the growing CPU-memory speed gap. Building a cache as a combination of DRAM and SRAM yields a system with a relatively low cost per bit while achieving performance that is close to that of a system with a very large cache made of (expensive) fast SRAM. We demonstrate that ISDRAM cache, containing SRAM buffers that are only 1/64th the size of DRAM, performs almost as well as a conventional SRAM cache of the same size. "Back of the envelope" estimates show that the ISDRAM bit-density should be approximately 1/2 the bit-density of commercial DRAM parts and at least 5x denser than a conventional on-chip SRAM cache of the same size.
Junyi Xie and Gershon Kedem, The Case for Building High Performance
Intermediate-Level Memory Systems,
PDF format.