2005
DOI: 10.1109/mdt.2005.134
|View full text |Cite
|
Sign up to set email alerts
|

Bridging the Processor-Memory Performance Gapwith 3D IC Technology

Abstract: MICROPROCESSOR PERFORMANCE has been improving at roughly 60% per year. Memory access times, however, have improved by less than 10% per year. 1 The resulting gap between logic and memory performance has forced microprocessor designs toward complex and power-hungry architectures that support out-of-order and speculative execution. Moreover, processors have been designed with increasingly large cache hierarchies to hide main memory latency. This article examines how 3D IC technology can improve interactions betw… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

2
86
0

Year Published

2007
2007
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 249 publications
(90 citation statements)
references
References 16 publications
2
86
0
Order By: Relevance
“…Various mechanisms have been proposed to exploit memory bandwidth and improve the cache performance ( [3] and [4]). The stream buffer proposed in [5] and [6] is one of the most efficient structures to exploit memory bandwidth and reduce memory access latency.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Various mechanisms have been proposed to exploit memory bandwidth and improve the cache performance ( [3] and [4]). The stream buffer proposed in [5] and [6] is one of the most efficient structures to exploit memory bandwidth and reduce memory access latency.…”
Section: Related Workmentioning
confidence: 99%
“…The latency of the 2D off-chip memory is obtained from an actual x86 processor system by using LMbench tool [13]. The access latency of 3D stacked memory is obtained with the assumption for high bandwidth as used in [3] and [14]. The multi-way stream buffer is used and has 16-entries for each way as in [3].…”
Section: Related Workmentioning
confidence: 99%
“…In [5], Woo et al proposed a SMART-3D architecture, which efficiently utilized the TSVs to reduce long latency of fetching and write-back in an L2 cache. Liu et al [6] investigated various conventional schemes including a stream buffer to bridge the processor and memory performance gap based on the 3-D ICs. This paper introduces versatile stream buffer architecture with a victim cache to exploit massive memory bandwidth and to reduce memory traffic.…”
Section: Related Workmentioning
confidence: 99%
“…Some of this attention has focused on implementing a cache in 3D [25,29,40], even in the context of a multi-core NUCA layout [16]. Few studies have considered using additional dies entirely for SRAM or DRAM [2,17,18].…”
Section: Related Workmentioning
confidence: 99%