HPCA - 16 2010 the Sixteenth International Symposium on High-Performance Computer Architecture 2010
DOI: 10.1109/hpca.2010.5416628
|View full text |Cite
|
Sign up to set email alerts
|

An optimized 3D-stacked memory architecture by exploiting excessive, high-density TSV bandwidth

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
100
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 190 publications
(100 citation statements)
references
References 39 publications
0
100
0
Order By: Relevance
“…A different approach is followed by Loh, that in [9] considers 3D-DRAM stacked on top of multi-processors and revises the memory system organization in a 3D context. More recently, also Woo et al [10], have explored a memory architecture that exploits TSVs for connecting the last level cache to the 3D stacked DRAM. The work of Madan et al [11] instead, takes in consideration a 3D system composed by a DRAM layer and an SRAM cache banks layer on top of a processing layer.…”
Section: Related Workmentioning
confidence: 99%
“…A different approach is followed by Loh, that in [9] considers 3D-DRAM stacked on top of multi-processors and revises the memory system organization in a 3D context. More recently, also Woo et al [10], have explored a memory architecture that exploits TSVs for connecting the last level cache to the 3D stacked DRAM. The work of Madan et al [11] instead, takes in consideration a 3D system composed by a DRAM layer and an SRAM cache banks layer on top of a processing layer.…”
Section: Related Workmentioning
confidence: 99%
“…We assume a 3D-stacked DRAM cache that leverages high TSV (through silicon via) bandwidth [27]. We present a latencypower tradeoff with mixed SRAM and DRAM caches (SRAM/DRAM cache + PCRAM main memory).…”
Section: Cache Hierarchy With Heterogeneous Technologiesmentioning
confidence: 99%
“…Other CMPs have been designed in later years exploiting multiple 3-D-DRAM layers [16], [17]; these solutions showed the possibility to reorganize modules and interconnections in order to have a significant bandwidth increase, resulting in a relevant speedup in the routine execution. Loh's [18] solution demonstrated an achievable speed-up of 280% with respect to the baseline CMP (an Intel QuadCore) connected to off-chip DRAM.…”
Section: Related Workmentioning
confidence: 99%