2014
DOI: 10.1109/l-ca.2013.18
|View full text |Cite
|
Sign up to set email alerts
|

Cache Hierarchy Optimization

Abstract: Power consumption, off-chip memory bandwidth, chip area and Network on Chip (NoC) capacity are among main chip resources limiting the scalability of Chip Multiprocessors (CMP). A closed form analytical solution for optimizing the CMP cache hierarchy and optimally allocating area among hierarchy levels under such constrained resources is developed. The optimization framework is extended by incorporating the impact of data sharing on cache miss rate. An analytical model for cache access time as a function of cac… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
7
0

Year Published

2014
2014
2017
2017

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 14 publications
(30 reference statements)
1
7
0
Order By: Relevance
“…Larger caches are more expensive [26,27]: Area and static power increase roughly linearly with size, while access latency and energy scale roughly with its square root [65]. SRAM caches from 512 KB to 32 MB have access latencies from 9 to 45 cycles and access energies from 0.2 to 1.7 nJ, and stacked DRAM caches from 128 MB to 2 GB have access latencies from 42 to 74 cycles and energies from 4.4 nJ to 6 nJ.…”
Section: Motivationmentioning
confidence: 99%
“…Larger caches are more expensive [26,27]: Area and static power increase roughly linearly with size, while access latency and energy scale roughly with its square root [65]. SRAM caches from 512 KB to 32 MB have access latencies from 9 to 45 cycles and access energies from 0.2 to 1.7 nJ, and stacked DRAM caches from 128 MB to 2 GB have access latencies from 42 to 74 cycles and energies from 4.4 nJ to 6 nJ.…”
Section: Motivationmentioning
confidence: 99%
“…Krishna et al [5] researched the optimal area allocation between cores and cache. Yavits et al [19] developed an analytical model for cache hierarchy levels.…”
Section: Related Workmentioning
confidence: 99%
“…This framework can be extended to any number of private, shared or hybrid levels. Following [19], we assume that the access time of the LLC is approximated by power-law model: (22) Both T and the exponent p are found by fitting the power law (22) curve to the cache access time data generated by CACTI. For caches having several shared clients, the access time can be written as follows:…”
Section: B Processing Corementioning
confidence: 99%
“…A classical CMP architecture paradigm includes design choices such as symmetric vs. asymmetric CMP [18], number of cores vs. core size [18], cores vs. cache [1] [14] etc. When designing a 3D CMP, the computer architect must address an additional question: How does the temperature affect the number of cores of 3D CMP and their size?…”
Section: Introduction and Related Workmentioning
confidence: 99%