2011 IEEE 17th International Symposium on High Performance Computer Architecture 2011
DOI: 10.1109/hpca.2011.5749731

CloudCache: Expanding and shrinking private caches

Abstract: The number of cores in a single chip multiprocessor is expected

Cited by 53 publications (49 citation statements)
References: 20 publications
“…In [32,42], the communication bottleneck to a central directory is mitigated by distributing hierarchical directories across the chip. More recent studies have focused on providing flexible sized clusters to best serve application demands [9,18,23,31,35]. LOCO provides an efficient mechanism for supporting flexible clusters while maintaining coherence and balancing cache utilization across clusters with IVR, unlike the complex hardware monitoring and remapping mechanisms required by prior techniques.…”
Section: Related Work (citation type: mentioning)
confidence: 99%
“…The clusters can be 1D or 2D meshes of any size, as shown in Figure 1. The size of the cluster depends on the applications' working sets and/or aggregate cache requirement [18,23,31,35], and is not the focus of this work. However, we do believe that HPC_max should drive the cluster sizes, since any cache within HPC_max hops can typically be accessed in 1 (X-only or Y-only) to 2 (X+Y) SMART-hops (which corresponds to 2 to 4 cycles low-load latency), as explained in Section 2.…”
Section: Background: SMART NoC (citation type: mentioning)
confidence: 99%
“…Both private and shared LLCs have their advantages and drawbacks, so hybrid configurations have been proposed to exploit the benefits of both design choices, such as ESP-NUCA [17] and CloudCache [18]. CMP-NuRAPID [19] decouples tags and data to allow data placement and replication in any LLC bank.…”
Section: Related Work (citation type: mentioning)
confidence: 99%
“…Some innovations also exist for implementing hybrid architectures by combining private LLC concepts and the distributed shared LLC concepts [21], [22], [23], [24], [25], [26].…”
Section: CMP with Distributed Shared Cache (citation type: mentioning)
confidence: 99%