HPCA - 16 2010 the Sixteenth International Symposium on High-Performance Computer Architecture 2010
DOI: 10.1109/hpca.2010.5416644
|View full text |Cite
|
Sign up to set email alerts
|

StimulusCache: Boosting performance of chip multiprocessors with excess cache

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2011
2011
2013
2013

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 16 publications
0
6
0
Order By: Relevance
“…The state-of-the-art nonuniform cache architecture (NUCA) management schemes, such as RNUCA [Hardavellas et al 2009] and page-recoloring scheme [Cho and Jin 2006], intend to place cache blocks near their most frequent requestors by smart initial placement, dynamic migration, and replication. Moreover, cache partitioning schemes [Qureshi and Patt 2006;Lee et al 2010; can be used so that a cluster of cores are only going to access a locally allocated cache partition. Communication between these partitions is only required when there are coherence invalidations and fills.…”
Section: Hierarchical Stream Arbitrationmentioning
confidence: 99%
“…The state-of-the-art nonuniform cache architecture (NUCA) management schemes, such as RNUCA [Hardavellas et al 2009] and page-recoloring scheme [Cho and Jin 2006], intend to place cache blocks near their most frequent requestors by smart initial placement, dynamic migration, and replication. Moreover, cache partitioning schemes [Qureshi and Patt 2006;Lee et al 2010; can be used so that a cluster of cores are only going to access a locally allocated cache partition. Communication between these partitions is only required when there are coherence invalidations and fills.…”
Section: Hierarchical Stream Arbitrationmentioning
confidence: 99%
“…It avoids excessive replication of shared data and places private data in local L2 banks. StimulusCache [19] introduced techniques to utilize "excess caches" when some cores are disabled to improve the chip yield. Lastly, Elastic Cooperative Caching (ECC) [20] uses a distributed coherence engine for scalability.…”
Section: Related Workmentioning
confidence: 99%
“…The thread on this core needs a capacity of 6, which can be provided locally. Lastly, the cores in the core ID list form a "virtual L2 cache chain," somewhat similar to [19]. For example, when core 4 has a miss, the access is directed to core 1, then to core 5, and so on (from the MRU position to later positions).…”
Section: Cloudcachementioning
confidence: 99%
See 1 more Smart Citation
“…Another approach is to detect sharedresource contention dynamically and take counter measures to mitigate the contention [8,11,15,21,24,30]. Several other recent studies address contention for shared LLC by using different hardware techniques [7,12,16,19,23,25].…”
Section: Introductionmentioning
confidence: 99%