32nd International Symposium on Computer Architecture (ISCA'05)
DOI: 10.1109/isca.2005.31
|View full text |Cite
|
Sign up to set email alerts
|

Improving Multiprocessor Performance with Coarse-Grain Coherence Tracking

Abstract: (and up to 21.7%).

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
27
0

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 70 publications
(29 citation statements)
references
References 22 publications
0
27
0
Order By: Relevance
“…As used in CGCT [3], this technique can reduce snoops effectively on clean data. However, it may increase the latencies of L2 misses for content-shared data, since it does not check whether there are cached copies, which can be obtained by fast on-chip cache-to-cache transfers.…”
Section: Support For Content-based Sharingmentioning
confidence: 99%
See 1 more Smart Citation
“…As used in CGCT [3], this technique can reduce snoops effectively on clean data. However, it may increase the latencies of L2 misses for content-shared data, since it does not check whether there are cached copies, which can be obtained by fast on-chip cache-to-cache transfers.…”
Section: Support For Content-based Sharingmentioning
confidence: 99%
“…RegionScount maintains region-based coherence filters at requesting nodes to avoid broadcasting snoop requests for private data [2]. Coarse-grain coherence tracking (CGCT) also uses additional coarse-grained coherence tags for each cache and tracks the private or shared states of regions, in addition to the conventional cacheline-unit coherence [3]. Snoop requests are either broadcast or sent directly to the memory depending on the coarse grain states.…”
Section: Related Workmentioning
confidence: 99%
“…Coarse-grain coherence tracking [7] and RegionScout [33] both propose mechanisms to reduce coherence traffic in broadcast-based systems by managing coherence at a coarser granularity than a line. While these techniques can reduce storage costs, both mechanisms impose restrictions on alignment and sizing of coherence regions and may lead to increased message traffic; both are situations we wish to avoid with Cohesion.…”
Section: Hardware Schemesmentioning
confidence: 99%
“…A preliminary evaluation of RegionScout appears in [26]. Cantin, Lipast and Smith have also proposed exploiting coarse sharing for snoop coherence bandwidth reduction [8].…”
Section: (A) Locall2mentioning
confidence: 99%