2009 Design, Automation &Amp; Test in Europe Conference &Amp; Exhibition 2009
DOI: 10.1109/date.2009.5090736
|View full text |Cite
|
Sign up to set email alerts
|

A power-efficient migration mechanism for D-NUCA caches

Abstract: D-NUCA L2 caches are able to tolerate the increasing wire delay effects due to technology scaling thanks to their banked organization, broadcast line search and data promotion/demotion mechanism. Data promotion mechanism aims at moving frequently accessed data near the core, but causes additional accesses on cache banks, hence increasing dynamic energy consumption. We shown how, in some cases, this migration mechanism is not successful in reducing data access latency and can be selectively and dynamically inhi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2009
2009
2018
2018

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 6 publications
0
11
0
Order By: Relevance
“…D-NUCA can mitigate the large latency associated with frequent remote block access under S-NUCA through gradually migrating the frequently accessed block to the requestor. However, the performance improvements yielded by block migration in D-NUCA could be limited by the ping-pong effect as well as the conflict hit phenomenon [4], [14]. In this work, we propose thread-aware migration policy to eliminate these issues.…”
Section: Background and Related Workmentioning
confidence: 90%
See 2 more Smart Citations
“…D-NUCA can mitigate the large latency associated with frequent remote block access under S-NUCA through gradually migrating the frequently accessed block to the requestor. However, the performance improvements yielded by block migration in D-NUCA could be limited by the ping-pong effect as well as the conflict hit phenomenon [4], [14]. In this work, we propose thread-aware migration policy to eliminate these issues.…”
Section: Background and Related Workmentioning
confidence: 90%
“…Moreover, the on-chip network power is obtained using Orion [23] which is integrated in GEMS framework. We compare the proposed TOM scheme with a baseline D-NUCA [1] substrate and the power-efficient migration scheme (PEM [24] for short). For TOM, the number of critical threads selected by thread categorization is set to 2.…”
Section: A Evaluation Methodologymentioning
confidence: 99%
See 1 more Smart Citation
“…As a consequence, while in the 8p topology both the applications succeed in bringing the blocks in the low latency lines, with the 4+4p topology barnes present an utilization of about 50% of the central lines. This phenomenon is known as conflict hit [23]: a thread accesses a block, and the block migrates in the next bank toward the requestor. However, another thread running on a cpu plugged at the opposite side of the D-NUCA accesses the same block, that consequently migrates back to the previous bank.…”
Section: Influence Of Mapping Topology and Applications Featuresmentioning
confidence: 98%
“…In CMP configurations in which processors are placed at different sides of the shared D-NUCA cache, such as our reference system shown in Figure 1, performance improvements due to the migration process can be limited by the conflict hit phenomenon [23]. In CMP systems, such problem is related to the presence of shared blocks that are accessed by threads running on processors placed at opposite sides of the shared cache.…”
Section: Introductionmentioning
confidence: 99%