Impact of on-chip network parameters on NUCA cache performances

Bardine, Alessandro; Comparetti, M.; Foglia, Pierfrancesco; Gabrielli, Giacomo; Prete, Cosimo Antonio

doi:10.1049/iet-cdt.2008.0078

Cited by 5 publications

(2 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The NoC is organized as a partial 2D mesh network, with 64 wormhole [Duato et al 2003] switches (one for each NUCA bank); NoC link latency has been calculated using the Berkeley Predictive Model [PTM 2007]. Details on the design and modeling of such a network can be found in Bardine et al [2008a]. Table I summarizes the configuration parameters for the considered CMP.…”

Section: Methodsmentioning

confidence: 99%

“…As for the static energy, we observe that the most important component comes from cache banks, whereas the contribution of switches is very low, about 6%. The small switch component comes from the adoption of simple NoC switches, which also relay on little input and output buffers (1-flit buffers) [Bardine et al 2008a]. The cache banks component is dominant, and it is the most influenced by the execution time reduction: the lower the execution time, the higher the reduction of the energy dissipation.…”

Section: Energy Consumption Evaluationmentioning

confidence: 99%

See 1 more Smart Citation

Exploiting replication to improve performances of NUCA-based CMP systems

Foglia

Solinas

2014

ACM Trans. Embed. Comput. Syst.

View full text Add to dashboard Cite

Improvements in semiconductor nanotechnology made chip multiprocessors the reference architecture for high-performance microprocessors. CMPs usually adopt large Last-Level Caches (LLC) shared among cores and private L1 caches, whose performances depend on the wire-delay dominated response time of LLC. NUCA (NonUniform Cache Architecture) caches represent a viable solution for tolerating wire-delay effects. In this article, we present Re-NUCA, a NUCA cache that exploits replication of blocks inside the LLC to avoid performance limitations of D-NUCA caches due to conflicting access to shared data. Results show that a Re-NUCA LLC permits to improve performances of more than 5% on average, and up to 15% for applications that strongly suffer from conflicting access to shared data, while reducing network traffic and power consumption with respect to D-NUCA caches. Besides, it outperforms different S-NUCA schemes optimized with victim replication. ACM Reference Format:Pierfrancesco Foglia and Marco Solinas. 2014. Exploiting replication to improve performances of NUCAbased CMP systems.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Energy Consumption Evaluationmentioning

confidence: 99%