First-level (L1) caches have traditionally been implemented with Static Random-Access Memory (SRAM), since it is the fastest memory technology and L1 caches must meet tight timing constraints in the processor pipeline. However, one of the main downsides of SRAM is its low density, which prevents L1 caches from scaling their storage capacity beyond a few tens of KB. In contrast, the recent Domain Wall Memory (DWM) technology overcomes this constraint by arranging multiple bits in a magnetic racetrack that share a read/write head to access them. Accessing a bit requires a shift operation to align the target bit under the head. These shifts increase the overall access latency, which is the main reason why DWM has mostly been used to implement slow last-level caches.

This paper proposes a novel DWM-based L1 cache data array design, namely Fast-Track Cache (FTC), that enables L1 caches with larger storage capacities while reducing the shift overhead by better exploiting spatial and temporal locality.

Experimental results show that most FTC accesses do not require shifts. As a consequence, and thanks to its larger capacity, FTC improves processor performance on average by 15% over both a conventional SRAM memory subsystem and the state-of-the-art DWM-based TapeCache architecture. At the same time, energy consumption is reduced on average by 34% with respect to the conventional design.
CCS CONCEPTS
• Hardware → Spintronics and magnetic technologies; Memory and dense storage.
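To make the shift overhead mentioned in the abstract concrete, the following minimal sketch (not taken from the paper) models a single racetrack whose read/write head stays wherever the last access left it. The track length, the racetrack_t type, and the access_bit helper are illustrative assumptions rather than the FTC design; the sketch only shows why spatially local accesses need few shifts while a distant access pays the full shift distance.

/*
 * Minimal sketch (assumptions, not the paper's design): one DWM racetrack
 * with a single shared read/write head that remains at the position of the
 * last access.
 */
#include <stdio.h>
#include <stdlib.h>

#define TRACK_BITS 64        /* assumed number of bits per racetrack */

typedef struct {
    int head_pos;            /* bit index currently aligned under the head */
} racetrack_t;

/* Number of shift operations needed to align 'bit' under the head. */
static int access_bit(racetrack_t *rt, int bit)
{
    int shifts = abs(bit - rt->head_pos);
    rt->head_pos = bit;      /* head stays at the last accessed position */
    return shifts;
}

int main(void)
{
    racetrack_t rt = { .head_pos = 0 };
    int total = 0;

    /* Sequential (spatially local) accesses: at most one shift each. */
    for (int bit = 0; bit < 8; bit++)
        total += access_bit(&rt, bit);
    printf("sequential accesses to bits 0-7: %d shifts\n", total);

    /* A distant access pays the full shift distance. */
    printf("access to bit %d: %d shifts\n",
           TRACK_BITS - 1, access_bit(&rt, TRACK_BITS - 1));
    return 0;
}

Under these assumptions the eight sequential accesses cost 7 shifts in total, whereas the single distant access costs 56, which illustrates why exploiting locality to avoid shifts is the key to using DWM at the L1 level.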