2015 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES) 2015
DOI: 10.1109/cases.2015.7324558
|View full text |Cite
|
Sign up to set email alerts
|

Reducing shift penalty in Domain Wall Memory through register locality

Abstract: General-purpose graphics processing units (GPGPUs) have the ability to execute hundreds to thousands of threads simultaneously. Extreme multithreading requires a large register file to hold state of executing threads and facilitate context switching. As feature size reduces, power consumption in the large register file becomes a major concern.In this work, we exploit Domain Wall Memory (DWM) which is a spin-based memory to reduce power consumption in register file. DWM is a promising technology and offers non-… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 25 publications
(11 citation statements)
references
References 21 publications
0
11
0
Order By: Relevance
“…Literature suggests that the most established technique to improve RTM performance without increasing the number of access ports is preshifting [71], [87], [88]. The concept of preshifting is analogous to prefetching which consists in fetching the data of the next likely accessed element in advance.…”
Section: A Hardware Techniques For Minimizing Shiftsmentioning
confidence: 99%
“…Literature suggests that the most established technique to improve RTM performance without increasing the number of access ports is preshifting [71], [87], [88]. The concept of preshifting is analogous to prefetching which consists in fetching the data of the next likely accessed element in advance.…”
Section: A Hardware Techniques For Minimizing Shiftsmentioning
confidence: 99%
“…Many techniques have been proposed in the past to mitigate the negative impact of RTM shift overhead. These include data compression [22], reconfigurability of RTM in terms of deactivating (or activating) rarely (or highly) used domains, runtime data swapping [20], proactively aligning the likely accessed domains to the port positions [1], [12], [20], [21], and intelligent instruction [16] and data placement [2], [5], [7], [8], [11]. Among these proposals, data placement has demonstrated significant benefits with trivial or no overheads.…”
Section: Overall Performance and Energy Analysismentioning
confidence: 99%
“…Sun et al proposed swapping highly accessed DWs with those closer to the access port(s) [47]. Atoofian proposed a predictor-based proactive shifting by exploiting register locality [1]. Likewise, proactive shifting is performed on the data items waiting in the queue [30].…”
Section: Related Workmentioning
confidence: 99%