Proceedings of the 48th International Symposium on Microarchitecture 2015
DOI: 10.1145/2830772.2830784
|View full text |Cite
|
Sign up to set email alerts
|

GPU register file virtualization

Abstract: To support massive number of parallel thread contexts, Graphics Processing Units (GPUs) use a huge register file, which is responsible for a large fraction of GPU's total power and area. The conventional belief is that a large register file is inevitable for accommodating more parallel thread contexts, and technology scaling makes it feasible to incorporate ever increasing size of register file. In this paper, we demonstrate that the register file size need not be large to accommodate more threads context. We … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
53
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 76 publications
(54 citation statements)
references
References 40 publications
(49 reference statements)
0
53
0
1
Order By: Relevance
“…In HAWS execution, when instructions retire, we use register renaming to avoid WARs and WAWs when storing results, as necessary. Registers files in the GPU are underutilized in many applications, which has been discussed in many recent studies [7,8,16]. Based on these studies and our evaluation, we propose to use the underutilized registers for register renaming.…”
Section: Hint Formatmentioning
confidence: 95%
“…In HAWS execution, when instructions retire, we use register renaming to avoid WARs and WAWs when storing results, as necessary. Registers files in the GPU are underutilized in many applications, which has been discussed in many recent studies [7,8,16]. Based on these studies and our evaluation, we propose to use the underutilized registers for register renaming.…”
Section: Hint Formatmentioning
confidence: 95%
“…In this section we evaluate how the register widths affect what needs to be stored in the register file at any given time during the lifetime of the kernel. Studying the amount that register pressure can be reduced at all instructions (and not just the maximum register pressure) can be interesting for, for instance, power optimizations such as shutting down unused banks [12] or for performance optimizations such as sharing unused physical registers [7].…”
Section: Per-instruction Register Pressurementioning
confidence: 99%
“…GPU register file optimizations. Register file optimization techniques targeting GPUs include a method which compresses the register file data using BDI [12], virtualization of the register file [7], and a partitioned register file that places rarely accessed registers in a lowenergy, high-latency register bank [1]. All of these approaches target power efficiency rather than performance efficiency, and none of them exploit approximations.…”
Section: Related Workmentioning
confidence: 99%
“…Jeon et al presented a register file virtualization technique to reduce the physical area as well as power consumption of architected register files [5]. In the second category, Liang et al proposed variable latency register files to mitigate the effects of process variation in CPUs [9].…”
Section: Introductionmentioning
confidence: 99%
“…A huge register file is required to maintain the latency-hiding ability. Therefore the register file becomes a potential soft target for PV [5]. The most prominent manifestation of PV in GPU register files is the variability in the access latency.…”
mentioning
confidence: 99%