Proceedings of the 40th Annual International Symposium on Computer Architecture 2013
DOI: 10.1145/2485922.2485934
|View full text |Cite
|
Sign up to set email alerts
|

Microarchitectural mechanisms to exploit value structure in SIMT architectures

Abstract: SIMT architectures improve performance and efficiency by exploiting control and memory-access structure across data-parallel threads. Value structure occurs when multiple threads operate on values that can be compactly encoded, e.g., by using a simple function of the thread index. We characterize the availability of control, memory-access, and value structure in typical kernels and observe ample amounts of value structure that is largely ignored by current SIMT architectures. We propose three microarchitectura… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 25 publications
(5 citation statements)
references
References 23 publications
0
5
0
Order By: Relevance
“…Several researchers have proposed microarchitectural techniques to take advantage of value locality by reusing results from a result cache [54,60] and designing value predictors to enable speculative execution [30,45,46]. Researchers have studied value locality or redundancy in GPGPU compute kernels to detect scalar variables [9,31,42], affine computations [42], and value similarity [63] across threads of SIMT warps, thread-block (TB) dimensionality dependent redundancy across threads of TBs [64], and have proposed hardware-only or hardware-software techniques to gainfully exploit such value locality.…”
Section: Related Workmentioning
confidence: 99%
“…Several researchers have proposed microarchitectural techniques to take advantage of value locality by reusing results from a result cache [54,60] and designing value predictors to enable speculative execution [30,45,46]. Researchers have studied value locality or redundancy in GPGPU compute kernels to detect scalar variables [9,31,42], affine computations [42], and value similarity [63] across threads of SIMT warps, thread-block (TB) dimensionality dependent redundancy across threads of TBs [64], and have proposed hardware-only or hardware-software techniques to gainfully exploit such value locality.…”
Section: Related Workmentioning
confidence: 99%
“…Scalar register file [23] eliminates redundant power consumption in that case by storing the thread register value of only one SIMT lane shared across all lanes. Prior works [16,17,30,33] have also used the notion of value similarity. Value structure [16] is exploited for improving memory storage efficiency [17] and power efficient computations [30].…”
Section: Related Workmentioning
confidence: 99%
“…Prior works [16,17,30,33] have also used the notion of value similarity. Value structure [16] is exploited for improving memory storage efficiency [17] and power efficient computations [30]. However, they did not exploit value structure for improving register file efficiency, which is the primary focus of this paper.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations