2015
DOI: 10.1145/2811402

Spatiotemporal SIMT and Scalarization for Improving GPU Efficiency

Abstract: Temporal SIMT (TSIMT) has been suggested as an alternative to conventional (spatial) SIMT for improving GPU performance on branch-intensive code. Although TSIMT has been briefly mentioned before, it was not evaluated. Therefore, we present a complete design and evaluation of TSIMT GPUs, along with the inclusion of scalarization and a combination of temporal and spatial SIMT, named Spatio-Temporal SIMT (STSIMT). Simulations show that TSIMT alone results in a performance reduction but a combination of Scalarizatio…

Cited by 5 publications (4 citation statements)
References 31 publications
“…The analysis shows the maximum, average and minimum PSNR losses (red, green, and blue lines, respectively), alongside the percentile of intra pixels (purple line) for each decoded B-frame. The first important information from these plots is that the PSNR losses within an intra period resemble a negative quadratic function, with highest loss values being achieved in the middle frame (16) of the intra period between GoPs.…”
Section: Results For Random Access Configuration (mentioning)
confidence: 99%
“…In this sense, approximate storage is a promising paradigm for improving access time, energy consumption, memory cell density and lifetime, among others [10] [13] [14] [15]. It can be implemented through a variety of techniques based on the target memory technology, such as reducing the refresh rate in DRAM modules [16][17][18], increasing STT-RAM's read pulse current magnitude [10,19], reducing the SET/RESET iteration number [10] of PCM memories, among others. Such gains come at the cost of reduced reliability in the stored, read and/or written data, by subjecting it to the possibility of errors and, consequently, to the loss of quality in the results obtained from the target application, producing less accurate or lower quality results.…”
Section: Introduction (mentioning)
confidence: 99%
“…Since the threads in a sub-warp are executed by all the SIMD lanes in synchrony, it is also called a spatial sub-warp. On the other hand, those warp threads that are executed by the same SIMD lane in consecutive execution round form a temporal sub-warp [24]. Thus, the SIMT co-processor introduces a third component, a spatial sub-warp context dispatcher, which prepares the current CPU thread ID, warp instruction, and PC for a spatial sub-warp execution, and as a result runs a warp in the consecutive execution of spatial sub-warps.…”
Section: Dual Mode CPU Architecture (mentioning)
confidence: 99%
“…One of these levels should be an operation considered free from errors. Several proposed architectures [23,70,75,116] separate memory regions by reliability levels to isolate some data from errors. AxRAM is compatible with the architectural model of Truffle [23], but without the usage of ISA extensions, which would demand changes in some level of the application.…”
Section: Architectural Model (mentioning)
confidence: 99%
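
Illustrative note on the citation statement about spatial and temporal sub-warps: the distinction can be sketched as two ways of partitioning a warp's thread IDs. The sketch below is not taken from the cited papers. It assumes a hypothetical round-robin mapping in which thread `t` runs on lane `t % num_lanes` during round `t // num_lanes`; the function names and the sizes used are likewise assumptions chosen for illustration.

```python
# Hypothetical sketch: the thread-to-lane mapping below is an assumption,
# not the design described in the cited work.

def spatial_sub_warps(warp_size, num_lanes):
    """Group threads that execute together, one per SIMD lane, in the same round."""
    return [
        list(range(start, min(start + num_lanes, warp_size)))
        for start in range(0, warp_size, num_lanes)
    ]

def temporal_sub_warps(warp_size, num_lanes):
    """Group threads that execute on the same SIMD lane in consecutive rounds."""
    return [list(range(lane, warp_size, num_lanes)) for lane in range(num_lanes)]

if __name__ == "__main__":
    # Assumed sizes: an 8-thread warp on 4 lanes needs 2 execution rounds.
    print(spatial_sub_warps(8, 4))   # one list per round
    print(temporal_sub_warps(8, 4))  # one list per lane
```

Under this assumed mapping, each spatial sub-warp is one row of the round-by-lane grid and each temporal sub-warp is one column, so running the spatial sub-warps one after another covers the whole warp.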