2014
DOI: 10.1145/2629681
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Power Gating of SIMD Accelerators Through Dynamic Selective Devectorization in an HW/SW Codesigned Environment

Abstract: Leakage energy is a growing concern in current and future microprocessors. Functional units of microprocessors are responsible for a major fraction of this energy. Therefore, reducing functional unit leakage has received much attention in recent years. Power gating is one of the most widely used techniques to minimize leakage energy. Power gating turns off the functional units during the idle periods to reduce the leakage. Therefore, the amount of leakage energy savings is directly proportional to the idle tim… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 25 publications
0
4
0
Order By: Relevance
“…Abdel-Majeed et al enlarges the idle period of functional units in a GPU through dynamic instruction issue policies [29]. Kumar et al proposed algorithms to power off parts of vector lanes in an SIMD accelerator [30]. Aghilinasab et al proposed algorithms to power off vector functional units in a GPU [31].…”
Section: B Related Work: Compiler-directed Leakage Power Controlmentioning
confidence: 99%
“…Abdel-Majeed et al enlarges the idle period of functional units in a GPU through dynamic instruction issue policies [29]. Kumar et al proposed algorithms to power off parts of vector lanes in an SIMD accelerator [30]. Aghilinasab et al proposed algorithms to power off vector functional units in a GPU [31].…”
Section: B Related Work: Compiler-directed Leakage Power Controlmentioning
confidence: 99%
“…Kumar et al [21] use such an approach to improve the efficiency of power-gating the processor's SIMD unit. In this scenario, devectorizing parts of the program reduces the speedup caused by SIMD instructions, but reduces the power-gating overhead.…”
Section: Profile-guided Software Modificationsmentioning
confidence: 99%
“…In these works, dynamic configurability enables lane resource to execute as a traditional SIMD processor, be re-purposed to behave as a clustered VLIW processor, or combinations of both. On the other hand, the work presented in [17] uses power gating for turning off the lanes of the SIMD function units in the case of low DLP and the corresponding portion of the code is devectorized to keep the higher lanes off. The devectorized code is executed on the lowest SIMD lane.…”
Section: Related Workmentioning
confidence: 99%