2016 International Conference on High Performance Computing &Amp; Simulation (HPCS) 2016
DOI: 10.1109/hpcsim.2016.7568315
|View full text |Cite
|
Sign up to set email alerts
|

Efficient control flow restructuring for GPUs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
2
2

Relationship

3
5

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 21 publications
0
7
0
Order By: Relevance
“…As discussed above, branch divergence takes both costs of the true and false sides of a divergent branch to execute it, so that it decreases execution efficiency of GPU programs. Several methods have been proposed to reduce branch divergence and improve execution efficiency [19], [20], [21], [22], [23], [24], [25]. On the other hand, some traditional optimizations may increase branch divergence and decrease execution efficiency.…”
Section: Branch Divergencementioning
confidence: 99%
See 1 more Smart Citation
“…As discussed above, branch divergence takes both costs of the true and false sides of a divergent branch to execute it, so that it decreases execution efficiency of GPU programs. Several methods have been proposed to reduce branch divergence and improve execution efficiency [19], [20], [21], [22], [23], [24], [25]. On the other hand, some traditional optimizations may increase branch divergence and decrease execution efficiency.…”
Section: Branch Divergencementioning
confidence: 99%
“…In addition, it does not duplicate code, so that it incurs only a linear increase in the number of basic blocks. Reissmann et al [24] proposed control flow restructuring technique that consists of loop restructuring and branch restructuring. Loop restructuring converts all loops to tail-controlled loops, and branch restructuring ensures proper nesting of control flow.…”
Section: Related Workmentioning
confidence: 99%
“…nodes in -regions, i.e., control flow equivalent to if-then-else, switch, and do-while statements, while PCFR can recover arbitrary complex control flow, i.e., control flow that is not restricted to RVSDG constructs. PCFR reduces the number of static branches in the resulting control flow [3], but might also result in undesirable control flow for certain architectures, such as graphic processing units [36]. For the sake of brevity, we omit a discussion of SCFR and PCFR as the algorithms are extensively described by Bahmann et al [3].…”
Section: Intra-procedural Control Flowmentioning
confidence: 99%
“…SCFR recovers only control flow that resembles the structural nodes in λ-regions, i.e., control flow equivalent to if-then-else, switch, and do-while statements, while PCFR can recover arbitrary complex control flow, i.e., control flow that is not restricted to RVSDG constructs. PCFR reduces the number of static branches in the resulting control flow [3], but might also result in undesirable control flow for certain architectures, such as graphic processing units [33]. For the sake of brevity, we omit a discussion of SCFR and PCFR as the algorithms are extensively described by Bahmann et al [3].…”
Section: Intra-procedural Control Flow Recoverymentioning
confidence: 99%