2018 30th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) 2018
DOI: 10.1109/cahpc.2018.8645848
|View full text |Cite
|
Sign up to set email alerts
|

Automated GPU Grid Geometry Selection for OPENMP Kernels

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 11 publications
0
2
0
Order By: Relevance
“…For some programs that are not sensitive to register resources 15 , although optimization can also reduce the number of registers, its performance changes are not obvious. At the same time, both optimizations are for the case where the number of loop iterations is known.…”
Section: Future Workmentioning
confidence: 99%
“…For some programs that are not sensitive to register resources 15 , although optimization can also reduce the number of registers, its performance changes are not obvious. At the same time, both optimizations are for the case where the number of loop iterations is known.…”
Section: Future Workmentioning
confidence: 99%
“…For example, when applying the analysis to calculate the inter-thread access stride of a GPU parallel loop, a number of iterations equal to the GPU thread-block size is tested. Existing OpenMP GPU runtimes select fixed-sized thread-block sizes based on the target GPU architecture (e.g., 128 for Pascal [17]).…”
Section: Loop Iteration Point Algebraic Differencesmentioning
confidence: 99%