Proceedings of the 2018 International Symposium on Code Generation and Optimization 2018
DOI: 10.1145/3168831
|View full text |Cite
|
Sign up to set email alerts
|

CUDAAdvisor: LLVM-based runtime profiling for modern GPUs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4
1

Relationship

2
8

Authors

Journals

citations
Cited by 33 publications
(5 citation statements)
references
References 36 publications
0
4
0
Order By: Relevance
“…This is especially desired when porting traditional CPU-based HPC applications onto the new GPU-based exascale systems, such as Summit [6], Sierra [7] and Perlmutter [37]. As part of the community effort, we are planning to pursue these research directions in our future work with our past experience on GPU analytic modeling [38], [39], [40] and performance optimization [41], [42], [43], [44], [45], [46], [47], [48], [49], [50], [51].…”
Section: Discussionmentioning
confidence: 99%
“…This is especially desired when porting traditional CPU-based HPC applications onto the new GPU-based exascale systems, such as Summit [6], Sierra [7] and Perlmutter [37]. As part of the community effort, we are planning to pursue these research directions in our future work with our past experience on GPU analytic modeling [38], [39], [40] and performance optimization [41], [42], [43], [44], [45], [46], [47], [48], [49], [50], [51].…”
Section: Discussionmentioning
confidence: 99%
“…In addition, the overhead of obtaining these indicators is usually high. The second approach is based on code instrumentation, such as CUDAAdvisor [55], to measure statistics about the control flow. Although the code instrumentation has little performance effect than the former, it changes the actual behavior of GPU kernels and requires extra effort made by developers and the system administrator in code changes and maintenance.…”
Section: Discussion and Future Workmentioning
confidence: 99%
“…Based on the stall analysis, it identifies inefficient software-hardware interactions and their root causes, thus helping make informed optimization decisions. CUDAAdvisor [37], built on top of LLVM, instrumentalizes application code on both the host and device sides. It conducts code-and data-centric profiling to identify performance bottlenecks arising from competition for cache resources and memory and control flow divergence.…”
Section: Related Workmentioning
confidence: 99%