50th International Conference on Parallel Processing Workshop 2021
DOI: 10.1145/3458744.3473358
|View full text |Cite
|
Sign up to set email alerts
|

Advancing OpenMP Offload Debugging Capabilities in LLVM

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 3 publications
0
3
0
Order By: Relevance
“…In recent work we implemented loop transformation constructions introduced in OpenMP 5.1 [70,71], asynchronous offloading for OpenMP [132], efficient lowering of idiomatic OpenMP code to GPUs (under review), OpenMP-aware compiler optimizations with informative and actionable remarks for users (under review), a portable OpenMP device (=gpu) runtime written in OpenMP 5.1 (including atomic 2) partial( 4) partial( 8) partial( 16) partial (32) partial (64) partial (128) partial( 256 support) [133], a virtual GPU as debugging friendly offloading target on the host [134], improved diagnostics and execution information [135,136]. We redone the OpenMP GPU code generation in LLVM/Clang [137] to improve performance and correctness.…”
Section: Recent Progressmentioning
confidence: 99%
See 1 more Smart Citation
“…In recent work we implemented loop transformation constructions introduced in OpenMP 5.1 [70,71], asynchronous offloading for OpenMP [132], efficient lowering of idiomatic OpenMP code to GPUs (under review), OpenMP-aware compiler optimizations with informative and actionable remarks for users (under review), a portable OpenMP device (=gpu) runtime written in OpenMP 5.1 (including atomic 2) partial( 4) partial( 8) partial( 16) partial (32) partial (64) partial (128) partial( 256 support) [133], a virtual GPU as debugging friendly offloading target on the host [134], improved diagnostics and execution information [135,136]. We redone the OpenMP GPU code generation in LLVM/Clang [137] to improve performance and correctness.…”
Section: Recent Progressmentioning
confidence: 99%
“…This work was complemented by a new LLVM/OpenMP GPU device runtime that helps us further close the performance gap compared to CUDA and other kernel languages [138]. Various efforts in improving development and debugging have also been integrated into LLVM/OpenMP [135,134].…”
Section: Recent Progressmentioning
confidence: 99%
“…A recent development is the introduction of the OpenMPIR-Builder [6] to extract out the base-language independent portion of the OpenMP lowering from the one that is specific to the Clang AST. The goal is to share the implementation of the heavy lowering between Clang and the MLIR OpenMP Dialect [2], similar to how IRBuilder is used by many language front-ends and not just Clang.…”
Section: Clang Layer Architecturementioning
confidence: 99%