2021
DOI: 10.1007/978-3-030-74224-9_2

Performance Assessment of OpenMP Compilers Targeting NVIDIA V100 GPUs

Abstract: Heterogeneous systems are becoming increasingly prevalent. In order to exploit the rich compute resources of such systems, robust programming models are needed for application developers to seamlessly migrate legacy code from today's systems to tomorrow's. Over the past decade and more, directives have been established as one of the promising paths to tackle programmatic challenges on emerging systems. This work focuses on applying and demonstrating OpenMP offloading directives on five proxy applications. We o…

Citations: cited by 14 publications (6 citation statements)
References: 16 publications
“…A significant challenge is to port existing applications to platforms with accelerators, including GPUs. The ultimate goal [39] is taking advantage of these powerful platforms without having to learn the hardware details or significantly change the application codes. Numerous programming models and environments have been developed, including CUDA [11], HIP [12], OpenCL [24], and Kokkos [40].…”
Section: Related Work (mentioning)
confidence: 99%
“…An alternative option is directive‐based models such as OpenACC [41] and OpenMP [42]. They offer [39] an abstraction layer over different hardware types with a unified interface that allows reducing the work needed to accelerate applications, requiring only some "hints" or annotations to be added for the compiler. The OpenMP model incorporated support for offloading code to accelerators from version 4.0 (released in 2013) and has upgraded various features in subsequent versions (till version 5.2).…”
Section: Related Work (mentioning)
confidence: 99%
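
As an illustration of the "hints" or annotations described in the statement above, here is a minimal sketch of an OpenMP offload directive applied to a simple loop. It is not taken from the cited papers: the function name `saxpy_offload` and its parameters are hypothetical, chosen only for this example. The combined `target teams distribute parallel for` construct and the `map` clauses are part of OpenMP 4.0 and later; whether the loop actually runs on a GPU depends on the compiler and the flags used.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not from the cited work): y[i] = a * x[i] + y[i]. */
void saxpy_offload(size_t n, float a, const float *x, float *y)
{
    assert(x != NULL && y != NULL);

    /* The directive asks the compiler to execute the loop on a target
       device. map(to:) copies x to the device; map(tofrom:) copies y to
       the device and back. If OpenMP is not enabled at compile time, the
       pragma is ignored and the loop runs serially on the host. */
    #pragma omp target teams distribute parallel for \
        map(to: x[0:n]) map(tofrom: y[0:n])
    for (size_t i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}
```

The loop body is unchanged from its serial form; only the annotation is added, which is the property the statement attributes to directive-based models.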
“…For many programming models, there are studies that evaluate these for CPUs or GPUs. In [7] the authors study OpenMP offload on NVIDIA V100 with a few mini-apps and various compilers, observe performance variations, and provide some OpenMP optimization techniques. In [8], the authors present the compute-bound mini-app miniBUDE and evaluate various programming models, including offload to GPUs.…”
Section: Related Work (mentioning)
confidence: 99%
“…The offloading model is beginning to mature as we speak based on the validation and verification findings [27], [28]. We also note that the model is being used on mini-applications [29] and other applications including Pseudo-Spectral Direct Numerical Simulation-Combined Compact Difference (PSDNS-CCD3D) [30] and Quicksilver [31] among others [32], [33], [34], [30], [35]. We chose to go with OpenACC instead of the OpenMP offloading model.…”
Section: Directive-based Programming With OpenACC (mentioning)
confidence: 99%