Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC 2015
DOI: 10.1145/2833157.2833161
|View full text |Cite
|
Sign up to set email alerts
|

Integrating GPU support for OpenMP offloading directives into Clang

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
16
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 42 publications
(16 citation statements)
references
References 7 publications
0
16
0
Order By: Relevance
“…The introduction of such features into this prominent open standard demonstrates that there is an ever-increasing acceptance that such architectures will become a permanent feature in modern supercomputing. Although the specification has been in existence since the middle of 2013, compiler support for the heterogeneous features has been limited to a number of experimental open source implementations until more recently [3], [4], [5]. Until now, the principal use of OpenMP 4.0 has been for targeting the Intel Xeon Phi Knights Corner (KNC) architecture, but future releases of the Intel Xeon Phi architecture, such as the Knights Landing, are going to self-host, removing the requirement for an offloading model.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The introduction of such features into this prominent open standard demonstrates that there is an ever-increasing acceptance that such architectures will become a permanent feature in modern supercomputing. Although the specification has been in existence since the middle of 2013, compiler support for the heterogeneous features has been limited to a number of experimental open source implementations until more recently [3], [4], [5]. Until now, the principal use of OpenMP 4.0 has been for targeting the Intel Xeon Phi Knights Corner (KNC) architecture, but future releases of the Intel Xeon Phi architecture, such as the Knights Landing, are going to self-host, removing the requirement for an offloading model.…”
Section: Introductionmentioning
confidence: 99%
“…Ozen et al [4] partially implemented OpenMP 4.0 in the OmpSs compiler and performed a performance evaluation with three kernels. Bertolli et al [3] and Bercea et al [18] implemented GPU support for Clang using the OpenMP 4.0 specification, and presented performance results for a representative set of kernels in LULESH.…”
mentioning
confidence: 99%
“…Bertolli et al [3] discuss the coordination of threads within an NVIDIA GPU, and show that their novel approach limits the impact on code generation when integrated into the LLVM compiler infrastructure. They later discussed their approach to integrating OpenMP 4.5 offloading for NVIDIA GPUs into Clang [2].…”
Section: Concluding Suggestions For Performance Portabilitymentioning
confidence: 99%
“…Some experimental compilers were developed in the interim, with the most notable being the Clang OpenMP 4.5 project, which was contributed to by a number of collaborators, including AMD, IBM, Intel, and NVIDIA. In particular, the GPU targeting functionality was developed by IBM, who are actively migrating this functionality into the main trunk of Clang [2]. In September 2015, the Cray Compiling Environment version 8.4 introduced the first official vendor support for OpenMP 4.0 on NVIDIA GPUs, with full support for version 4.0 of the specification.…”
Section: Introductionmentioning
confidence: 99%
“…e existing support for OpenMP target regions is built on top of the host implementation of OpenMP and is confined, almost exclusively, to the Clang frontend code generation module. e most recent code generation scheme for OpenMP target regions is detailed in [5] and is based on previous work [1,3,4] covering data-parallel cases [2,6] as well as nested parallelism [5].…”
Section: Introductionmentioning
confidence: 99%