2022
DOI: 10.1007/978-3-031-15922-0_1
|View full text |Cite
|
Sign up to set email alerts
|

Enhancing MPI+OpenMP Task Based Applications for Heterogeneous Architectures with GPU Support

Abstract: Heterogeneous supercomputers are widespread over HPC systems and programming efficient applications on these architectures is a challenge. Task-based programming models are a promising way to tackle this challenge. Since OpenMP 4.0 and 4.5, the target directives enable to offload pieces of code to GPUs and to express it as tasks with dependencies. Therefore, heterogeneous machines can be programmed using MPI+OpenMP(task+target) to exhibit a very high level of concurrent asynchronous operations for which data t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 20 publications
0
1
0
Order By: Relevance
“…Recent works [2][3][4] have provided solutions for interoperability issues when nesting MPI communications within OpenMP tasks [5][6][7], enabling the overlap of communications through OpenMP task scheduling. Yet, few applications have migrated towards such a task-based composition: at least two attempts led to convincing performances with the porting of the Cholesky factorization [6] and the hydrodynamics proxy-app LULESH [8]. Other attempts failed to implement functional applications due to interoperability issues [5], had to tinker application codes [7] to sequentialize communications, or added coarse barriers losing potential communication overlap [9].…”
Section: Introductionmentioning
confidence: 99%
“…Recent works [2][3][4] have provided solutions for interoperability issues when nesting MPI communications within OpenMP tasks [5][6][7], enabling the overlap of communications through OpenMP task scheduling. Yet, few applications have migrated towards such a task-based composition: at least two attempts led to convincing performances with the porting of the Cholesky factorization [6] and the hydrodynamics proxy-app LULESH [8]. Other attempts failed to implement functional applications due to interoperability issues [5], had to tinker application codes [7] to sequentialize communications, or added coarse barriers losing potential communication overlap [9].…”
Section: Introductionmentioning
confidence: 99%