2015
DOI: 10.1002/cpe.3723
|View full text |Cite
|
Sign up to set email alerts
|

Task‐based FMM for heterogeneous architectures

Abstract: International audienceHigh performance fast multipole method is crucial for the numerical simulation of many physical problems. In a previous study, we have shown that task-based fast multipole method provides the flexibility required to process a wide spectrum of particle distributions efficiently on multicore architectures. In this paper, we now show how such an approach can be extended to fully exploit heterogeneous platforms. For that, we design highly tuned graphics processing unit (GPU) versions of the t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
42
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 38 publications
(42 citation statements)
references
References 42 publications
(60 reference statements)
0
42
0
Order By: Relevance
“…H etero P rio has been proposed in the context of task‐based runtime systems responsible for allocating tasks onto heterogeneous nodes typically consisting of a few CPUs and GPUs …”
Section: Presentation Of Heteropriomentioning
confidence: 99%
See 1 more Smart Citation
“…H etero P rio has been proposed in the context of task‐based runtime systems responsible for allocating tasks onto heterogeneous nodes typically consisting of a few CPUs and GPUs …”
Section: Presentation Of Heteropriomentioning
confidence: 99%
“…HETEROPRIO has been proposed in the context of task-based runtime systems responsible for allocating tasks onto heterogeneous nodes typically consisting of a few CPUs and GPUs. 17 Historically, most systems use scheduling strategies inspired by the well-known HEFT algorithm: tasks are ordered by priorities (which are computed offline) and the highest priority ready task is allocated onto the resource that is expected to complete it first, given the expected transfer times of its input data and the expected processing time of this task on this resource. These systems have shown some limits 13 in strongly heterogeneous and unrelated systems, which is the typical case of nodes consisting of both CPUs and GPUs.…”
Section: Affinity-based Schedulingmentioning
confidence: 99%
“…By providing the abstraction of its programming layer, OPENMP makes it possible to get the best of both worlds. In the future, we intend to explore the coupling of OPENMP with task-based runtime systems further, in particular focusing on the support for heterogeneous CPU+accelerator platforms that could be illustrated with a task-based FMM for heterogeneous machines [26].…”
Section: Resultsmentioning
confidence: 99%
“…Most of these tools support a core part of task-based RS, such as creating a graph of tasks (even if it is implemented differently) where tasks can read or write data. However, scheduling is an important factor in the performance (Agullo et al, 2016a), and few of these RS's propose a way to create a scheduler easily without having to go inside the RS's code. Moreover, specific features provide mechanisms to increase the degree of parallelism.…”
Section: Task-based Parallelizationmentioning
confidence: 99%