2015
DOI: 10.1002/cpe.3555
|View full text |Cite
|
Sign up to set email alerts
|

Faithful performance prediction of a dynamic task‐based runtime system for heterogeneous multi‐core architectures

Abstract: SUMMARYMulti-core architectures comprising several graphics processing units (GPUs) have become mainstream in the field of high-performance computing. However, obtaining the maximum performance of such heterogeneous machines is challenging as it requires to carefully off-load computations and manage data movements between the different processing units. The most promising and successful approaches so far build on task-based runtimes that abstract the machine and rely on opportunistic scheduling algorithms. As … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
31
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
3
3

Relationship

3
3

Authors

Journals

citations
Cited by 29 publications
(36 citation statements)
references
References 22 publications
(30 reference statements)
0
31
0
Order By: Relevance
“…Our methodology is configured with a faithful model of the computation and communication signatures of Ondes3D for representative workloads on scale. All performance modeling is carried out with SimGrid's SMPI emulation and trace replay mechanisms, which have received extensive validation efforts . We show that our methodology is faithful, in terms of total makespan as well as from the load balancing perspective observed in real AMPI runs.…”
Section: Introductionmentioning
confidence: 95%
“…Our methodology is configured with a faithful model of the computation and communication signatures of Ondes3D for representative workloads on scale. All performance modeling is carried out with SimGrid's SMPI emulation and trace replay mechanisms, which have received extensive validation efforts . We show that our methodology is faithful, in terms of total makespan as well as from the load balancing perspective observed in real AMPI runs.…”
Section: Introductionmentioning
confidence: 95%
“…Stanisic et al [9] simulate tasks' execution in order to isolate the scheduler's effect on performance from tasks' unpredictable behavior. In our work we execute the entire application on real hardware and we characterize the interaction between the scheduler and the tasks, which triggers online scheduling decisions and performance variation.…”
Section: Related Workmentioning
confidence: 99%
“…Typical approaches to optimizing scheduling algorithms consist of either providing an interactive visualization of the execution trace [1,5] or simulating the tasks execution to evaluate the overall scheduling policy in a controlled environment [4,9]. A developer then has to analyze the resulting profiling information and deduce if the scheduler behaves as expected, and qualitatively compare different schedulers.…”
Section: Introductionmentioning
confidence: 99%
“…Eventually, four papers were accepted for publication. They cover the following four Euro-Par topics: Performance Prediction and Evaluation, Distributed Systems and Algorithms, Theory and Algorithms for Parallel Computation and High-Performance and Scientific Applications.Topic 2 on Performance Prediction and Evaluation contributes the paper Performance prediction of dynamic task-based runtime system for heterogeneous multi-core architectures authored by Luka Stanisic, Samuel Thibault, Arnaud Legrand, Brice Videau and Jean-François Méhaut [1]. They address the challenge of deciding, with high accuracy and low effort, what computations to offload onto accelerators on a heterogeneous execution platform.…”
mentioning
confidence: 99%
“…Topic 2 on Performance Prediction and Evaluation contributes the paper Performance prediction of dynamic task-based runtime system for heterogeneous multi-core architectures authored by Luka Stanisic, Samuel Thibault, Arnaud Legrand, Brice Videau and Jean-François Méhaut [1]. They address the challenge of deciding, with high accuracy and low effort, what computations to offload onto accelerators on a heterogeneous execution platform.…”
mentioning
confidence: 99%