2012
DOI: 10.1016/j.parco.2011.10.003
|View full text |Cite
|
Sign up to set email alerts
|

DAGuE: A generic distributed DAG engine for High Performance Computing

Abstract: The frenetic development of the current architectures places a strain on the current state-of-the-art programming environments. Harnessing the full potential of such architectures has been a tremendous task for the whole scientific computing community. We present DAGuE a generic framework for architecture aware scheduling and management of micro-tasks on distributed many-core heterogeneous architectures. Applications we consider can be represented as a Direct Acyclic Graph of tasks with labeled edges designati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
54
0
1

Year Published

2012
2012
2019
2019

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 201 publications
(55 citation statements)
references
References 27 publications
0
54
0
1
Order By: Relevance
“…PaRSEC [13,15] provides a highly efficient platform for distributed task based parallel execution. An annotated user code is here first (automatically) translated into a parametrized DAG representation.…”
Section: Introductionmentioning
confidence: 99%
“…PaRSEC [13,15] provides a highly efficient platform for distributed task based parallel execution. An annotated user code is here first (automatically) translated into a parametrized DAG representation.…”
Section: Introductionmentioning
confidence: 99%
“…Figure 5A shows the distribution only for reused subgraphs for the Eager-Match detection strategy with and without inspection (EM+I and EM, respectively). Each column shows the percentage of reused subgraphs with 2 to 4 nodes ( [2,4]), 5 to 9 nodes ( [5,9]), 10 to 49 nodes ( [10,49]), 50…”
Section: Of 12mentioning
confidence: 99%
“…Effectively, the compiler performs static data flow analysis to convert an affine input serial program into a Direct Acyclic Graph (DAG), with program functions (kernels) as its nodes, and data dependency edges between kernels as its edges. Then, the run-time is responsible for addressing all DAG scheduling challenges, including background MPI data transfers between distributed resources [10].…”
Section: Compiler and Run-time Synergymentioning
confidence: 99%
“…A better solution would be to rely on a run-time system that can dynamically adapt the execution to the current hardware. DAGuE [10], which deploys dynamic micro-task scheduling, has been shown [11] to deliver portable high performance, on heterogeneous hardware for a class of regular problems, such as those occurring in linear algebra.…”
Section: Introductionmentioning
confidence: 99%