2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2016
DOI: 10.1109/ipdps.2016.90
|View full text |Cite
|
Sign up to set email alerts
|

Are Static Schedules so Bad? A Case Study on Cholesky Factorization

Abstract: Our goal is to provide an analysis and comparison of static and dynamic strategies for task graph scheduling on platforms consisting of heterogeneous and unrelated resources, such as GPUs and CPUs. Static scheduling strategies, that have been used for years, suffer several weaknesses. First, it is well known that underlying optimization problems are NP-Complete, what limits the capability of finding optimal solutions to small cases. Second, parallelism inside processing nodes makes it difficult to precisely pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
47
0
1

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 43 publications
(48 citation statements)
references
References 17 publications
0
47
0
1
Order By: Relevance
“…This area bound can be extended to the case of non‐independent tasks by adding a variable s i representing the start time of each task i , and the following set of constraints: -15pti,j2.56804ptsuch that2.56804ptij,1emsi+xipi+false(1xifalse)qisj 85pti,1emsi+xipi+false(1xifalse)qiAreaBoundfalse(scriptIfalse). …”
Section: Notations and First Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…This area bound can be extended to the case of non‐independent tasks by adding a variable s i representing the start time of each task i , and the following set of constraints: -15pti,j2.56804ptsuch that2.56804ptij,1emsi+xipi+false(1xifalse)qisj 85pti,1emsi+xipi+false(1xifalse)qiAreaBoundfalse(scriptIfalse). …”
Section: Notations and First Resultsmentioning
confidence: 99%
“…This H etero P rio allocation strategy has been studied in the context of StarPU for several linear algebra kernels and it has been proven experimentally that it enables to achieve a better utilization of slow resources than other standard HEFT‐based strategies.…”
Section: Presentation Of Heteropriomentioning
confidence: 99%
“…Task processing times are randomly generated and follow a gamma distribution with expected value 15 for the CPUs and 1 for the GPUs. These values are inspired from the measures in [2,7]. Moreover, the gamma distribution has been advocated for modeling job runtimes [19,29].…”
Section: Simulationsmentioning
confidence: 99%
“…The problem has also been studied in practical situations using specific runtime environments as XKaapi [10] and StarPU [2]. The study with StarPU has been focused on a single application, the Cholesky factorization [1], and later extended to more than 2 types of processors [7].…”
Section: Low-cost Approximation Algorithms For Scheduling Independentmentioning
confidence: 99%
“…Tasks correspond to four linear algebra kernels: GEMM, SYRK, TRSM, and POTRF. Their respective processing times on a CPU are set to 170ms, 95ms, 88ms, and 33ms, and on a GPU to 5.95ms, 3.65ms, 8.11ms, and 15.6ms, which corresponds to measures [1,6] made using the Chameleon software [12].…”
Section: Methodsmentioning
confidence: 99%