2015
DOI: 10.1145/2779052
|View full text |Cite
|
Sign up to set email alerts
|

Parallel Scheduling of Task Trees with Limited Memory

Abstract: Abstract:This paper investigates the execution of tree-shaped task graphs using multiple processors. Each edge of such a tree represents some large data. A task can only be executed if all input and output data fit into memory, and a data can only be removed from memory after the completion of the task that uses it as an input data. Such trees arise, for instance, in the multifrontal method of sparse matrix factorization. The peak memory needed for the processing of the entire tree depends on the execution ord… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
1

Relationship

4
3

Authors

Journals

citations
Cited by 23 publications
(24 citation statements)
references
References 23 publications
0
24
0
Order By: Relevance
“…The first heuristic, SplitSubtrees is adapted from [33], where the goal was to reduce the makespan while limiting the memory in a shared-memory environment. It creates a two-level partition with one connected component containing the root, executed first on a single processor (and called the sequential set), followed by the parallel processing of p − 1 independent subtrees.…”
Section: Two-level Heuristicmentioning
confidence: 99%
See 1 more Smart Citation
“…The first heuristic, SplitSubtrees is adapted from [33], where the goal was to reduce the makespan while limiting the memory in a shared-memory environment. It creates a two-level partition with one connected component containing the root, executed first on a single processor (and called the sequential set), followed by the parallel processing of p − 1 independent subtrees.…”
Section: Two-level Heuristicmentioning
confidence: 99%
“…In this case, we only consider the use of Step 1 directly followed by Step 3. The reference heuristic becomes SplitSubtrees, which was directly adapted from ideas from [33], resulting in a two-level split of the tree. Figure 11 reports the final makespan, after applying one heuristic of Step 1 followed by SplitAgain.…”
Section: Step 3: Reaching An Acceptable Number Of Subtreesmentioning
confidence: 99%
“…In several cases, the underlying task graph is a tree, with all dependences oriented towards the root, which notably simplifies the problem: this is the case for sparse direct solvers [13] but also in quantum chemistry computations [14]. For such trees, memory-aware parallel schedulers have been proposed [15] and the effect of processor mapping on memory consumption have recently been studied [7]. The problem of general task graphs handling large data has been identified by Ramakrishnan et al [6] who introduced clean-up jobs to reduce the memory footprint and propose some simple heuristics.…”
Section: Related Workmentioning
confidence: 99%
“…The first condition to verify is C1: Σ(p, π) ≤ Σ(p, γ). When t p is not defined, this condition directly derives from Equation (8). Consider now that t p is defined.…”
Section: Properties Of Min-cut Optimalitymentioning
confidence: 99%
“…Peak memory minimization problem has been addressed for applications whose task graphs are rooted trees [13,14,16,18]. This work aspires to be helpful in scheduling applications whose task graphs are series parallel [3,9,20], as theoretical understanding of the underlying layout problem is needed to reduce the peak memory in a parallel execution environment (see for example a previous study [8]). This paper is organized as follows.…”
Section: Introductionmentioning
confidence: 99%