2015
DOI: 10.1007/978-3-319-20086-6_11
|View full text |Cite
|
Sign up to set email alerts
|

Scheduling MapReduce Jobs and Data Shuffle on Unrelated Processors

Abstract: We propose constant approximation algorithms for generalizations of the Flexible Flow Shop (FFS) problem which form a realistic model for non-preemptive scheduling in MapReduce systems. Our results concern the minimization of the total weighted completion time of a set of MapReduce jobs on unrelated processors and improve substantially on the model proposed by Moseley et al. (SPAA 2011) in two directions. First, we consider each job consisting of multiple Map and Reduce tasks, as this is the key idea behind M… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
16
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(16 citation statements)
references
References 17 publications
0
16
0
Order By: Relevance
“…For single-round MapReduce scheduling, we obtain an 8-approximation algorithm. Our algorithm improves on the 12-approximation algorithm of [18,8], while refines their idea (of merging independent schedules of only map and only reduce tasks on their corresponding sets of processors into a single schedule) by applying a 2-approximation algorithm similar to that in [5, Lemma 6.1]. Note that [5] considers a set of orders of jobs, instead of jobs consisting of tasks, and the completion time of each order is specified by the completion of the job that finishes last.…”
Section: The Single-round Casementioning
confidence: 99%
See 4 more Smart Citations
“…For single-round MapReduce scheduling, we obtain an 8-approximation algorithm. Our algorithm improves on the 12-approximation algorithm of [18,8], while refines their idea (of merging independent schedules of only map and only reduce tasks on their corresponding sets of processors into a single schedule) by applying a 2-approximation algorithm similar to that in [5, Lemma 6.1]. Note that [5] considers a set of orders of jobs, instead of jobs consisting of tasks, and the completion time of each order is specified by the completion of the job that finishes last.…”
Section: The Single-round Casementioning
confidence: 99%
“…First, in terms of modeling the MapReduce scheduling process: (i) We consider the practical scenario of multiround multi-task MapReduce jobs and capture their task dependencies, and (ii) we study both identical and unrelated processors, thus dealing with data locality. Second, in terms of algorithm design and analysis: (i) We propose an algorithmic framework for the multi-round MapReduce scheduling problem with proven performance guarantees, distinguishing between the case of indistinguishable and disjoint (map and reduce) sets of identical or unrelated processors, and (ii) our algorithms are based on natural LP relaxations of the problem and improve on the approximation ratios achieved in previous work [18,8]. Third, in terms of experimental analysis, we focus on the most general case of unrelated processors and show that our algorithms have an excellent performance in practice.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations