2015 14th International Symposium on Parallel and Distributed Computing 2015
DOI: 10.1109/ispdc.2015.12
|View full text |Cite
|
Sign up to set email alerts
|

BOLAS: Bipartite-Graph Oriented Locality-Aware Scheduling for MapReduce Tasks

Abstract: Task scheduling is critical to reduce the makespan of MapReduce jobs. It is an effective approach for scheduling optimization by improving the data locality, which involves attempting to locate a task and its related data block on the same node. However, recent studies have been insufficient in addressing the locality issue. This paper proposes BOLAS, a MapReduce task scheduling algorithm, which models the scheduling process as a bipartite-graph matching problem trying best to assign data block to the nearest … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 15 publications
(7 citation statements)
references
References 26 publications
0
6
0
Order By: Relevance
“…Balancing the load of the system should be performed to improve the performance, and the main criteria in improving the performance are execution time and resource utilization. Their research results show that the efficient solution for having an optimized load balancing system is a good slot allocation strategy that allows a balanced workload 16–18 . It is the concern that we follow in this article.…”
Section: Related Workmentioning
confidence: 98%
See 1 more Smart Citation
“…Balancing the load of the system should be performed to improve the performance, and the main criteria in improving the performance are execution time and resource utilization. Their research results show that the efficient solution for having an optimized load balancing system is a good slot allocation strategy that allows a balanced workload 16–18 . It is the concern that we follow in this article.…”
Section: Related Workmentioning
confidence: 98%
“…Their research results show that the efficient solution for having an optimized load balancing system is a good slot allocation strategy that allows a balanced workload. [16][17][18] It is the concern that we follow in this article. Our solution is not dependent on the scheduling algorithm; the load portion's accurate calculation is the primary concern.…”
Section: Related Workmentioning
confidence: 99%
“…However, this work uses flow network, rather than more popular topology network to model data placement. Work in Reference 55 proposes BOLAS, a task scheduling algorithm, which models the scheduling process as a bipartite‐graph matching problem to assign data block to the nearest task. BOLAS solves the model using Kuhn‐Munkres (KM) optimal matching algorithm.…”
Section: Related Workmentioning
confidence: 99%
“…KM algorithm is a combinatorial optimization algorithm that solves the assignment problem in polynomial time which anticipates latest primal‐dual methods 72 . KM algorithm is widely used in weighted bipartite graph model to realize task scheduling, such as References 54,55,73. The KM algorithm 72 is an efficient way to find the maximum weight perfect matching in a weighted bipartite graph and find a good feasible labeling that remains enough edges in graph.…”
Section: A Data Locality Aware Task Schedulermentioning
confidence: 99%
“…Processing data within a requesting node for a dataintensive application represents a key factor to improve the scheduling performance in Hadoop. Many researchers (e.g., [56] [59] [60] [61]) have been working extensively to solve this problem by evaluating the impact of many factors on the data locality. This can help identify the correlation between data locality and those identified factors and hence schedule tasks on the processing nodes as close as possible to their input data.…”
Section: Data Locality-aware Schedulingmentioning
confidence: 99%