2015
DOI: 10.1007/978-3-319-28448-4_11
|View full text |Cite
|
Sign up to set email alerts
|

An Eye on the Elephant in the Wild: A Performance Evaluation of Hadoop’s Schedulers Under Failures

Abstract: Abstract. Large-scale data analysis has increasingly come to rely on MapReduce and its open-source implementation Hadoop. Recently, Hadoop has not only been used for running single batch jobs but it has also been optimized to simultaneously support the execution of multiple jobs belonging to multiple concurrent users. Several schedulers (i.e., Fifo, Fair, and Capacity schedulers) have been proposed to optimize locality executions of tasks but do not consider failures, although, evidence in the literature shows… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 16 publications
0
2
0
Order By: Relevance
“…Thus, having long running tasks, despite the failure injection and detection times, is common in Hadoop clusters. Therefore, Hadoop is vulnerable to incur serious performance degradations under fail-stop failures [10,22,23]. Our results demonstrate that Chronos recovers to a correct scheduling behavior within a couple of seconds only and reduces the job completion times (i.e., improves the performance of MapReduce jobs).…”
Section: Discussionmentioning
confidence: 84%
“…Thus, having long running tasks, despite the failure injection and detection times, is common in Hadoop clusters. Therefore, Hadoop is vulnerable to incur serious performance degradations under fail-stop failures [10,22,23]. Our results demonstrate that Chronos recovers to a correct scheduling behavior within a couple of seconds only and reduces the job completion times (i.e., improves the performance of MapReduce jobs).…”
Section: Discussionmentioning
confidence: 84%
“…However, striking a harmonious balance between optimizing data placement and accommodating real-time data access patterns may present its own set of challenges. In the wider landscape of cloud computing data centers, an exploration has been conducted concerning the optimization of software architecture by integrating Hadoop and MapReduce [23]. This research offers valuable insights into the interplay between these frameworks and cloud computing.…”
Section: Related Workmentioning
confidence: 99%