2017
DOI: 10.1016/j.jss.2017.09.001
|View full text |Cite
|
Sign up to set email alerts
|

Task Scheduling in Big Data Platforms: A Systematic Literature Review

Abstract: Context: Hadoop, Spark, Storm, and Mesos are very well known frameworks in both research and industrial communities that allow expressing and processing distributed computations on massive amounts of data. Multiple scheduling algorithms have been proposed to ensure that short interactive jobs, large batch jobs, and guaranteed-capacity production jobs running on these frameworks can deliver results quickly while maintaining a high throughput. However, only a few works have examined the effectiveness of these al… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
25
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 40 publications
(25 citation statements)
references
References 55 publications
(65 reference statements)
0
25
0
Order By: Relevance
“…This paper presents a systematic literature review (SLR) on the current state of research associated with big data technologies in manufacturing [37]. To apply big data technologies in manufacturing successfully, it is essential to systematically review the literature of big data technologies in manufacturing from the following three perspectives: manufacturing data, big data technologies and data applications in manufacturing.…”
Section: Methodsmentioning
confidence: 99%
“…This paper presents a systematic literature review (SLR) on the current state of research associated with big data technologies in manufacturing [37]. To apply big data technologies in manufacturing successfully, it is essential to systematically review the literature of big data technologies in manufacturing from the following three perspectives: manufacturing data, big data technologies and data applications in manufacturing.…”
Section: Methodsmentioning
confidence: 99%
“…It is a necessity for an efficient load balancing system to use optimized scheduling algorithms 14 . Soualhia et al 15 have done strong research on task scheduling in the Big Data platform. In this research, they mention that multiple jobs and tasks with different characteristics and different resource demands that are received by the scheduler in the big data cloud platforms and cause an imbalanced load in the system.…”
Section: Related Workmentioning
confidence: 99%
“…Reduce phase: In this phase, the reducer job is to process the input data that comes from the mapper by analyzing and merging it to produce the final output which is written to the HDFS in the cluster .Some other programming models such as Spark [44,45] and DataMPI [46] are competing with MapReduce. Table 3 summarizes the big data capabilities and the available primary technologies [5].Since MapReduce is an open source with high performance which is used by many big companies for processing batch jobs [47,48].…”
Section: Hadoop Mapreduce (Programming Paradigm)mentioning
confidence: 99%