2019
DOI: 10.1186/s40537-019-0253-9
|View full text |Cite
|
Sign up to set email alerts
|

HybSMRP: a hybrid scheduling algorithm in Hadoop MapReduce framework

Abstract: IntroductionDistributed and parallel processing is one of the best intelligent ways to store and compute big data [1]. Most definitions defined big data as characterized by the 3Vs: the extreme volume of data, the wide variety of data types and the velocity at which the data must be processed. MapReduce [2] is a programming model for big data processing. MapReduce programs are intrinsically parallel [3,4]. MapReduce executes the programs in two phases, map and reduce, so that each phase is defined by a functio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
20
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 26 publications
(22 citation statements)
references
References 24 publications
(30 reference statements)
0
20
0
Order By: Relevance
“…These job schedulers can be classified as to whether they consider data locality at the Map task level, data locality at the Reduce task level, or data locality at the job level (both Map and Reduce tasks) [8]. For instance, Hybrid scheduling MapReduce priority (HybSMRP) [9] was presented by Ghandomi et al in 2019 and is a hybrid scheduler that combines dynamic job priority and data localization. It determines job priority based on three parameters: running time, job size, and waiting time.…”
Section: Related Workmentioning
confidence: 99%
“…These job schedulers can be classified as to whether they consider data locality at the Map task level, data locality at the Reduce task level, or data locality at the job level (both Map and Reduce tasks) [8]. For instance, Hybrid scheduling MapReduce priority (HybSMRP) [9] was presented by Ghandomi et al in 2019 and is a hybrid scheduler that combines dynamic job priority and data localization. It determines job priority based on three parameters: running time, job size, and waiting time.…”
Section: Related Workmentioning
confidence: 99%
“…The dynamics and heterogeneity of computing nodes in larger networks were not discussed by their paper. Gandomi et al [19] combined two existing techniques: dynamic job prioritizing and data localization to form a hybrid scheduling algorithm, aiming at increasing data locality rate, and decreasing completion time. The proposed schedulers were evaluated on a Hadoop cluster of one master node and 20 slave nodes, which had homogeneous architecture with stable and fast network connections.…”
Section: Related Workmentioning
confidence: 99%
“…A hybrid scheduling algorithm, HybSMRP, is proposed in [7] to improve data local execution and job latency. Authors proposed two techniques to achieve their objectives: dynamic priority and localization ID.…”
Section: Literature Surveymentioning
confidence: 99%