2017
DOI: 10.13005/ojcst/10.01.26
|View full text |Cite
|
Sign up to set email alerts
|

Improved fair Scheduling Algorithm for Hadoop Clustering

Abstract: Traditional way of storing such a huge amount of data is not convenient because processing those data in the later stages is very tedious job. So nowadays, Hadoop is used to store and process large amount of data. When we look at the statistics of data generated in the recent years it is very high in the last 2 years. Hadoop is a good framework to store and process data efficiently. It works like parallel processing and there is no failure or data loss as such due to fault tolerance. Job scheduling is an impor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 10 publications
0
1
0
Order By: Relevance
“…When the dataset size is sufficiently large, the name node processes scan lead to failure, or high over head may be accrued. To overcome these issues of the conventional fair scheduler in Hadoop, the authors in [28] proposed an improved fair scheduling algorithm for clustering user jobs. The advantage of the improved fair scheduling scheme is its efficiency in producing throughput for datasets of variable size; however, the disadvantages are that long jobs can slow the algorithm and cause overloading issues at a node.…”
Section: Problem Statementmentioning
confidence: 99%
“…When the dataset size is sufficiently large, the name node processes scan lead to failure, or high over head may be accrued. To overcome these issues of the conventional fair scheduler in Hadoop, the authors in [28] proposed an improved fair scheduling algorithm for clustering user jobs. The advantage of the improved fair scheduling scheme is its efficiency in producing throughput for datasets of variable size; however, the disadvantages are that long jobs can slow the algorithm and cause overloading issues at a node.…”
Section: Problem Statementmentioning
confidence: 99%