2015
DOI: 10.7287/peerj.preprints.1320
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Hybrid HDFS: decreasing energy consumption and speeding up Hadoop using SSDs

Abstract: Apache Hadoop has evolved significantly over the last years, with more than 60 releases bringing new features. By implementing the MapReduce programming paradigm and leveraging HDFS, its distributed file system, Hadoop has become a reliable and fault tolerant middleware for parallel and distributed computing over large datasets.Nevertheless, Hadoop may struggle under certain workloads, resulting in poor performance and high energy consumption. Users increasingly demand that high performance computing solutions… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 16 publications
0
4
0
Order By: Relevance
“…The authors of [28] have told us that a number of users nowadays increasingly demand high-performance computing solutions being able to address sustainability and limit power consumption. They have also given an HDFS approach, a hybrid storage mechanism, which uses hard disk and solid-state disk combination to get better performance and save energy.…”
Section: Related Workmentioning
confidence: 99%
“…The authors of [28] have told us that a number of users nowadays increasingly demand high-performance computing solutions being able to address sustainability and limit power consumption. They have also given an HDFS approach, a hybrid storage mechanism, which uses hard disk and solid-state disk combination to get better performance and save energy.…”
Section: Related Workmentioning
confidence: 99%
“…Adding SSDs into the storage system is one of the most efficient ways to improve I/O intensive workload performance and reduce energy consumption in big data ecosystems [56]. Usually, SSD drives outperform HDD drives due to the SSD and HDD hardware architecture design differentiation [13].…”
Section: Storage Capability and Workload Characteristicmentioning
confidence: 99%
“…reduce cost [60] [62] and decrease energy consumption [56]. However, SSD is not always the optimal storage for MapReduce workloads.…”
Section: Hadoop Performance and Ssd Efficiencymentioning
confidence: 99%
See 1 more Smart Citation