2015
DOI: 10.1016/j.jpdc.2015.01.001
|View full text |Cite
|
Sign up to set email alerts
|

Performance and energy efficiency of big data applications in cloud environments: A Hadoop case study

Abstract: The exponential growth of scientific and business data has resulted in the evolution of the cloud computing environments and the MapReduce parallel programming model. The focus of cloud computing is increased utilization and power savings through consolidation while MapReduce enables large scale data analysis.Hadoop, an open source implementation of MapReduce has gained popularity in the last few years. In this paper, we evaluate Hadoop performance in both the traditional model of collocated data and compute s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
33
0
1

Year Published

2015
2015
2020
2020

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 64 publications
(34 citation statements)
references
References 16 publications
(20 reference statements)
0
33
0
1
Order By: Relevance
“…The authors in [31] use a load predictor that clusters historical resource utilisation, and select the cluster set with the highest similarity as a training sample into a Neural Network. However, these approaches simplify the impact of colocating VMs, which can lead to significant performance overhead [83,84]. The authors in [85] tackle colocation interference and perform four parallel classifications on each application to evaluate the impact of vertical and horizontal scale, server configuration, and the impact of colocating applications.…”
Section: Open Research Challengesmentioning
confidence: 99%
“…The authors in [31] use a load predictor that clusters historical resource utilisation, and select the cluster set with the highest similarity as a training sample into a Neural Network. However, these approaches simplify the impact of colocating VMs, which can lead to significant performance overhead [83,84]. The authors in [85] tackle colocation interference and perform four parallel classifications on each application to evaluate the impact of vertical and horizontal scale, server configuration, and the impact of colocating applications.…”
Section: Open Research Challengesmentioning
confidence: 99%
“…The exodus of data and services provides more flexibility in surroundings, where data locality might not have a considerable impact. An energy efficiency assessment of Hadoop on physical and virtual clusters in different configurations is performed [31].…”
Section: Introductionmentioning
confidence: 99%
“…And research show that, in 2008, the world's 4400 servers consumed electricity 0.8% percent, if you go like that, at that rate By 2020, that proportion * Lei Wang: wl@ln.sgcc.com.cn will be 3.2%. Epa (US Environmental Protection Agency) issued a report statement in 2006, the total electricity consumption of American IT agencies was 61 billion KWh, the electricity bill alone is $4.5 billion [5]- [7]. So that's a concern Big data storage and processing performance must also be used for energy consumption Give enough attention [8].…”
Section: Introductionmentioning
confidence: 99%