2011 IEEE/ACM 12th International Conference on Grid Computing 2011
DOI: 10.1109/grid.2011.21
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking MapReduce Implementations for Application Usage Scenarios

Abstract: Abstract-The MapReduce paradigm provides a scalable model for large scale data-intensive computing and associated fault-tolerance. With data production increasing daily due to ever growing application needs, scientific endeavors, and consumption, the MapReduce model and its implementations need to be further evaluated, improved, and strengthened. Several MapReduce frameworks with various degrees of conformance to the key tenets of the model are available today, each, optimized for specific features. HPC applic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
18
0

Year Published

2011
2011
2017
2017

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(19 citation statements)
references
References 17 publications
1
18
0
Order By: Relevance
“…In 2011 Fadika et al [9] presented a performance evaluation study to compare MapReduce platforms under a wide range of use cases. They compared the performance of MapReduce, Apache Hadoop, Twister, and LEMO.…”
Section: Related Workmentioning
confidence: 99%
“…In 2011 Fadika et al [9] presented a performance evaluation study to compare MapReduce platforms under a wide range of use cases. They compared the performance of MapReduce, Apache Hadoop, Twister, and LEMO.…”
Section: Related Workmentioning
confidence: 99%
“…Magellan staff also worked closely with the Grid Computing Research Laboratory at SUNY, Binghamton in a comparative benchmarking study of MapReduce implementations and an alternate implementation of MapReduce that can work in HPC environments. This collaboration resulted in two publications in Grid 2011 [25,24].…”
Section: Collaborations and Synergistic Activitiesmentioning
confidence: 99%
“…In addition Magellan staff participated in the study. The paper was published in Grid 2011 [24] In this study, we compare and study the performance of Hadoop [4], Twister [22] and LEMO-MR [26] in various real-world application usage scenarios, including data-intensive, iterative, CPU-intensive, and memory-intensive loads. We not only compare the chosen frameworks in several real-world application scenarios, but also in real-world cluster scenarios, including fault-prone clusters/applications, physically heterogeneous clusters and load-imbalanced clusters.…”
Section: Comparison Of Mapreduce Implementationsmentioning
confidence: 99%
“…CPU-intensive applications tend to utilize CPU 100% but CPU-light applications do not [16]. Memory usage, disk usage, network traffic usage etc.…”
Section: Realization Of Testsmentioning
confidence: 99%
“…Fadika et al performed tests to realize which framework shows high operating performance on which works by comparing two different MapReduce frameworks with Hadoop. In their study, tested applications are classified as CPU-intensive, memory-intensive, or data-intensive applications [16].…”
Section: Introductionmentioning
confidence: 99%