2015
DOI: 10.1002/cpe.3628
|View full text |Cite
|
Sign up to set email alerts
|

BSP cost and scalability analysis for MapReduce operations

Abstract: Data abundance poses the need for powerful and easy-to-use tools that support processing large amounts of data. MapReduce has been increasingly adopted for over a decade by many companies, and more recently, it has attracted the attention of an increasing number of researchers in several areas. One main advantage is that the complex details of parallel processing, such as complex network programming, task scheduling, data placement, and fault tolerance, are hidden in a conceptually simple framework. MapReduce … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
8
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
8

Relationship

2
6

Authors

Journals

citations
Cited by 15 publications
(9 citation statements)
references
References 73 publications
(68 reference statements)
1
8
0
Order By: Relevance
“…This is the most scalable configuration over all scenarios analyzed in [5]. Simulation results that goes up to 10000 nodes corroborate the limits stated in this and other theorems of [5]. …”
Section: Discussionsupporting
confidence: 83%
See 1 more Smart Citation
“…This is the most scalable configuration over all scenarios analyzed in [5]. Simulation results that goes up to 10000 nodes corroborate the limits stated in this and other theorems of [5]. …”
Section: Discussionsupporting
confidence: 83%
“…In this scenario, MapReduce (and its Hadoop implementation) emerged as a paramount framework that supports design patterns which represent general reusable solutions to commonly occurring problems across a variety of problem domains including analysis and assembly of biological sequences [ 4 ]. MapReduce has delivered outstanding performance and scalability for a myriad of applications running over hundreds to thousands of processing nodes [ 5 ]. On the other hand, over the last decade, cloud computing has emerged as a powerful platform for the agile and dynamic provisioning of computational resources for computational and data intensive problems.…”
Section: Introductionmentioning
confidence: 99%
“…MapReduce is a programming framework that enables us to process massive amount of data in parallel in a distributed computing environment. is framework consists of two main functions, namely, Map and Reduce that can effectively manage structured as well as unstructured data [95,96]. As the name MapReduce indicates, reducer function occurs after the completion of the mapper function.…”
Section: Platform and Tools For Healthcare Big Data Analyticsmentioning
confidence: 99%
“…In , the scalability of MapReduce computations is studied. Proposed in 2004, MapReduce became very popular and widespread, being adopted for the execution of various Big Data applications such as web indexing, social network applications, processing logs and raw data, queries processing on large‐volume datasets, and many others.…”
Section: Papers In This Issuementioning
confidence: 99%