2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (Ccgrid 2012) 2012
DOI: 10.1109/ccgrid.2012.135
|View full text |Cite
|
Sign up to set email alerts
|

MARLA: MapReduce for Heterogeneous Clusters

Abstract: Abstract-MapReduce has gradually become the framework of choice for "big data". The MapReduce model allows for efficient and swift processing of large scale data with a cluster of compute nodes. However, the efficiency here comes at a price. The performance of widely used MapReduce implementations such as Hadoop suffers in heterogeneous and load-imbalanced clusters. We show the disparity in performance between homogeneous and heterogeneous clusters in this paper to be high. Subsequently, we present MARLA, a Ma… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2012
2012
2019
2019

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 36 publications
(17 citation statements)
references
References 13 publications
(20 reference statements)
0
17
0
Order By: Relevance
“…Recall from the previous section that this same scenario also changes the power consumption of an individual node. Note that if our framework is properly tuned to have the appropriate boundary temperature for the given cluster, we can realize turnaround times on par with those discussed in [14]. Since we know that our execution times are similar to those of MARLA, we can see that power savings on the cluster level may be achieved.…”
Section: Comparison With Marlamentioning
confidence: 51%
See 1 more Smart Citation
“…Recall from the previous section that this same scenario also changes the power consumption of an individual node. Note that if our framework is properly tuned to have the appropriate boundary temperature for the given cluster, we can realize turnaround times on par with those discussed in [14]. Since we know that our execution times are similar to those of MARLA, we can see that power savings on the cluster level may be achieved.…”
Section: Comparison With Marlamentioning
confidence: 51%
“…The primary difference between our framework and MARLA is that we have scheduled this framework for energy awareness, whereas MARLA schedules for performance in heterogeneous clusters alone. Comparing our framework to MARLA is sufficient as MARLA has been compared to Hadoop and Mariane in our previous work [14]; showing improved performance in heterogeneous clusters.…”
Section: Comparison With Marlamentioning
confidence: 99%
“…In [15] a MapReduce framework for heterogeneous and loadimbalanced environments is described. The research presented in [16] and [17] focuses on a formulation of the MapReduce matchmaking and scheduling problem using linear programming.…”
Section: Related Workmentioning
confidence: 99%
“…This performance divergence between the two frameworks can be explained by the fact that Hadoop start-up cost is more visible in small data sizes. [10], [12], [13]. The MARISSA framework is designed as a lightweight MapReduce platform where the start-up overhead is minimal.…”
Section: B Data Transformation (Mr1)mentioning
confidence: 99%