2012
DOI: 10.1007/s10766-012-0227-4
|View full text |Cite
|
Sign up to set email alerts
|

Analytical Performance Models for MapReduce Workloads

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
48
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 53 publications
(49 citation statements)
references
References 24 publications
0
48
0
Order By: Relevance
“…Using the model from [37] the work in [38] provides approximations for the number of jobs in a tandem system consisting of a map queue and a reduce queue in the heavy traffic regime. The work in [41] derives approximations of the mean response time in MapReduce systems using a mean value analysis technique and a closed FJ queueing system model from [39].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Using the model from [37] the work in [38] provides approximations for the number of jobs in a tandem system consisting of a map queue and a reduce queue in the heavy traffic regime. The work in [41] derives approximations of the mean response time in MapReduce systems using a mean value analysis technique and a closed FJ queueing system model from [39].…”
Section: Related Workmentioning
confidence: 99%
“…h (2) prior to N ; moreover, θ is the implicit solution from (41). Figure 10 illustratesR N as a function of N for several utilization levels ρ for both renewal (a) and non-renewal (b) input; recall that the utilization on each path is ρ N .…”
Section: Non-renewal Arrivalsmentioning
confidence: 99%
“…Some other approaches, e.g., [24], are based on an approximate mean value analysis technique and use an iterative hierarchical approach. Along the same lines, [34] combines a precedence graph and a QN to capture the intra-job synchronization constraints, thus being able to estimate the synchronization delays introduced by the communication among mappers and reducers. Unfortunately, even if the approach is rather accurate (around 15% accuracy on real systems), the authors assume that CPU slots are statically assigned to mappers and reducers, hence the proposed method cannot be adopted to estimate performance under the Hadoop 2.x dynamic resource assignment policy.…”
Section: Modeling Hadoop 2x Applications Performancementioning
confidence: 99%
“…MapReduce has been adopted in multiple application domains, e.g., machine learning, graph processing, and data mining [34], and its open source implementation, Hadoop 2.x, recently introduced a wide set of performance enhancements (e.g., SSD support, caching, and I/O barriers mitigation). IDC estimates that Hadoop touched half of the world data last year [20] supporting both traditional batch and interactive data analysis applications [32].…”
Section: Introductionmentioning
confidence: 99%
“…What is the asymptotic behavior of the log weight sum log (X (n) ) dependent on the problem/instance size n? As remarked above the measurements X (n) of a particular MapReduce Sets for Replicated Join Expression assignment instance depend on the value n. In analogy to information intention [7] we assume that the log weight sums converge according to an asymptotic equipartition property, i.e.,…”
Section: Typicality Of Instancesmentioning
confidence: 99%