The 2014 ACM International Conference on Measurement and Modeling of Computer Systems 2014
DOI: 10.1145/2591971.2591998
|View full text |Cite
|
Sign up to set email alerts
|

Balanced resource allocations across multiple dynamic MapReduce clusters

Abstract: Running multiple instances of the MapReduce framework concurrently in a multicluster system or datacenter enables data, failure, and version isolation, which is attractive for many organizations. It may also provide some form of performance isolation, but in order to achieve this in the face of time-varying workloads submitted to the MapReduce instances, a mechanism for dynamic resource (re-)allocations to those instances is required. In this paper, we present such a mechanism called Fawkes that attempts to ba… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
4
3
2

Relationship

2
7

Authors

Journals

citations
Cited by 22 publications
(10 citation statements)
references
References 32 publications
(22 reference statements)
0
10
0
Order By: Relevance
“…During the past decade, performance of MapReduce became a rich exploration domain, leading to several papers focusing on diverse aspects of MapReduce scheduling: data locality [31], stragglers [5], [6], resource heterogeneity [33], or elastic scaling [12], [13], [21]. State-of-the art schedulers for MapReducebased systems assume they have complete control over a fixed set of resources, thus they are typically deployed on dedicated clusters of machines.…”
Section: E Improvements From Tyrexmentioning
confidence: 99%
See 1 more Smart Citation
“…During the past decade, performance of MapReduce became a rich exploration domain, leading to several papers focusing on diverse aspects of MapReduce scheduling: data locality [31], stragglers [5], [6], resource heterogeneity [33], or elastic scaling [12], [13], [21]. State-of-the art schedulers for MapReducebased systems assume they have complete control over a fixed set of resources, thus they are typically deployed on dedicated clusters of machines.…”
Section: E Improvements From Tyrexmentioning
confidence: 99%
“…TYREX uses resource partitioning and work-conserving job migration across these partitions as its two main principles. A common way of partitioning the resources of a datacenter is to allocate disjoint sets of machines to multiple instances of the MapReduce framework [12]. However, this scheduling model is not attractive for jobs that are moved across partitions but still require access to the same data, as the cost of replicating the data across partitions may be prohibitive.…”
mentioning
confidence: 99%
“…However, the policy does not consider cluster shrink requests. Ghit et al [155] extended the above-mentioned policies by accounting dynamic demand (job, data, and task), dynamic usage (processor, disk, and memory), and actual performance (job slowdown, job throughput, and task throughput) analysis when resizing a MapReduce cluster.…”
Section: Resource Allocation Mechanisms For Geo-distributed Systemsmentioning
confidence: 99%
“…The highlighted components cover the minimum set of layers necessary for execution for the MapReduce ecosystem; the presence of several high-level languages indicates that the ecosystem has diverse users, with minimal expertise and ability in managing the ecosystem beyond the high-level language they know. This reference architecture was useful to our research, design, and engineering: with it as a guide, we have created the Fawkes elastic MapReduce system [94].…”
Section: Datacenters: Designing the Digital Factorymentioning
confidence: 99%