2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom) 2015
DOI: 10.1109/cloudcom.2015.52
|View full text |Cite
|
Sign up to set email alerts
|

Gemini: An Adaptive Performance-Fairness Scheduler for Data-Intensive Cluster Computing

Abstract: In data-intensive cluster computing platforms such as Hadoop YARN, performance and fairness are two important concerns for users. Existing studies show that, because of the resource contention between users/jobs, there is a tradeoff between the performance and fairness. In our work, we observe that such trade-off is related to the resource demand of the workload and is changing with the variation of multi-resource demand of submitted jobs during the computation. We also find that having an algorithm to be awar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
13
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 15 publications
(13 citation statements)
references
References 27 publications
0
13
0
Order By: Relevance
“…Gemini [16] uses a model that captures the tradeoff between performance improvement and fairness loss for jobs scheduled in shared clusters. The model quantifies the complementarity in the resource demands of jobs and is trained on historic workload data.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Gemini [16] uses a model that captures the tradeoff between performance improvement and fairness loss for jobs scheduled in shared clusters. The model quantifies the complementarity in the resource demands of jobs and is trained on historic workload data.…”
Section: Related Workmentioning
confidence: 99%
“…These systems have become popular tools for workloads that range from data aggregation and search to relational queries, graph processing, and machine learning [17,9,8,15]. Jobs from these diverse domains stress different resources, while the resource demands typically also fluctuate significantly over the runtime of jobs [18,21,16]. Therefore, multiple jobs usually share cluster resources without isolation, so they can benefit from statistical multiplexing [26,20,23].…”
Section: Introductionmentioning
confidence: 99%
“…Gemini 11 uses a model that captures the tradeoff between performance improvement and fairness loss for jobs scheduled in shared clusters. The model quantifies the complementarity in the resource demands of jobs and is trained on historic workload data.…”
Section: Related Workmentioning
confidence: 99%
“…These systems have become popular tools for workloads that range from data aggregation and search to relational queries, graph processing, and machine learning 7‐9 . Jobs from these diverse domains stress different resources, while the resource demands typically also fluctuate significantly over the runtime of jobs 2,10‐12 . Therefore, multiple jobs usually share cluster resources without isolation, so they can benefit from statistical multiplexing 2,13,14 .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation