Proceedings of the 2nd ACM Symposium on Cloud Computing 2011
DOI: 10.1145/2038916.2038919
|View full text |Cite
|
Sign up to set email alerts
|

Modeling and synthesizing task placement constraints in Google compute clusters

Abstract: Evaluating the performance of large compute clusters requires benchmarks with representative workloads. At Google, performance benchmarks are used to obtain performance metrics such as task scheduling delays and machine resource utilizations to assess changes in application codes, machine configurations, and scheduling algorithms. Existing approaches to workload characterization for high performance computing and grids focus on task resource requirements for CPU, memory, disk, I/O, network, etc. Such resource … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
88
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 153 publications
(88 citation statements)
references
References 30 publications
(27 reference statements)
0
88
0
Order By: Relevance
“…Alternatively, if a job prefers distributing its resources the scheduler will allocate RUs in different machines, racks and/or cluster switches, assuming knowledge of the cluster's topology. Placement preferences for reasons such as security [32] can also be specified in the form of attributes at submission time by the user. Sampling at high load: Equation 6 estimates the probability of finding near-optimal resources accurately when resources are not scarce.…”
Section: Scheduling With Guaranteesmentioning
confidence: 99%
“…Alternatively, if a job prefers distributing its resources the scheduler will allocate RUs in different machines, racks and/or cluster switches, assuming knowledge of the cluster's topology. Placement preferences for reasons such as security [32] can also be specified in the form of attributes at submission time by the user. Sampling at high load: Equation 6 estimates the probability of finding near-optimal resources accurately when resources are not scarce.…”
Section: Scheduling With Guaranteesmentioning
confidence: 99%
“…The priority determines the importance of each task. The task placement constraints specify additional scheduling constraints concerning the machine configurations, such as processor architecture of the physical machine [21]. To simplify, we do not consider the scheduling class, priority and task placement constraints in our model.…”
Section: Workload Analysismentioning
confidence: 99%
“…Finally, as an initial effort towards solving this problem, we currently consider that all the machines in the cluster are homogenous and with identical resource capacities. It is part of our future work to extend our model to consider machine heterogeneity (e.g., multiple generations of machines [21]). …”
Section: System Architecturementioning
confidence: 99%
“…This assumption is also in agreement with analysis and measurements taken from Google cluster traces. The Google traces were for general-purpose compute cluster recorded over 29 days (Reiss, Tumanov, Ganger, Katz, & Kozuch, 2012;Sharma, Chudnovsky, Hellerstein, Rifaat, & Das, 2011;Zhang, Hellerstein, & Boutaba, 2011). In addition, we assume the JS handles homogenous jobs (i.e., of the same type and the same SLO performance requirements).…”
Section: Analytical Modelmentioning
confidence: 99%