Proceedings of the 11th ACM Symposium on Cloud Computing 2020
DOI: 10.1145/3419111.3421305
|View full text |Cite
|
Sign up to set email alerts
|

Finding the right cloud configuration for analytics clusters

Abstract: Finding good cloud configurations for deploying a single distributed system is already a challenging task, and it becomes substantially harder when a data analytics cluster is formed by multiple distributed systems since the search space becomes exponentially larger. In particular, recent proposals for single system deployments rely on benchmarking runs that become prohibitively expensive as we shift to joint optimization of multiple systems, as users have to wait until the end of a long optimization run to st… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 27 publications
(16 citation statements)
references
References 24 publications
0
16
0
Order By: Relevance
“…We use default values to set application parameters except for executor 4 and memory parameters in Spark. In order to prevent out of memory (OOM) exceptions, we use Mesos [13] to watch the real usage of memory per executor.…”
Section: Evaluation 51 Experiments Setupmentioning
confidence: 99%
See 3 more Smart Citations
“…We use default values to set application parameters except for executor 4 and memory parameters in Spark. In order to prevent out of memory (OOM) exceptions, we use Mesos [13] to watch the real usage of memory per executor.…”
Section: Evaluation 51 Experiments Setupmentioning
confidence: 99%
“…However, we have illustrated that the way of reusing pre-trained model is fragile when workloads are from different frameworks. Vanir [4] combines a series of techniques such as Mondrian forest model and transfer learning to search the right cloud configurations. But their have not yet studied the correlation similarities or other types of similarities across frameworks.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Some configuration tuning systems have leveraged similarity to accelerate the tuning process: AROMA [11] clusters the Hadoop workloads then builds a performance model that guides the tuning of each workload cluster. Scout [43] and Vanir [44] exploit workload similarity to explore the search space more effectively for tuning cloud configurations (number of instances and their resource allocation). Ultimately, a joint solution for optimizing cloud instance and DISC framework configurations will be needed.…”
Section: Metricsmentioning
confidence: 99%