Proceedings of the 2022 International Conference on Management of Data 2022
DOI: 10.1145/3514221.3526157
|View full text |Cite
|
Sign up to set email alerts
|

LOCAT: Low-Overhead Online Configuration Auto-Tuning of Spark SQL Applications

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 31 publications
0
6
0
Order By: Relevance
“…Concretely, they collect training samples by running jobs with different configurations on a non-production cluster, and then train a performance model to suggest new configurations for Spark job running on the production cluster. Training such an accurate performance model involves lots of offline job executions (e.g., 1000-10000) [76]. This process is very time-consuming and expensive, and often incurs data security issues when accessing business data during multiple job executions in the non-production cluster.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…Concretely, they collect training samples by running jobs with different configurations on a non-production cluster, and then train a performance model to suggest new configurations for Spark job running on the production cluster. Training such an accurate performance model involves lots of offline job executions (e.g., 1000-10000) [76]. This process is very time-consuming and expensive, and often incurs data security issues when accessing business data during multiple job executions in the non-production cluster.…”
Section: Methodsmentioning
confidence: 99%
“…(C.1 Limited Functionality) Lots of methods (e.g., DAC [79], LOCAT [76]) are designed to minimize the execution time, i.e., finding the fastest configuration. However, the goal in many scenarios involves the execution cost [2,64] (i.e., the cheapest configuration), or more generalized objectives such as a weighted combination between runtime and cost.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations