Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data 2020
DOI: 10.1145/3318464.3380591
|View full text |Cite
|
Sign up to set email alerts
|

Black or White? How to Develop an AutoTuner for Memory-based Analytics

Abstract: There is a lot of interest today in building autonomous (or, self-driving) data processing systems. An emerging school of thought is to leverage AI-driven "black box" algorithms for this purpose. In this paper, we present a contrarian view. We study the problem of autotuning the memory allocation for applications running on modern distributed data processing systems. We show that an empirically-driven "white-box" algorithm, called RelM, that we have developed provides a close-to-optimal tuning at a fraction of… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
28
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 52 publications
(29 citation statements)
references
References 33 publications
0
28
0
Order By: Relevance
“…Recently, there has been an active research area on automatically tuning database configurations using Machine Learning (ML) techniques [3,18,24,44,47,55,91,92]. We summarize three key modules in the existing tuning systems: knob selection that prunes the configuration space, configuration optimization that samples promising configurations over the pruned space, and knowledge transfer that further speeds up the tuning process via historical data.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, there has been an active research area on automatically tuning database configurations using Machine Learning (ML) techniques [3,18,24,44,47,55,91,92]. We summarize three key modules in the existing tuning systems: knob selection that prunes the configuration space, configuration optimization that samples promising configurations over the pruned space, and knowledge transfer that further speeds up the tuning process via historical data.…”
Section: Introductionmentioning
confidence: 99%
“…Kunjir et al [38] investigate memory allocation auto-tuning for applications on distributed data processing systems and propose a white-box algorithm RelM. The RelM is developed to empirically model the interactions of memory management options and provides analytical models to estimate different competing memory pools requirements in an application.…”
Section: Tuningmentioning
confidence: 99%
“…Auto-tuning with a performance model which leverages an application characterization to aid the search: [15] and [30] are two publications in the area of Spark auto-tuning that proposes more similar systems to our work, generalizing unseen workloads by some characterization of the application. [15] approach to extract the features is more close to our work as it capture Task and Stage information from an Spark application, while [30] follows a more general approach by profiling the application to extract statistics like the average CPU and disk usage that can work with other Big Data frameworks rather than Spark.…”
Section: E Experiments For Rq2mentioning
confidence: 99%
“…In [30], a vector of statistics is extracted from the runtime of the application to be optimized. This feature vector is used to modify a BO procedure to guide the search process.…”
Section: E Experiments For Rq2mentioning
confidence: 99%
See 1 more Smart Citation