LOCAT: Low-Overhead Online Configuration Auto-Tuning of Spark SQL Applications

Xin, Jinhan; Hwang, Kai; Yu, Zhibin

doi:10.48550/arxiv.2203.14889

Cited by 1 publication

(5 citation statements)

References 52 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…CherryPick [1] directly performs BO on a discretized search space. LOCAT [38] further combines dynamic sensitivity analysis and datasize-aware Gaussian process (GP) to perform optimization on important parameters. Despite the competitive converged results, the aforementioned methods suffer from the re-optimization issue [23], which is, the performance model needs retraining and still requires a number of online configuration evaluations for each coming task.…”

Section: Related Workmentioning

confidence: 99%

“…Benchmarks. For end-to-end comparison, we follow LOCAT [38] and use three SQL-related tasks from the widely used Spark benchmark HiBench [16]: (1) 'Join' is a query that executes in two phases: if M ≠ ∅ then 3:…”

Section: Setupsmentioning

confidence: 99%

“…optimization. We first run Bayesian optimization over three different search spaces: 1) Rover space (10 params); 2) the Tuneful [8] space (30 params); 3) the Tuneful space with dynamic shrinking used in LOCAT [38]. The results are shown in Figure 8(a).…”

Section: Memory Ratiomentioning

confidence: 99%

“…Recent studies [1,8,38] apply the Bayesian optimization (BO) framework to reduce the required number of evaluations to find a near-optimal configuration. In brief, BO trains a surrogate on evaluated configurations and their performance, and then selects the next configuration by balancing exploration and exploitation.…”

Section: Introductionmentioning

confidence: 99%

“…Baselines. For end-to-end comparison, we compare Rover with the following SOTA BO-based tuning algorithms: (1) CherryPick [1]: A method with a discretized search space; (2) Tuneful [8]: A method that explores significant parameters and applies a multi-task GP to use the most similar previous task; (3) LOCAT [38] that identifies important parameters and dynamically reduces the search space; (4) ResTune [44]: A transfer learning method that uses all the history knowledge to accelerate the tuning process. Experiment Settings.…”

mentioning

confidence: 99%

See 4 more Smart Citations

Rover: An online Spark SQL tuning service via generalized transfer learning

Shen¹,

Ren²,

Lu³

et al. 2023

Preprint

View full text Add to dashboard Cite

Distributed data analytic engines like Spark are common choices to process massive data in industry. However, the performance of Spark SQL highly depends on the choice of configurations, where the optimal ones vary with the executed workloads. Among various alternatives for Spark SQL tuning, Bayesian optimization (BO) is a popular framework that finds near-optimal configurations given sufficient budget, but it suffers from the re-optimization issue and is not practical in real production. When applying transfer learning to accelerate the tuning process, we notice two domain-specific challenges: 1) most previous work focus on transferring tuning history, while expert knowledge from Spark engineers is of great potential to improve the tuning performance but is not well studied so far; 2) history tasks should be carefully utilized, where using dissimilar ones lead to a deteriorated performance in production.In this paper, we present Rover, a deployed online Spark SQL tuning service for efficient and safe search on industrial workloads. To address the challenges, we propose generalized transfer learning to boost the tuning performance based on external knowledge, including expert-assisted Bayesian optimization and controlled history transfer. Experiments on public benchmarks and real-world tasks show the superiority of Rover over competitive baselines. Notably, Rover saves an average of 50.1% of the memory cost on 12k real-world Spark SQL tasks in 20 iterations, among which 76.2% of the tasks achieve a significant memory reduction of over 60%. CCS CONCEPTS• Computing methodologies → Search methodologies; • Information systems → Data management systems.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Setupsmentioning

confidence: 99%