Proceedings of the 1st Workshop on Machine Learning and Systems 2021
DOI: 10.1145/3437984.3458841
|View full text |Cite
|
Sign up to set email alerts
|

High-Dimensional Bayesian Optimization with Multi-Task Learning for RocksDB

Abstract: RocksDB is a general-purpose embedded key-value store used in multiple different settings. Its versatility comes at the cost of complex tuning configurations. This paper investigates maximizing the throughput of RocksDB IO operations by auto-tuning ten parameters of varying ranges. Off-theshelf optimizers struggle with high-dimensional problem spaces and require a large number of training samples.We propose two techniques to tackle this problem: multitask modeling and dimensionality reduction through clusterin… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 10 publications
(1 citation statement)
references
References 28 publications
(37 reference statements)
0
1
0
Order By: Relevance
“…For instance, JanusGraph itself mainly focuses on graph serialization and query execution, while providing adapters to integrate third‐party softwares as its functional module for data storage and indices. Unfortunately, although there are already a few significant works toward automatically tuning parameters for different databases such as HBase, 4 Elasticsearch, 5 RocksDB, 6 and MySQL, 7 these solutions cannot be directly applied in the scenarios of modularized GDBs because they solely consider one specific software. What is worse, due to the complicated interactions across different modules, sequentially tuning each software with previous solutions may also fail to efficiently find the optimal configuration for modularized GDBs.…”
Section: Introductionmentioning
confidence: 99%
“…For instance, JanusGraph itself mainly focuses on graph serialization and query execution, while providing adapters to integrate third‐party softwares as its functional module for data storage and indices. Unfortunately, although there are already a few significant works toward automatically tuning parameters for different databases such as HBase, 4 Elasticsearch, 5 RocksDB, 6 and MySQL, 7 these solutions cannot be directly applied in the scenarios of modularized GDBs because they solely consider one specific software. What is worse, due to the complicated interactions across different modules, sequentially tuning each software with previous solutions may also fail to efficiently find the optimal configuration for modularized GDBs.…”
Section: Introductionmentioning
confidence: 99%