Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data 2020
DOI: 10.1145/3318464.3389711
|View full text |Cite
|
Sign up to set email alerts
|

ALEX: An Updatable Adaptive Learned Index

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
122
1

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
4
1

Relationship

2
7

Authors

Journals

citations
Cited by 172 publications
(125 citation statements)
references
References 17 publications
2
122
1
Order By: Relevance
“…Recursive model indexes (RMIs) are one such class of models [8] (although others [3][4][5]11] exist as well) , combining simpler machine learning models together into a multistaged structure. For example, as depicted in Figure 1, an RMI with two stages, a linear stage and a cubic stage, would first use a linear model to make an initial prediction of an index for a specific key (stage 1).…”
Section: Introductionmentioning
confidence: 99%
“…Recursive model indexes (RMIs) are one such class of models [8] (although others [3][4][5]11] exist as well) , combining simpler machine learning models together into a multistaged structure. For example, as depicted in Figure 1, an RMI with two stages, a linear stage and a cubic stage, would first use a linear model to make an initial prediction of an index for a specific key (stage 1).…”
Section: Introductionmentioning
confidence: 99%
“…Closest to our work is a proposal of learned partition adviser using deep RL [19]; it focuses on replication and coarse-grained partitioning (e.g., hash) along entire attribute(s), unlike qd-tree which partitions based on a rich set of fine-grained candidate cuts. In this space, machine learning has also been used to revisit tuning [47], workload forecasting [28], data structures and indexes [12,20,23,32], and query optimization [13,24,29,50]. Our qd-tree may be viewed as a learned physical design or indexing tool: it optimizes for scan-based workloads, common in big data analytics, to minimize the I/O cost of block accesses.…”
Section: Related Workmentioning
confidence: 99%
“…If we follow this assumption, by learning the Cumulative Distribution Function (CDF) of the input data, the mapping function of an index can be learned. Due to the complexity of the CDF, a single ML model learned over the complete data cannot provide the desired accuracy [4]. To address this issue, a Recursive Model Index (RMI, for short) has been introduced.…”
Section: Part 1: Learned Index Structuresmentioning
confidence: 99%
“…These initial works have been focused on read-only workloads. To deal with updates, a new class of updatable adaptive learned indexes has been proposed, e.g., [4]. It has been demonstrated that a careful space-time trade-off can lead to an updatable data structure.…”
Section: Introductionmentioning
confidence: 99%