Proceedings of the 2019 International Conference on Management of Data 2019
DOI: 10.1145/3299869.3324961
|View full text |Cite
|
Sign up to set email alerts
|

A Layered Aggregate Engine for Analytics Workloads

Abstract: is paper introduces LMFAO (Layered Multiple Functional Aggregate Optimization), an in-memory optimization and execution engine for batches of aggregates over the input database. e primary motivation for this work stems from the observation that for a variety of analytics over databases, their data-intensive tasks can be decomposed into groupby aggregates over the join of the input database relations. We exemplify the versatility and competitiveness of LMFAO for a handful of widely used analytics: learning ridg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
56
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 52 publications
(57 citation statements)
references
References 44 publications
0
56
0
Order By: Relevance
“…Recent work puts forward new optimization and evaluation strategies that go beyond the capabilities of existing database management systems. Recent experiments confirm this observation: Whereas existing query processing techniques are mature at executing one query, they miss opportunities for systematically sharing computation across several queries in a batch [50].…”
Section: Structure-aware Learningmentioning
confidence: 85%
See 3 more Smart Citations
“…Recent work puts forward new optimization and evaluation strategies that go beyond the capabilities of existing database management systems. Recent experiments confirm this observation: Whereas existing query processing techniques are mature at executing one query, they miss opportunities for systematically sharing computation across several queries in a batch [50].…”
Section: Structure-aware Learningmentioning
confidence: 85%
“…The tightly-integrated systems F [51], AC/DC [3], and LMFAO [50] are data structure-aware in that they exploit the structure and sparsity of the database to lower the complexity and drastically improve the runtime performance of the learning process. In contrast, we call all the other systems structure-agnostic, since they do not exploit properties of the input database.…”
Section: Structure-aware Learningmentioning
confidence: 99%
See 2 more Smart Citations
“…This work opens exciting avenues of future research. New technical developments include: a compilation approach to capture LMFAO's efficient support for categorical variables with multi-root join trees and group-by aggregates [47]; support for parallelization and many-core architectures; and an investigation of the trade-off between runtime performance and size of generated C++ code for models with high degree and many parameters (e.g., factorization machines). We would also like to improve the usability of IFAQ as follows: build an IFAQ library of optimization algorithms and ML models beyond the simple ones discussed in this paper and including boosting trees, random forests, and neural networks; generate optimized code for model selection over different subsets of the given variables; allow IFAQ to work directly on Jupyter notebooks that specify the construction of the data matrix and the model training; and investigate whether the IFAQ compilation techniques can be incorporated into popular data science tools such as Scikit and TensorFlow.…”
Section: Discussionmentioning
confidence: 99%