Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data 2020
DOI: 10.1145/3318464.3380575
|View full text |Cite
|
Sign up to set email alerts
|

DB4ML - An In-Memory Database Kernel with Machine Learning Support

Abstract: In this paper, we revisit the question of how ML algorithms can be best integrated into existing DBMSs to not only avoid expensive data copies to external ML tools but also to comply with regulatory reasons. The key observation is that database transactions already provide an execution model that allows DBMSs to efficiently mimic the execution model of modern parallel ML algorithms. As a main contribution, this paper presents DB4ML, an in-memory database kernel that allows applications to implement user-define… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
2

Relationship

0
10

Authors

Journals

citations
Cited by 25 publications
(11 citation statements)
references
References 22 publications
0
11
0
Order By: Relevance
“…Another relevant issue is investigating a learned strategy [33][34][35] to implement the adaptive mechanism in AMG-Buffer. It has been a hot topic in recent years to use machine learning models for optimizing database components [36]. In future work, it will be possible to consider a machine-learning-based approach to provide a more general solution to selecting the migration granularity for page replacements.…”
Section: Discussionmentioning
confidence: 99%
“…Another relevant issue is investigating a learned strategy [33][34][35] to implement the adaptive mechanism in AMG-Buffer. It has been a hot topic in recent years to use machine learning models for optimizing database components [36]. In future work, it will be possible to consider a machine-learning-based approach to provide a more general solution to selecting the migration granularity for page replacements.…”
Section: Discussionmentioning
confidence: 99%
“…Later in the 2010's, MADlib [27] suggested using UDAs/UDFs for embedding ML into the database. Apache Spark's MLlib [47], Apache Mahout Samsara [75] and others [23,30,40,77,82,83,88] can be seen as a continuation of this effort. A broad overview of in-DB ML techniques can be found in [39].…”
Section: Related Workmentioning
confidence: 99%
“…BAGUA is built on decades of research regarding distributed machine learning systems and algorithms. Plenty of them are from the database community [52,53,54,55,56,57,58,59,60,61,46,47]. We now summarize related work and discuss some in details to provide backgrounds and contexts.…”
Section: Preliminaries and Related Workmentioning
confidence: 99%