2021
DOI: 10.48550/arxiv.2109.06339
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ML Based Lineage in Databases

Michael Leybovich,
Oded Shmueli

Abstract: We track the lineage of tuples throughout their database lifetime. That is, we consider a scenario in which tuples (records) that are produced by a query may affect other tuple insertions into the DB, as part of a normal workflow. As time goes on, exact provenance explanations for such tuples become deeply nested, increasingly consuming space, and resulting in decreased clarity and readability.We present a novel approach for approximating lineage tracking, using a Machine Learning (ML) and Natural Language Pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 26 publications
0
1
0
Order By: Relevance
“…We previously devised [19,21] a novel approach to lineage tracking, which is based on ML and NLP techniques. The main idea is summarizing, and thereby approximating, the lineage of every tuple (denoted TV ), or each column of the tuple (denoted CV ), with a set of up to 𝑚𝑎𝑥_𝑣𝑒𝑐𝑡𝑜𝑟𝑠 (a hyperparameter) low-dimensional (e.g., 200) vectors.…”
Section: Lineage Vectors Adorned Db 31 Solution Overviewmentioning
confidence: 99%
“…We previously devised [19,21] a novel approach to lineage tracking, which is based on ML and NLP techniques. The main idea is summarizing, and thereby approximating, the lineage of every tuple (denoted TV ), or each column of the tuple (denoted CV ), with a set of up to 𝑚𝑎𝑥_𝑣𝑒𝑐𝑡𝑜𝑟𝑠 (a hyperparameter) low-dimensional (e.g., 200) vectors.…”
Section: Lineage Vectors Adorned Db 31 Solution Overviewmentioning
confidence: 99%