2021
DOI: 10.48550/arxiv.2106.01543
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Niffler: A Reference Architecture and System Implementation for View Discovery over Pathless Table Collections by Example

Abstract: Identifying a project-join view (PJ-view) over collections of tables is the first step of many data management projects, e.g., assembling a dataset to feed into a business intelligence tool, creating a training dataset to fit a machine learning model, and more. When the table collections are large and lack join information-such as when combining databases, or on data lakes-query by example (QBE) systems can help identify relevant data, but they are designed under the assumption that join information is availab… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 22 publications
0
1
0
Order By: Relevance
“…Using this graph, Aurum outputs all the transitively connected join paths of hop = k given a source and a target node. Niffler [11] uses the same EKG to find all possible combinations of columns, generates only 2-hop join paths between these columns and ranks the paths according tot the score from EKG. Our ranking function is customised for ML classification.…”
Section: B Join Path Discoverymentioning
confidence: 99%
“…Using this graph, Aurum outputs all the transitively connected join paths of hop = k given a source and a target node. Niffler [11] uses the same EKG to find all possible combinations of columns, generates only 2-hop join paths between these columns and ranks the paths according tot the score from EKG. Our ranking function is customised for ML classification.…”
Section: B Join Path Discoverymentioning
confidence: 99%