Proceedings of the 2018 International Conference on Management of Data 2018
DOI: 10.1145/3183713.3196916
|View full text |Cite
|
Sign up to set email alerts
|

Discovering Graph Functional Dependencies

Abstract: This paper studies discovery of GFDs, a class of functional dependencies defined on graphs. We investigate the fixed-parameter tractability of three fundamental problems related to GFD discovery. We show that the implication and satisfiability problems are fixed-parameter tractable, but the validation problem is co-W[1]-hard. We introduce notions of reduced GFDs and their topological support, and formalize the discovery problem for GFDs. We develop algorithms for discovering GFDs and computing their covers. Mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 25 publications
(4 citation statements)
references
References 45 publications
0
3
0
Order By: Relevance
“…Data profiling is also heavily used in graph data, e.g., using Petri Nets in process mining to recover missing events [105], [106] and clean event data [107], discovering keys for graphs and applying keys to study entity matching [108], or defining functional dependencies for graphs [21] and discovering them [109]. However, the above studies still seem to be difficult when encountering large graphs.…”
Section: B Sampling For Profiling Graph Datamentioning
confidence: 99%
See 1 more Smart Citation
“…Data profiling is also heavily used in graph data, e.g., using Petri Nets in process mining to recover missing events [105], [106] and clean event data [107], discovering keys for graphs and applying keys to study entity matching [108], or defining functional dependencies for graphs [21] and discovering them [109]. However, the above studies still seem to be difficult when encountering large graphs.…”
Section: B Sampling For Profiling Graph Datamentioning
confidence: 99%
“…In this case, one has to design two parallel scalable algorithms, in MapReduce and a vertex-centric asynchronous model. In order to find Graph Functional Dependencies, Fan et al [109] have to deal with large-scale graphs by designing effective pruning strategies, using parallel algorithms, and adding processors. As mentioned earlier, designing parallel algorithms is difficult.…”
Section: B Sampling For Profiling Graph Datamentioning
confidence: 99%
“…This differs from our approach, which is specifically developed for graphs. More similar methods are used by (Fan et al, 2018;Yu and Heflin, 2011), which start out with minimal constraints and extend these iteratively until all options are exhausted. However, these algorithms only consider FDs.…”
Section: Related Workmentioning
confidence: 99%
“…NGDs. We extended the algorithm of [22] to discover NGDs from the graphs. The algorithm interleaves "vertical levelwise expansion" for mining frequent patterns Q and "horizontal levelwise expansion" for mining literals in X → Y , in a single process.…”
Section: Experimental Studymentioning
confidence: 99%