2019
DOI: 10.1007/978-3-030-10928-8_33
|View full text |Cite
|
Sign up to set email alerts
|

Revisiting Conditional Functional Dependency Discovery: Splitting the “C” from the “FD”

Abstract: Many techniques for cleaning dirty data are based on enforcing some set of integrity constraints. Conditional functional dependencies (CFDs) are a combination of traditional Functional dependencies (FDs) and association rules, and are widely used as a constraint formalism for data cleaning. However, the discovery of such CFDs has received limited attention. In this paper, we regard CFDs as an extension of association rules, and present three general methodologies for (approximate) CFD discovery, each using a d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 20 publications
0
5
0
Order By: Relevance
“…Integrity Constraints Discovery. Due to the importance of ICs, many techniques and systems have been proposed for discovering different kinds of ICs, namely, FDs [26,30,43], temporal FDs [2], differential dependencies [38], conditional differential dependencies [25], CFDs [14,35], and order dependencies [40]. These techniques work on entire values and adapting them to PFD discovery is quite challenging; we have to carefully combine pattern discovery and data dependency discovery, to effectively and efficiently discover PFDs.…”
Section: Related Workmentioning
confidence: 99%
“…Integrity Constraints Discovery. Due to the importance of ICs, many techniques and systems have been proposed for discovering different kinds of ICs, namely, FDs [26,30,43], temporal FDs [2], differential dependencies [38], conditional differential dependencies [25], CFDs [14,35], and order dependencies [40]. These techniques work on entire values and adapting them to PFD discovery is quite challenging; we have to carefully combine pattern discovery and data dependency discovery, to effectively and efficiently discover PFDs.…”
Section: Related Workmentioning
confidence: 99%
“…Discovering data dependencies is an active and long-standing research problem within the database and data mining communities. Indeed, the problem is well-studied for the relational data setting, with volumes of contributions for functional dependencies (FDs) [8,19,44,18] and its numerous extensions (e.g., conditional FDs [7,26,24], distance-based FDs [10,9,14,15,13], etc).…”
Section: Introductionmentioning
confidence: 99%
“…Integrity constraints may be obtained by domain experts; however, this is often an expensive task that requires expertise not only in the domain but also in the constraint language. In the past two decades, extensive effort has been invested in exploring the challenge of automatically discovering constraints from the data itself, for different types of constraints, including the classic Functional Dependencies (FDs) [15,21,23,26,30,34,35,41], the more general Conditional FDs (CFDs) [9,13,38], and the more general Denial Constraints (DCs) [4,11,36,37].…”
Section: Introductionmentioning
confidence: 99%
“…Most of the work to date on approximate constraint discovery has focused on approximate FDs [12,23,25] or CFDs [9,13,38]. Chu et al [11] and later Pena et al [36,37] considered approximate DCs.…”
Section: Introductionmentioning
confidence: 99%