2013
DOI: 10.14778/2536360.2536363
|View full text |Cite
|
Sign up to set email alerts
|

The LLUNATIC data-cleaning framework

Abstract: Data-cleaning (or data-repairing) is considered a crucial problem in many database-related tasks. It consists in making a database consistent with respect to a set of given constraints. In recent years, repairing methods have been proposed for several classes of constraints. However, these methods rely on ad hoc decisions and tend to hard-code the strategy to repair conflicting values. As a consequence, there is currently no general algorithm to solve database repairing problems that involve different kinds of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
84
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 141 publications
(87 citation statements)
references
References 23 publications
0
84
0
Order By: Relevance
“…Many approaches [6,20,4,9,10,19,16] tackle the problem of detecting and repairing dirty data based on predefined data quality rules. [9] proposes a way to combine multiple rules together and to perform data cleaning work holistically.…”
Section: Related Workmentioning
confidence: 99%
“…Many approaches [6,20,4,9,10,19,16] tackle the problem of detecting and repairing dirty data based on predefined data quality rules. [9] proposes a way to combine multiple rules together and to perform data cleaning work holistically.…”
Section: Related Workmentioning
confidence: 99%
“…Several algorithms have been proposed for repairing inconsistent data, mainly based on declarative data quality rules (such as in Σ D ) [1,10,3]. These rules naturally have a static semantics for violation detection (as described above) and a dynamic semantics to remove them.…”
Section: Target Errors Detectionmentioning
confidence: 99%
“…Data cleaning focuses on detecting and repairing errors on a database by using declarative constraints [3,15,7,10,1]. In our target Error module, we can use any of these algorithms.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In every instance of an incomplete input, our traditional database instincts tell us that the solution is to fix the problem: we should either replicate the data sources comprising the distributed system or make them more reliable; we should add replication and failover to the nodes of our parallel DBMS; or we should embark on data cleaning and repairing efforts to fix the incomplete tables and views in the single node system [4,5,7,10,14,16,20,30,[33][34][35][36]. However, these solutions can be either financially costly, performance hindering, or both.…”
Section: Introductionmentioning
confidence: 99%