2014 IEEE 30th International Conference on Data Engineering 2014
DOI: 10.1109/icde.2014.6816654
|View full text |Cite
|
Sign up to set email alerts
|

Mapping and cleaning

Abstract: We address the challenging and open problem of bringing together two crucial activities in data integration and data quality, i.e., transforming data using schema mappings, and fixing conflicts and inconsistencies using data repairing. This problem is made complex by several factors. First, schema mappings and data repairing have traditionally been considered as separate activities, and research has progressed in a largely independent way in the two fields. Second, the elegant formalizations and the algorithms… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
41
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
2
2

Relationship

2
7

Authors

Journals

citations
Cited by 35 publications
(41 citation statements)
references
References 24 publications
0
41
0
Order By: Relevance
“…Line 13 ensures that µ is correctly updated even if several EGDs are applied in a single iteration. Since both EGDs and TGDs can be applied in an iteration, both I and N must be updated at the iteration end (lines [14][15][16]. When h(x i ) and h(x j ) are both labeled nulls in line 10, we can replace either value with the other.…”
Section: Implementing the Chasementioning
confidence: 99%
“…Line 13 ensures that µ is correctly updated even if several EGDs are applied in a single iteration. Since both EGDs and TGDs can be applied in an iteration, both I and N must be updated at the iteration end (lines [14][15][16]. When h(x i ) and h(x j ) are both labeled nulls in line 10, we can replace either value with the other.…”
Section: Implementing the Chasementioning
confidence: 99%
“…Again, this is an important example but only evaluates one dimension. Geerts et al [2014] address two data quality issues: transforming data using schema mappings and fixing data conflicts and inconsistencies. The example mapping provided describes a source database that contains data on Treatments and Physicians and a second target database detailing Prescriptions and Doctors.…”
Section: Related Workmentioning
confidence: 99%
“…Despite the fact that ++Spicy is based on pure SQL, we have shown that our custom chase exhibits better performance on large instances in our tested scenarios. The Llunatic mapping and cleaning system [7] is tackling the problem of mapping and cleaning in the same chase step. The Llunatic chase efficiently covers a range of scenarios, such as repairing source data by also exploiting user feedback, that are not covered in our algorithm.…”
Section: Related Workmentioning
confidence: 99%
“…Many existing approaches have aimed at rewriting the dependencies, so as to further leverage the power of an external, general evaluation engine, such as a traditional RDBMS [17,15] or a Datalog engine [12,10]. Alternatively, custom chase engines have been used and optimized for computing DE solutions [7,16].…”
Section: Introductionmentioning
confidence: 99%