Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data 2011
DOI: 10.1145/1989323.1989373
|View full text |Cite
|
Sign up to set email alerts
|

Interaction between record matching and data repairing

Abstract: Central to a data cleaning system are record matching and data repairing. Matching aims to identify tuples that refer to the same real-world object, and repairing is to make a database consistent by fixing errors in the data by using constraints. These are treated as separate processes in current data cleaning systems, based on heuristic solutions. This paper studies a new problem, namely, the interaction between record matching and data repairing. We show that repairing can effectively help us identify matche… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
54
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 87 publications
(54 citation statements)
references
References 35 publications
0
54
0
Order By: Relevance
“…Each source has schema and instance level. Then approaches Record Matching and Data repairing (Fan, Ma et al 2014) have been presented, that are used to eliminate data de-duplication and enhance the quality of data. With the help of these two approaches designers are able to -clean‖ the data, which is further utilized for strategic decision making, thus enhancing the quality of data warehouse as well.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…Each source has schema and instance level. Then approaches Record Matching and Data repairing (Fan, Ma et al 2014) have been presented, that are used to eliminate data de-duplication and enhance the quality of data. With the help of these two approaches designers are able to -clean‖ the data, which is further utilized for strategic decision making, thus enhancing the quality of data warehouse as well.…”
Section: Discussionmentioning
confidence: 99%
“…However, data profiling techniques still need to be addressed as they are a necessary part in cleaning process. On the other hand, cleaning of corrupted data requires some iterative and probabilistic models as they can efficiently clean the data (Fan, Ma et al 2014). There are certain limitations to the present work, cleaning of multiple relations that involve dependencies is much needed to be explored in future.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Preferred values can be found from, e.g., master data [24], tuple-certainty and value-accuracy [19], freshness and currency [17], just to name a few.…”
Section: Introductionmentioning
confidence: 99%