2018
DOI: 10.1007/978-981-13-3095-7_12
|View full text |Cite
|
Sign up to set email alerts
|

A Hybrid Data Deduplication Approach in Entity Resolution Using Chromatic Correlation Clustering

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
18
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(18 citation statements)
references
References 25 publications
0
18
0
Order By: Relevance
“…The duplicate records are put into clusters and if they don't have a common key or are noisy, then accurate deduplication becomes a challenge. Haruna et al [1], presented a hybrid based data deduplication approach in ER. Where a machine-based system, the Cosine similarity function [11], [12], was first used on sets of data to calculate for the similarity scores between each pair of records using metrics with a set threshold.…”
Section: Introductionmentioning
confidence: 99%
See 4 more Smart Citations
“…The duplicate records are put into clusters and if they don't have a common key or are noisy, then accurate deduplication becomes a challenge. Haruna et al [1], presented a hybrid based data deduplication approach in ER. Where a machine-based system, the Cosine similarity function [11], [12], was first used on sets of data to calculate for the similarity scores between each pair of records using metrics with a set threshold.…”
Section: Introductionmentioning
confidence: 99%
“…Finally, the clusters were submitted to the crowdsourcing platform, for the humans to thoroughly examine the pairs of records, to confirm their equivalence and submit their answers. Based on the crowd's confidence [9] and triangular similarity scores [1], a permanent cluster is either formed, implying the records in it are almost equal, or otherwise not formed.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations