2022
DOI: 10.1109/tkde.2020.2991063
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Entity Resolution With Blocking and Graph Algorithms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
11
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 31 publications
0
11
0
Order By: Relevance
“…Others aim to improve the quality of human labelers [45,54]. Alternative research directions aim to surpass the need for humans in-the-loop by introducing unsupervised learning solutions [24,52,57] and transfer learning methods [22,47] or limiting the amount of labels for training using active learning mechanisms [21,23,30] and few-shot learning [23,55]. In this work we analyze and aim to improve the quality of labels provided in a crowdsourcing setting.…”
Section: Related Workmentioning
confidence: 99%
“…Others aim to improve the quality of human labelers [45,54]. Alternative research directions aim to surpass the need for humans in-the-loop by introducing unsupervised learning solutions [24,52,57] and transfer learning methods [22,47] or limiting the amount of labels for training using active learning mechanisms [21,23,30] and few-shot learning [23,55]. In this work we analyze and aim to improve the quality of labels provided in a crowdsourcing setting.…”
Section: Related Workmentioning
confidence: 99%
“…Unsupervised ER approaches [5], [46], [49] are designed to perform ER without labeling. ZeroER [46] learns the match and mismatch distributions based on Gaussian Mixture Models.…”
Section: Related Workmentioning
confidence: 99%
“…EMBDI [5] performs ER by learning a compact graph-based representation for each tuple. ITER+CliqueRank [49] first constructs a bipartite graph to model the relationship between tuple pairs, and then develops an iterative-based ranking algorithm to estimate the similarity of tuple pairs. Despite the benefit of zero label requirement, unsupervised approaches are highly errorsensitive and may suffer from poor ER results when errors are contained in datasets.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Generally, unsupervised entity resolution systems resort to matching threshold setting and end the entity matching process at that stage, such as in [13]. Other systems further expand the pipeline to identify entity profiles generalizing the entity matching output to more than two entity clusters.…”
Section: Introductionmentioning
confidence: 99%