2018 IEEE 34th International Conference on Data Engineering (ICDE) 2018
DOI: 10.1109/icde.2018.00107

Enabling Quality Control for Entity Resolution: A Human and Machine Cooperation Framework

Abstract: Even though many machine algorithms have been proposed for entity resolution, it remains very challenging to find a solution with quality guarantees. In this paper, we propose a novel HUman and Machine cOoperation (HUMO) framework for entity resolution (ER), which divides an ER workload between the machine and the human. HUMO enables a mechanism for quality control that can flexibly enforce both precision and recall levels. We introduce the optimization problem of HUMO, minimizing human cost given a quality re…

Citations: cited by 6 publications (14 citation statements)
References: 28 publications
“…The r-HUMO framework is built on the recently proposed HUMO framework [14], [15], which can enforce quality guarantees at both precision and recall fronts. The general idea of HUMO and r-HUMO was similar to the Fellegi-Sunter theory of record linking [3], which also proposed to divide an ER workload into three parts based on match probability.…”
Section: Related Work (mentioning)
confidence: 99%
“…the set of instance pairs with the feature f For presentation simplicity, we summarize the frequently used notations in Table 1. Formally, we define the problem of entity resolution with quality guarantees [14], [15] as follows:…”
Section: Notation (mentioning)
confidence: 99%