2021
DOI: 10.1080/10618600.2020.1825451
|View full text |Cite
|
Sign up to set email alerts
|

d-blink: Distributed End-to-End Bayesian Entity Resolution

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
23
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(24 citation statements)
references
References 44 publications
1
23
0
Order By: Relevance
“…Handing out multiple posterior sets of matched entities may be impractical, together with the associated variables needed for analysis, especially if the analysis requires a large number of posterior draws. Although there are improvements in the direction of scalability (Marchant et al, 2019), there still does not exist any reported Bayesian linkage application to files of the size of a population census. Goldstein et al (2012) and Gutman et al (2015) apply multiple imputation techniques to analysis of linkage data, which do not handle the problem of linkage data structure like the other Bayesian methods above.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Handing out multiple posterior sets of matched entities may be impractical, together with the associated variables needed for analysis, especially if the analysis requires a large number of posterior draws. Although there are improvements in the direction of scalability (Marchant et al, 2019), there still does not exist any reported Bayesian linkage application to files of the size of a population census. Goldstein et al (2012) and Gutman et al (2015) apply multiple imputation techniques to analysis of linkage data, which do not handle the problem of linkage data structure like the other Bayesian methods above.…”
Section: Related Workmentioning
confidence: 99%
“…Handing out multiple posterior sets of matched entities may be impractical, together with the associated variables needed for analysis, especially if the analysis requires a large number of posterior draws. Although there are improvements in the direction of scalability (Marchant et al., 2019), there still does not exist any reported Bayesian linkage application to files of the size of a population census.…”
Section: Introductionmentioning
confidence: 99%
“…We thank Professor Murray for this observation as we believe that the uniform prior on the linkage structure (or label space) may not be adequate in some record linkage problems where N is fixed and given, since prior uncertainty may be too low. Nevertheless, it has been shown to work quite well in a number of situations, where N is not known, namely, in the work of Marchant et al (2019) on an application to the United States Census Bureau.…”
Section: Prior Modeling and Identifiabilitymentioning
confidence: 99%
“…Moreover, the prior probability that two records are co-referent can be shown to be B n−1 /B n . In fact, if we turn to the recent literature on Bayesian record linkage that take a clustering approach, most assume a uniform prior on the linkage structure or co-reference matrix Marchant et al, 2019;McVeigh et al, 2020). Therefore, we ask why these particular assumptions should be considered less dangerous than ours?…”
Section: Prior Modeling and Identifiabilitymentioning
confidence: 99%
See 1 more Smart Citation