2016
DOI: 10.1177/0962280215626180
|View full text |Cite
|
Sign up to set email alerts
|

Automated linkage of patient records from disparate sources

Abstract: We introduce an automated method of record linkage that has two key features, automated selection of match field interactions to include in the model for estimation and automated threshold determination for classifying record pairs to matches or non-matches. We applied our method to two real-world examples. The first example demonstrated results consistent with our earlier work: When data quality is adequate and the match field discriminating power is high, matching algorithms exhibit similar performance. The … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 23 publications
0
5
0
Order By: Relevance
“…Prior research has demonstrated its impaired performance when conditional dependence exists, as well as the potential gain in matching accuracy when conditional dependence latent class models are used (Xu et al, 2019). However, the success of the conditional dependence models is heavily dependent on the use of correct conditional dependence structure (Li et al, 2018). Existing approaches for the identification of the conditional dependence structure, including the correlation residual plot, the log‐odds ratio check, and the bivariate residual approach, have been shown to have poor performance (Oberski et al, 2013; Subtil et al, 2012).…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Prior research has demonstrated its impaired performance when conditional dependence exists, as well as the potential gain in matching accuracy when conditional dependence latent class models are used (Xu et al, 2019). However, the success of the conditional dependence models is heavily dependent on the use of correct conditional dependence structure (Li et al, 2018). Existing approaches for the identification of the conditional dependence structure, including the correlation residual plot, the log‐odds ratio check, and the bivariate residual approach, have been shown to have poor performance (Oberski et al, 2013; Subtil et al, 2012).…”
Section: Discussionmentioning
confidence: 99%
“…Conditional dependence among matching fields may exist in one or both latent classes. When it does, the FS model provides an inadequate fit to the data, yields biassed parameter estimates, and produces impaired matching accuracy (Li et al, 2018; Xu et al, 2019).…”
Section: Proposed Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The INPC is routinely used for health services and public health research [ 16 ], including prior studies on STIs [ 17 – 20 ]. Probabilistic matching techniques, described previously [ 21 , 22 ], were used to match the mother’s social security number, last name, first name, date of birth, and gender from the birth certificate to her medical records.…”
Section: Methodsmentioning
confidence: 99%
“…Synthetic data is widely used in testing, validating, and evaluating different data linkage methods, frameworks, and algorithms either as the primary dataset or comparison dataset [30, [54][55][56][57]. A research project compared the performance of different algorithms in terms of linkage accuracy and speed using nine million synthetic records [58].…”
Section: Plos Digital Healthmentioning
confidence: 99%