2012
DOI: 10.1016/j.jbi.2011.10.006
|View full text |Cite
|
Sign up to set email alerts
|

A transparent and transportable methodology for evaluating Data Linkage software

Abstract: There has been substantial growth in Data Linkage (DL) activities in recent years. This reflects growth in both the demand for, and the supply of, linked or linkable data. Increased utilisation of DL "services" has brought with it increased need for impartial information about the suitability and performance capabilities of DL software programs and packages. Although evaluations of DL software exist; most have been restricted to the comparison of two or three packages. Evaluations of a large number of packages… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
38
0
2

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 37 publications
(40 citation statements)
references
References 20 publications
0
38
0
2
Order By: Relevance
“…Each dataset was de-duplicated using a single probabilistic linkage strategy, based on a previously published ‘default’ linkage strategy [31] (no linkages were conducted between any of the four datasets). This default strategy utilised two sets of blocks (Soundex of surname concatenated with first initial, and full date of birth), with all available variables used in comparisons.…”
Section: Methodsmentioning
confidence: 99%
“…Each dataset was de-duplicated using a single probabilistic linkage strategy, based on a previously published ‘default’ linkage strategy [31] (no linkages were conducted between any of the four datasets). This default strategy utilised two sets of blocks (Soundex of surname concatenated with first initial, and full date of birth), with all available variables used in comparisons.…”
Section: Methodsmentioning
confidence: 99%
“…Matching strategies used for the datasets were based on the strategies used in a published evaluation of linkage software [25]. Two blocking strategies were used; last name Soundex with first name initial, and date of birth with sex.…”
Section: Methodsmentioning
confidence: 99%
“…Currently there are a range of desktop applications that perform this function and although these are usually easy to implement and use, they can struggle to handle medium (>1 million) and large scale (>10 million) linkages [27]. Few, if any, commercial packages exist which have the capacity and functionality to undertake on-going record linkage.…”
Section: Methodsmentioning
confidence: 99%