2017
DOI: 10.1186/s12911-017-0478-5

Evaluating privacy-preserving record linkage using cryptographic long-term keys and multibit trees on large medical datasets

Abstract: Background: Integrating medical data using databases from different sources by record linkage is a powerful technique increasingly used in medical research. Under many jurisdictions, unique personal identifiers needed for linking the records are unavailable. Since sensitive attributes, such as names, have to be used instead, privacy regulations usually demand encrypting these identifiers. The corresponding set of techniques for privacy-preserving record linkage (PPRL) has received widespread attention. One recen…

Citations: cited by 22 publications (20 citation statements)
References: 37 publications
“…The number of hashes computed for each bigram in each field depended on the weight of the field, as well as the average length of the field. Address information was not added to record-level Bloom filters as preliminary testing indicated reduced linkage quality when these fields were included; previous research has also noted this issue [17]. The middle name field was also excluded due to its high proportion of missing values.…”
Section: Linkage Methods (mentioning)
confidence: 99%
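The field-weighted encoding described in the statement above can be illustrated with a minimal sketch. It is not the cited authors' implementation: the rule in `hashes_for_field`, the filter length, the keyed hash (HMAC-SHA256), the decision to include the field name in the hashed message, and all field names and parameter values are assumptions made for illustration only.

```python
import hashlib
import hmac


def bigrams(value):
    """Split a padded, lower-cased string into overlapping 2-character tokens."""
    padded = f"_{value.strip().lower()}_"
    return [padded[i:i + 2] for i in range(len(padded) - 1)]


def hashes_for_field(weight, avg_length, base=10.0):
    """Hypothetical rule: more hashes for high-weight fields, fewer for long fields."""
    return max(1, round(base * weight / avg_length))


def encode_record(record, field_params, filter_length=1000, secret=b"shared-secret"):
    """Return the set bit positions of a record-level Bloom filter.

    field_params maps field name -> (weight, average length). Fields that are
    absent from field_params (for example address or middle name) are not encoded.
    """
    bits = set()
    for field, (weight, avg_length) in field_params.items():
        value = record.get(field)
        if not value:
            continue  # missing values contribute no bits
        k = hashes_for_field(weight, avg_length)
        for gram in bigrams(value):
            for i in range(k):
                message = f"{field}|{i}|{gram}".encode()
                digest = hmac.new(secret, message, hashlib.sha256).digest()
                bits.add(int.from_bytes(digest[:8], "big") % filter_length)
    return bits


def dice(a, b):
    """Dice coefficient of two bit-position sets."""
    total = len(a) + len(b)
    return 2 * len(a & b) / total if total else 0.0


# Assumed (weight, average length) values; not taken from the cited study.
params = {"first_name": (1.0, 6.0), "last_name": (1.5, 7.0)}
rec_a = encode_record({"first_name": "Anna", "last_name": "Schmidt"}, params)
rec_b = encode_record({"first_name": "Anna", "last_name": "Schmitt"}, params)
rec_c = encode_record({"first_name": "Boris", "last_name": "Ivanov"}, params)
print(dice(rec_a, rec_b), dice(rec_a, rec_c))
```

Leaving a field out of `params` mirrors the exclusion of address and middle-name fields mentioned in the statement: an excluded field sets no bits and therefore cannot raise or lower the similarity score.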
“…PPRL techniques may be particularly useful given the reticence of many custodians to provide person identifiers for linkage given privacy concerns. The CDL, along with a number of other groups, have played a key role in developing, implementing and popularising techniques allowing record linkage to occur on encoded personal identifiers [13][14][15][16].…”
Section: From Research To Practice: Privacy-preserving Record Linkage (mentioning)
confidence: 99%
“…The method provides strong protection as the encoding process is irreversible and the encoded output is distorted to the extent that accidental recognition of an individual is impossible. The Bloom Filter encoding means that the matching can be carried out within the context of a traditional Fellegi-Sunter probabilistic linkage [13][14][15][16]. These techniques have now been deployed for project-based linkage, with the CDL using privacy-preserving record linkage methods for a number of real-world projects, including an NHMRC-funded project investigating the continuity of care provided by primary and secondary health services and a more recent project linking health and non-health records to create a Social Investment Data Resource (described below).…”
Section: From Research To Practice: Privacy-preserving Record Linkage (mentioning)
confidence: 99%
“…GRLC applied for research grants with all three groups. Due to these collaborative efforts, two joint papers have been published so far (Brown et al, 2017; Christen et al, 2017).…”
Section: International Cooperation (mentioning)
confidence: 99%
“…Combining external blocks such as year of birth with multi-bit trees allows for privacy-preserving linkage of two census-scale data sets within a few hours (Schnell, 2014a). For most applications, this solution is sufficient with regard to speed, accuracy and privacy (Brown et al, 2017). Therefore, this combination is provided with the record-linkage software of the GRLC (see Section 2.2).…”
Section: High-speed PPRL (mentioning)
confidence: 99%
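The combination of an external block with a similarity search, as described in the statement above, can be sketched as follows. This is an illustrative simplification and not the GRLC software: it does not build a multibit tree. Instead it blocks on year of birth and applies the popcount bound that such trees exploit, namely that the Tanimoto similarity of two bit sets can never exceed the ratio of the smaller to the larger number of set bits. The record layout, function names, and threshold are assumptions.

```python
from collections import defaultdict


def tanimoto(a, b):
    """Tanimoto (Jaccard) similarity of two sets of bit positions."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def link_blocked(records_a, records_b, threshold=0.85):
    """Compare only records sharing a year of birth.

    Each record is a tuple (record_id, year_of_birth, frozenset_of_bit_positions).
    Returns a list of (id_a, id_b) pairs whose similarity reaches the threshold.
    """
    # External blocking: index the second file by year of birth.
    blocks = defaultdict(list)
    for rec_id, year, bits in records_b:
        blocks[year].append((rec_id, bits))

    matches = []
    for id_a, year, bits_a in records_a:
        for id_b, bits_b in blocks.get(year, []):
            small, large = sorted((len(bits_a), len(bits_b)))
            # Popcount bound: tanimoto <= small / large, so skip hopeless pairs
            # without computing the intersection.
            if large == 0 or small / large < threshold:
                continue
            if tanimoto(bits_a, bits_b) >= threshold:
                matches.append((id_a, id_b))
    return matches


file_a = [
    ("a1", 1980, frozenset(range(20))),
    ("a2", 1975, frozenset(range(20))),
]
file_b = [
    ("b1", 1980, frozenset(range(1, 21))),  # same block, 19 of 21 bits shared
    ("b2", 1990, frozenset(range(20))),     # identical bits, different block
    ("b3", 1980, frozenset(range(5))),      # same block, pruned by the bound
]
print(link_blocked(file_a, file_b))
```

Blocking restricts comparisons to records that agree on year of birth, which reduces the number of candidate pairs but also means that records with a wrong or missing year of birth can never be matched.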