2021
DOI: 10.1186/s12911-021-01550-6
|View full text |Cite
|
Sign up to set email alerts
|

Record linkage under suboptimal conditions for data-intensive evaluation of primary care in Rio de Janeiro, Brazil

Abstract: Background Linking Brazilian databases demands the development of algorithms and processes to deal with various challenges including the large size of the databases, the low number and poor quality of personal identifiers available to be compared (national security number not mandatory), and some characteristics of Brazilian names that make the linkage process prone to errors. This study aims to describe and evaluate the quality of the processes used to create an individual-linked database for … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

3
2

Authors

Journals

citations
Cited by 8 publications
(10 citation statements)
references
References 29 publications
(31 reference statements)
2
8
0
Order By: Relevance
“…By adopting preprocessing routines and considering the relationship time as a function of the number of true pairs detected, the recognition of the best blocking and processing strategies constitutes an advance for the systematic adoption of this method in the daily life of the services, especially when considering sexual orientation and gender identity as exposure variables in future analyses. These procedures resulted in a true-pair detection rate with proportions compatible with those found in other investigations whose outcome variable was the registration of death in SIM (21,35).…”
Section: Discussionsupporting
confidence: 80%
See 1 more Smart Citation
“…By adopting preprocessing routines and considering the relationship time as a function of the number of true pairs detected, the recognition of the best blocking and processing strategies constitutes an advance for the systematic adoption of this method in the daily life of the services, especially when considering sexual orientation and gender identity as exposure variables in future analyses. These procedures resulted in a true-pair detection rate with proportions compatible with those found in other investigations whose outcome variable was the registration of death in SIM (21,35).…”
Section: Discussionsupporting
confidence: 80%
“…The adoption of scores ≥17.9 reduced 56.16% of the pairs to be reviewed, with 90.04% of the pairs being truly negative. In other words, with high specificity (>90.00%) and with the loss of only 25 true pairs erroneously classified as false (98.46% sensitivity), our strategy 1 demonstrated excellent properties and elements compatible with other linkage strategies with Brazilian datasets (21,35). In addition, when incorporating the deaths into the SINAN database, the degrees of agreement for the kappa indicator are high and close to 1 for all the variables of interest in this study, revealing that the adoption of a higher cutoff point (≥17.9) does not imply losses for future analyses.…”
Section: Discussionmentioning
confidence: 67%
“…Coverage and data quality for the city of Rio de Janeiro is high (e.g., 99.7% of all births registered). 38 , 40 We identified all live births to mothers within the Cadastro Único including infants already registered in Cadastro Único and those who were not. Stillbirth records were not available for linkage.…”
Section: Methodsmentioning
confidence: 99%
“…Only infants within the Cadastro Único were linked to hospitalisation records as infant names are not recorded on birth certificates, inhibiting linkage. All datasets were linked via a combination of deterministic and probabilistic approaches (as published elsewhere 38 ) which involved matching name, date of birth and tax numbers using deterministic linkage, phonetic matching, and Levenshtein distance matching.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation