2018
DOI: 10.2196/10436
|View full text |Cite
|
Sign up to set email alerts
|

Where No Universal Health Care Identifier Exists: Comparison and Determination of the Utility of Score-Based Persons Matching Algorithms Using Demographic Data

Abstract: BackgroundA universal health care identifier (UHID) facilitates the development of longitudinal medical records in health care settings where follow up and tracking of persons across health care sectors are needed. HIV case-based surveillance (CBS) entails longitudinal follow up of HIV cases from diagnosis, linkage to care and treatment, and is recommended for second generation HIV surveillance. In the absence of a UHID, records matching, linking, and deduplication may be done using score-based persons matchin… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
8
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 12 publications
2
8
0
Order By: Relevance
“…Additionally, newly diagnosed persons are not assigned unique identifiers (a comprehensive care clinic number) until they receive care, and the current unique identifier is facility based, making matching and deduplication of cases more difficult, hence some repeat testers may have been missed. However, in the absence of unique identifiers, we utilized Jaro-Winkler score-based persons matching algorithm to match and de-duplicate records (Waruru et al, 2018). Limitations of the pilot system were data abstraction occurring six months after initial entry at testing and care facilities, which limits the public health response, and that data abstraction from paper records is resource intensive.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Additionally, newly diagnosed persons are not assigned unique identifiers (a comprehensive care clinic number) until they receive care, and the current unique identifier is facility based, making matching and deduplication of cases more difficult, hence some repeat testers may have been missed. However, in the absence of unique identifiers, we utilized Jaro-Winkler score-based persons matching algorithm to match and de-duplicate records (Waruru et al, 2018). Limitations of the pilot system were data abstraction occurring six months after initial entry at testing and care facilities, which limits the public health response, and that data abstraction from paper records is resource intensive.…”
Section: Discussionmentioning
confidence: 99%
“…Matching and de-duplication of cases were performed using a probabilistic matching algorithm (Waruru et al, 2018). Variables used were Soundex of first name (a phonetic algorithm for indexing names by sound as the way they are pronounced in English), secondary double metaphone of the middle name (where the middle name was available), secondary double metaphone of last name, the first character of sex at birth, and year of birth.…”
Section: Methodsmentioning
confidence: 99%
“…Text-based matching is often associated with multiple registration: one patient may have multiple IDs or one ID may be associated with multiple individuals or incomplete registration. Misidentification not only compromises patient care at the individual level but also limits utilization of surveillance data from routine programme data due to inaccuracies [20]. Double-registration could lead to overestimation of HIV incidence, with HIV re-testers being counted as newly diagnosed cases.…”
Section: Introductionmentioning
confidence: 99%
“…Instead, employing longitudinal information, programs should adopt sequential time-defined patient-level “cohort” reporting, which can be used to assess LTC changes over time and to monitor LTC in populations who may be less likely to achieve full LTC (e.g., adolescents) [69]. Where universal national health identifiers are not available, cohort-based reporting within a geographic area (e.g., district or province) may be assisted by electronic health record data capture at all service entry points along the LTC pathway [70] and the use of probabilistic patient matching algorithms to identify silent transfers, “side door” LTC pathway entry [47], and minimize duplicate patient records [71].…”
Section: Monitoring and Measuring Linkage To Carementioning
confidence: 99%