2020
DOI: 10.2196/22649
|View full text |Cite
|
Sign up to set email alerts
|

Detecting Miscoded Diabetes Diagnosis Codes in Electronic Health Records for Quality Improvement: Temporal Deep Learning Approach

Abstract: Background Diabetes affects more than 30 million patients across the United States. With such a large disease burden, even a small error in classification can be significant. Currently billing codes, assigned at the time of a medical encounter, are the “gold standard” reflecting the actual diseases present in an individual, and thus in aggregate reflect disease prevalence in the population. These codes are generated by highly trained coders and by health care providers but are not always accurate. … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
2
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 38 publications
(47 reference statements)
0
4
0
Order By: Relevance
“…Relatedly, use of ICD10 codes from claims to identify individuals with diabetes and SUDs has limitations. As a result, we may have misclassified individuals by type of diabetes or presence of an SUD [ 54 , 55 ]. Third, in using claims data, we may have missed disease-specific services if SUD or diabetes diagnoses were not recorded.…”
Section: Discussionmentioning
confidence: 99%
“…Relatedly, use of ICD10 codes from claims to identify individuals with diabetes and SUDs has limitations. As a result, we may have misclassified individuals by type of diabetes or presence of an SUD [ 54 , 55 ]. Third, in using claims data, we may have missed disease-specific services if SUD or diabetes diagnoses were not recorded.…”
Section: Discussionmentioning
confidence: 99%
“…In our study, we omitted the use of POS tagging and applied NER embedding only that minimized the code and kept it simple. Our experimental study took inspiration from the experiment run in [38] that predicted miscoded diabetes ICD-10 labels in a large EHR dataset extracted from CERNER Health Facts, a HIPPA-compliant repository maintained by the University of South California.…”
Section: Related Workmentioning
confidence: 99%
“…Data collected from EHRs is sparse, noisy, heterogeneous, and unstructured [56,70,55]. Such data also presents other challenges including the problem of missing data and class imbalance [71,72].…”
Section: Data Preprocessingmentioning
confidence: 99%