2021
DOI: 10.2196/23456
|View full text |Cite
|
Sign up to set email alerts
|

Natural Language Processing of Clinical Notes to Identify Mental Illness and Substance Use Among People Living with HIV: Retrospective Cohort Study

Abstract: Background Mental illness and substance use are prevalent among people living with HIV and often lead to poor health outcomes. Electronic medical record (EMR) data are increasingly being utilized for HIV-related clinical research and care, but mental illness and substance use are often underdocumented in structured EMR fields. Natural language processing (NLP) of unstructured text of clinical notes in the EMR may more accurately identify mental illness and substance use among people living with HIV… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
14
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 26 publications
(15 citation statements)
references
References 35 publications
(50 reference statements)
1
14
0
Order By: Relevance
“…Our findings suggest that, although substance use and mental health screening was occurring prior to TAPS/AOQ implementation, screening practices were highly variable, which is consistent with previous reports of substance use and mental health screening in similar settings [ 16 , 17 ]. As noted above, standard KPNC practice already routinely screened all primary care patients for alcohol and tobacco use [ 46 , 47 ], but the TAPS/AOQ enabled screening for tobacco, alcohol, and all major drug classes in a single instrument.…”
Section: Discussionsupporting
confidence: 89%
See 1 more Smart Citation
“…Our findings suggest that, although substance use and mental health screening was occurring prior to TAPS/AOQ implementation, screening practices were highly variable, which is consistent with previous reports of substance use and mental health screening in similar settings [ 16 , 17 ]. As noted above, standard KPNC practice already routinely screened all primary care patients for alcohol and tobacco use [ 46 , 47 ], but the TAPS/AOQ enabled screening for tobacco, alcohol, and all major drug classes in a single instrument.…”
Section: Discussionsupporting
confidence: 89%
“…Routine screening is essential for identifying these comorbidities [10][11][12][13], but is often underutilized due to lack of resources, time constraints, and stigma [14,15]. When screening does occur, there is variability in question content, frequency, and documentation by providers [16,17], and patients often underreport symptoms, particularly alcohol and other drug use problems [15,18].…”
mentioning
confidence: 99%
“…Hospitals and health systems have made, and continue to make, substantial investments in their EMR systems. Although a systematic collection of salient medical and social data remains a work in progress, successful efforts using NLP algorithm have enabled efficient mining of rich free-text medical notes for various risk assessment or decision-making tools aimed at reducing the occurrences of adverse health events and wasteful spending [22][23][24]. Our study aligns with this work to identify caregiver availability for patients whose well-being depends on caregivers.…”
Section: Comparison With Prior Workmentioning
confidence: 65%
“…Our effectiveness results agree with the literature [ 83 , 88 ], in which a Macro-F1 score >80% is considered a successful extraction of medical records. Even though there is still a need to cover more tasks related to ICHOM patient-reported outcome measures [ 3 , 74 , 76 , 85 ], we hypothesized that these tasks comprise a feeling state, and the lack of normalization of data contained in EMRs may explain the fact that these task categories did not perform very well [ 70 , 89 ]. Medical records related to baseline characteristics and care processes typically contain much more structured data (eg, numerical values for tasks) than medical patient-reported outcomes, which focus more on unstructured data [ 83 , 90 ].…”
Section: Discussionmentioning
confidence: 99%