Objective Social determinants of health (SDoH) are nonclinical dispositions that impact patient health risks and clinical outcomes. Leveraging SDoH in clinical decision-making can potentially improve diagnosis, treatment planning, and patient outcomes. Despite increased interest in capturing SDoH in electronic health records (EHRs), such information is typically locked in unstructured clinical notes. Natural language processing (NLP) is the key technology to extract SDoH information from clinical text and expand its utility in patient care and research. This article presents a systematic review of the state-of-the-art NLP approaches and tools that focus on identifying and extracting SDoH data from unstructured clinical text in EHRs. Materials and Methods A broad literature search was conducted in February 2021 using 3 scholarly databases (ACL Anthology, PubMed, and Scopus) following Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. A total of 6402 publications were initially identified, and after applying the study inclusion criteria, 82 publications were selected for the final review. Results Smoking status (n = 27), substance use (n = 21), homelessness (n = 20), and alcohol use (n = 15) are the most frequently studied SDoH categories. Homelessness (n = 7) and other less-studied SDoH (eg, education, financial problems, social isolation and support, family problems) are mostly identified using rule-based approaches. In contrast, machine learning approaches are popular for identifying smoking status (n = 13), substance use (n = 9), and alcohol use (n = 9). Conclusion NLP offers significant potential to extract SDoH data from narrative clinical notes, which in turn can aid in the development of screening tools, risk prediction models, and clinical decision support systems.
Objective: To identify depression subphenotypes from Electronic Health Records (EHRs) using machine learning methods, and analyze their characteristics with respect to patient demographics, comorbidities, and medications. Materials and Methods: Using EHRs from the INSIGHT Clinical Research Network (CRN) database, multiple machine learning (ML) algorithms were applied to analyze 11 275 patients with depression to discern depression subphenotypes with distinct characteristics. Results: Using the computational approaches, we derived three depression subphenotypes: Phenotype_A (n = 2791; 31.35%) included patients who were the oldest (mean (SD) age, 72.55 (14.93) years), had the most comorbidities, and took the most medications. The most common comorbidities in this cluster of patients were hyperlipidemia, hypertension, and diabetes. Phenotype_B (mean (SD) age, 68.44 (19.09) years) was the largest cluster (n = 4687; 52.65%), and included patients suffering from moderate loss of body function. Asthma, fibromyalgia, and Chronic Pain and Fatigue (CPF) were common comorbidities in this subphenotype. Phenotype_C (n = 1452; 16.31%) included patients who were younger (mean (SD) age, 63.47 (18.81) years), had the fewest comorbidities, and took fewer medications. Anxiety and tobacco use were common comorbidities in this subphenotype. Conclusion: Computationally deriving depression subtypes can provide meaningful insights and improve understanding of depression as a heterogeneous disorder. Further investigation is needed to assess the utility of these derived phenotypes to inform clinical trial design and interpretation in routine patient care.
Substance use disorders (SUDs) commonly co-occur with mental illness. However, the ongoing addiction crisis raises the question of how opioid use disorder (OUD) impacts healthcare utilization relative to other SUDs. This study examines the utilization patterns of patients with major depressive disorder (MDD) and: (1) co-occurring OUD (MDD-OUD); (2) a co-occurring SUD other than OUD (MDD-NOUD); and (3) no co-occurring SUD (MDD-NSUD). We analyzed electronic health records (EHRs) derived from multiple health systems across the New York City (NYC) metropolitan area between January 2008 and December 2017. 11,275 patients aged ≥18 years with a gap of 30–180 days between 2 consecutive MDD diagnoses and an antidepressant prescribed 0–180 days after any MDD diagnosis were selected, and prevalence of any SUD was 24%. Individuals were stratified into comparison groups and matched on age, gender, and select underlying comorbidities. Prevalence rates and encounter frequencies were measured and compared across outpatient, inpatient, and emergency department (ED) settings. Our key findings showed that relative to other co-occurring SUDs, OUD was associated with larger increases in the rates and odds of using substance-use-related services in all settings, as well as services that integrate mental health and substance abuse treatments in inpatient and ED settings. OUD was also associated with larger increases in total encounters across all settings. These findings and our proposed policy recommendations could inform efforts towards targeted OUD interventions, particularly for individuals with underlying mental illness whose treatment and recovery are often more challenging.
Highlights Longitudinal questionnaire data were used to measure antidepressant treatment outcome. Machine learning models were used to predict outcome from electronic health records. The gradient boosting decision tree model achieved the best predictive results. Diagnostic codes and baseline severity were strong predictors of treatment outcome.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.