2021
DOI: 10.1371/journal.pone.0248360
|View full text |Cite
|
Sign up to set email alerts
|

Integrating human services and criminal justice data with claims data to predict risk of opioid overdose among Medicaid beneficiaries: A machine-learning approach

Abstract: Health system data incompletely capture the social risk factors for drug overdose. This study aimed to improve the accuracy of a machine-learning algorithm to predict opioid overdose risk by integrating human services and criminal justice data with health claims data to capture the social determinants of overdose risk. This prognostic study included Medicaid beneficiaries (n = 237,259) in Allegheny County, Pennsylvania enrolled between 2015 and 2018, randomly divided into training, testing, and validation samp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(11 citation statements)
references
References 49 publications
1
10
0
Order By: Relevance
“…We applied several commonly used methods such as multivariate logistic regression, penalised regression, random forests, and gradient-boosting machine (GBM) to develop and test overdose prediction algorithms. Consistent with previous studies, 3 , 5 GBM yielded the best prediction results (C-statistic of 0·841 for GBM vs up to 0·820 for other methods; appendix p 11 ) with an ability to handle complex interactions between predictors and outcomes. The study’s objective was to externally validate the best-performing algorithm; thus, we focused on reporting the GBM model ( appendix pp 1 – 2 ).…”
Section: Methodssupporting
confidence: 86%
See 3 more Smart Citations
“…We applied several commonly used methods such as multivariate logistic regression, penalised regression, random forests, and gradient-boosting machine (GBM) to develop and test overdose prediction algorithms. Consistent with previous studies, 3 , 5 GBM yielded the best prediction results (C-statistic of 0·841 for GBM vs up to 0·820 for other methods; appendix p 11 ) with an ability to handle complex interactions between predictors and outcomes. The study’s objective was to externally validate the best-performing algorithm; thus, we focused on reporting the GBM model ( appendix pp 1 – 2 ).…”
Section: Methodssupporting
confidence: 86%
“…This study expanded our previous work using machine-learning approaches to improve accuracy of predicting overdose in the subsequent 3 months in a large state Medicaid dataset and broaden applicability of these models across state Medicaid programmes. 5 Our best-performing GBM has several advantages, including handling missing data automatically, no additional feature selection process required prior to the GBM modelling, greater flexibility in hyper parameter tuning to include complex interactions between predictors and outcomes, and often providing better performance compared with other approaches. 3 , 5 We acknowledge, however, that the flexibility during model tuning can be time-consuming and computationally expensive.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…This plethora of data can aid with in-depth phenotyping, including developing predictive models to identify at-risk individuals [21,33,34,58,[65][66][67][68][69][70][71]. The scalability of these big data efforts can allow for the identification of novel clusters and etiologies for OUD [68].…”
Section: Health Systems-based Cohortsmentioning
confidence: 99%