2021
DOI: 10.1016/j.jad.2021.09.018
|View full text |Cite
|
Sign up to set email alerts
|

Comparing machine learning to a rule-based approach for predicting suicidal behavior among adolescents: Results from a longitudinal population-based survey

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(10 citation statements)
references
References 26 publications
1
9
0
Order By: Relevance
“…The fine-tuned model reached an excellent AUC of 92.1% ( 61 ). This figure is near the highest AUC reported in existing literature, ranging from 0.716 to 0.925 ( 23 , 25 , 26 , 28 ). With a high specificity of 0.90 and a high sensitivity of 0.77, the model could correctly classify 77% of adolescents reporting a recent suicide attempt as well as 90% of adolescents not reporting a recent suicide attempt.…”
Section: Discussionsupporting
confidence: 54%
See 1 more Smart Citation
“…The fine-tuned model reached an excellent AUC of 92.1% ( 61 ). This figure is near the highest AUC reported in existing literature, ranging from 0.716 to 0.925 ( 23 , 25 , 26 , 28 ). With a high specificity of 0.90 and a high sensitivity of 0.77, the model could correctly classify 77% of adolescents reporting a recent suicide attempt as well as 90% of adolescents not reporting a recent suicide attempt.…”
Section: Discussionsupporting
confidence: 54%
“…However, modifying the underlying distribution of the outcome breaches the principal machine learning assumption that the training and testing datasets are sampled from the same population ( 45 ). To resolve this issue, previous machine learning studies on suicide attempt prediction have often balanced the testing dataset as well [see for example, ( 26 , 29 )], rendering the model inapplicable to the real-world problem. Simply put, when the distributions of both the training and testing datasets differ from the underlying population distribution, the performance of the model on a real-world dataset with severe class imbalance would be unknown ( 45 ).…”
Section: Methodsmentioning
confidence: 99%
“…Moreover, the process of variable selection enables models to achieve higher accuracy and better generalization capabilities. For example, van Vuuren and colleagues [20] found that LASSO created a model that was able to classify students as at risk for suicide with a higher accuracy than simple inclusion rules (i.e., predicting based on history of suicide alone). Pratik and colleagues [21] utilized Elastic Net to select variables that were able to predict smoking addiction in young adults with higher accuracy than previous research.…”
Section: Variable Selection In Machine Learningmentioning
confidence: 99%
“…In the following paragraphs, we will summarize three major points: First, the most important variables across the three models by far were indicators of previous self-harm. The question arises as to what extent algorithms that can incorporate several hundreds of variables have incremental value over a simple decision rule that classifies every adolescent who ever showed previous self-harming behavior as ''at risk'' (e.g., Van Vuuren et al, 2021). Using such a single-item decision rule (i.e., classifying every 17-year-old who confirmed previous self-harm at 14 years of age as ''at risk''), balanced accuracy was only slightly lower than that in the first model (.74 vs. .76), but sensitivity was substantially lower (.59 vs .69).…”
Section: Important Variables In Predicting Suicide Attemptsmentioning
confidence: 99%