Findings of the Association for Computational Linguistics: EMNLP 2020 2020
DOI: 10.18653/v1/2020.findings-emnlp.337
|View full text |Cite
|
Sign up to set email alerts
|

Do Models of Mental Health Based on Social Media Data Generalize?

Abstract: Proxy-based methods for annotating mental health status in social media have grown popular in computational research due to their ability to gather large training samples. However, an emerging body of literature has raised new concerns regarding the validity of these types of methods for use in clinical applications. To further understand the robustness of distantly supervised mental health models, we explore the generalization ability of machine learning classifiers trained to detect depression in individuals… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

3
41
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
2
2

Relationship

0
6

Authors

Journals

citations
Cited by 37 publications
(44 citation statements)
references
References 69 publications
3
41
0
Order By: Relevance
“…The clinically-annotated datasets that do exist are either proprietary or do not provide a clear mechanism for inquiring about availability. The dearth of large, shareable datasets based on actual clinical diagnoses and medical ground truth is problematic given recent research that calls into question the validity of proxy-based mental health annotations (Ernala et al, 2019;Harrigian et al, 2020). By leveraging privacypreserving technology (e.g.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…The clinically-annotated datasets that do exist are either proprietary or do not provide a clear mechanism for inquiring about availability. The dearth of large, shareable datasets based on actual clinical diagnoses and medical ground truth is problematic given recent research that calls into question the validity of proxy-based mental health annotations (Ernala et al, 2019;Harrigian et al, 2020). By leveraging privacypreserving technology (e.g.…”
Section: Discussionmentioning
confidence: 99%
“…Mental Health Models. We create mental health models for these datasets based on recent work (Harrigian et al, 2020a;. Following standard pre-processing procedures, we filter numeric values, username mentions, retweets and urls from raw tweet text.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations