ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414129
|View full text |Cite
|
Sign up to set email alerts
|

Estimating Severity of Depression From Acoustic Features and Embeddings of Natural Speech

Abstract: Major depressive disorder, referred to as depression, is a leading cause of disability, absence from work, and premature death. Automatic assessment of depression from speech is a critical step towards improving diagnosis and treatment of depression. Previous works on depression assessment from speech considered various acoustic features extracted from speech to estimate depression severity. But performance of these approaches is not at clinical standards, and thus requires further improvement. In this work, w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 12 publications
(9 citation statements)
references
References 24 publications
(26 reference statements)
0
7
0
Order By: Relevance
“…Furthermore, the TDNN x-vectors or F-TDNN x-vectors based on MFCC have demonstrated better performance than PLP i-vectors for the automatic detection of PD ( Moro-Velazquez et al, 2020 ). Besides, the x-vector technique has been used as an advanced method for emotion recognition ( Pappagari et al, 2020b ), AD detection ( Pappagari et al, 2020a ), and depression detection ( Dumpala et al, 2021 , 2022 ; Egas-López et al, 2022 ). Consequently, depression detection is carried out in this study using the x-vector approach with the i-vector framework as the baseline.…”
Section: Related Workmentioning
confidence: 99%
“…Furthermore, the TDNN x-vectors or F-TDNN x-vectors based on MFCC have demonstrated better performance than PLP i-vectors for the automatic detection of PD ( Moro-Velazquez et al, 2020 ). Besides, the x-vector technique has been used as an advanced method for emotion recognition ( Pappagari et al, 2020b ), AD detection ( Pappagari et al, 2020a ), and depression detection ( Dumpala et al, 2021 , 2022 ; Egas-López et al, 2022 ). Consequently, depression detection is carried out in this study using the x-vector approach with the i-vector framework as the baseline.…”
Section: Related Workmentioning
confidence: 99%
“…The periodic training strategy defines the indefinite parameter ξ in the training rounds to be adjusted. The specific calculation of ξ is shown in Equation (16), where e n is the total number of training rounds, e i is the ith training round and f c is the adjustment factor.…”
Section: Periodic Focal Loss Functionmentioning
confidence: 99%
“…[15] combines a CNN network with a Growing Self‐organizing Map (GSOM) to identify psychological stress states based on spectral features. [16] proposes a multi‐task TCN learning model to estimate the degree of depression, combined with related tasks such as emotion and emotion recognition to identify depression. [17] proposes a self‐supervised acoustic feature extraction pre‐training network model for depression which uses a convolutional encoding‐decoding structure and combines it with the LSTM network for depression recognition.…”
Section: Introductionmentioning
confidence: 99%
“…DAIC-WOZ [12]. According to [49], even slight differences of the psychological state have the potential to cause a noticeable change in the acoustic domain, which makes audio data a competitive modality to be used as input to depression estimation frameworks [9,10,17,31,39,49,61]. Mel-cepstral features and formantfrequency tracks are two main arousal representations introduced by Williamson et al…”
Section: Related Workmentioning
confidence: 99%