ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9054323
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting Vocal Tract Coordination Using Dilated CNNS For Depression Detection In Naturalistic Environments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 31 publications
(27 citation statements)
references
References 27 publications
0
22
0
Order By: Relevance
“…Previous studies in depression prediction using speech [15,16] have shown the superiority of MFCCs over other audio based features like extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS) [7] and DEEP SPECTRUM features [1]. Huang et al [9] showed with their depression classification study that coordination features computed from MFCCs perform better with respect to formants and eGeMAPS features. So to compare how robust and effective the TVs are for detecting schizophrenia, we chose MFCCs as the baseline audio features for our study.…”
Section: Mel-frequency Cepstral Coefficients (Mfccs)mentioning
confidence: 99%
See 3 more Smart Citations
“…Previous studies in depression prediction using speech [15,16] have shown the superiority of MFCCs over other audio based features like extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS) [7] and DEEP SPECTRUM features [1]. Huang et al [9] showed with their depression classification study that coordination features computed from MFCCs perform better with respect to formants and eGeMAPS features. So to compare how robust and effective the TVs are for detecting schizophrenia, we chose MFCCs as the baseline audio features for our study.…”
Section: Mel-frequency Cepstral Coefficients (Mfccs)mentioning
confidence: 99%
“…Huang et al [9] in a recent study with MDD introduces a new channel delay correlation method inspired by TDEC, which uses a different correlation structure with correlations starting from 0 to a delay of 'D' frames (a design choice). The delayed autocorrelations and cross-correlations across channels are stacked to form the FVTC correlation structure.…”
Section: Full Vocal Tract Coordination (Fvtc)mentioning
confidence: 99%
See 2 more Smart Citations
“…Sample features include voice quality [17] [16], articulation [18] [19] [20], speech rate [19], and spectral [9] features. Advances in deep learning [21] have led to improved results in a range of affective and behavioral health tasks [22][23] [24][25] [26][27] [28]. In deep learning the focus is to learn feature representation from data.…”
Section: Introductionmentioning
confidence: 99%