2022
DOI: 10.48550/arxiv.2202.05912
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals

Abstract: In this paper, a data augmentation method is proposed for depression detection from speech signals. Samples for data augmentation were created by changing the frame-width and the frame-shift parameters during the feature extraction process. Unlike other data augmentation methods (such as VTLP, pitch perturbation, or speed perturbation), the proposed method does not explicitly change acoustic parameters but rather the time-frequency resolution of frame-level features. The proposed method was evaluated using two… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 21 publications
0
1
0
Order By: Relevance
“…Among others, acoustic features such as x-vectors [18], i-vectors [19] and other speaker embeddings [20] have been shown to be effective in the diagnosis of a speaker's mental state. These features, however, also carry information about a speaker's identity [21] which can be counter-productive to privacy preservation-a key factor in the adoption of digital mental-health screening systems [22].…”
Section: Introductionmentioning
confidence: 99%
“…Among others, acoustic features such as x-vectors [18], i-vectors [19] and other speaker embeddings [20] have been shown to be effective in the diagnosis of a speaker's mental state. These features, however, also carry information about a speaker's identity [21] which can be counter-productive to privacy preservation-a key factor in the adoption of digital mental-health screening systems [22].…”
Section: Introductionmentioning
confidence: 99%