Interspeech 2022 2022
DOI: 10.21437/interspeech.2022-11480
|View full text |Cite
|
Sign up to set email alerts
|

Discriminative Feature Representation Based on Cascaded Attention Network with Adversarial Joint Loss for Speech Emotion Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…However, one challenge for SER and Emotional TTS is the lack of data, which has not been taken into account in most related approaches [3,4,5,6,7]. In reality, emotional speech data is more difficult and expensive to acquire compared to Neutral style speech, which results in a common issue that the speech data distributions are highly skewed toward the Neutral emotion class.…”
Section: Introductionmentioning
confidence: 99%
“…However, one challenge for SER and Emotional TTS is the lack of data, which has not been taken into account in most related approaches [3,4,5,6,7]. In reality, emotional speech data is more difficult and expensive to acquire compared to Neutral style speech, which results in a common issue that the speech data distributions are highly skewed toward the Neutral emotion class.…”
Section: Introductionmentioning
confidence: 99%