2024
DOI: 10.1109/taffc.2023.3290795
|View full text |Cite
|
Sign up to set email alerts
|

Adversarial Domain Generalized Transformer for Cross-Corpus Speech Emotion Recognition

Abstract: Speech emotion recognition (SER) promotes the development of intelligent devices, which enable natural and friendly human-computer interactions. However, the recognition performance of existing approaches is significantly reduced on unseen datasets, and the lack of sufficient training data limits the generalizability of deep learning models. In this work, we analyze the impact of the domain generalization method on cross-corpus SER and propose an adversarial domain generalized transformer (ADoGT), which is aim… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 59 publications
0
0
0
Order By: Relevance
“…Speech processing is focused on enabling machines to understand and interpret human speech with the ultimate objective of creating systems that facilitate natural and intuitive interaction between humans and machines (Hickok and Poeppel 2007). AEs have found numerous applications in speech processing, especially in speech denoising (Bhangale and Kothandaraman 2022;Tanveer et al 2023), speech recognition (Kumar et al 2022;Sayed et al 2023), speech representation (Alex and Mary 2023;Seki et al 2023), speech compression (Li et al 2021;Srikotr 2022), feature representation (Shixin et al 2022;Tian et al 2022), and speech emotion recognition (Dutt and Gader 2023;Gao et al 2023).…”
Section: Speech Processingmentioning
confidence: 99%
“…Speech processing is focused on enabling machines to understand and interpret human speech with the ultimate objective of creating systems that facilitate natural and intuitive interaction between humans and machines (Hickok and Poeppel 2007). AEs have found numerous applications in speech processing, especially in speech denoising (Bhangale and Kothandaraman 2022;Tanveer et al 2023), speech recognition (Kumar et al 2022;Sayed et al 2023), speech representation (Alex and Mary 2023;Seki et al 2023), speech compression (Li et al 2021;Srikotr 2022), feature representation (Shixin et al 2022;Tian et al 2022), and speech emotion recognition (Dutt and Gader 2023;Gao et al 2023).…”
Section: Speech Processingmentioning
confidence: 99%