2021
DOI: 10.3390/app12010327

A Proposal for Multimodal Emotion Recognition Using Aural Transformers and Action Units on RAVDESS Dataset

Abstract: Emotion recognition is attracting the attention of the research community due to its multiple applications in different fields, such as medicine or autonomous driving. In this paper, we proposed an automatic emotion recognizer system that consisted of a speech emotion recognizer (SER) and a facial emotion recognizer (FER). For the SER, we evaluated a pre-trained xlsr-Wav2Vec2.0 transformer using two transfer-learning techniques: embedding extraction and fine-tuning. The best accuracy results were achieved when…

Cited by 42 publications (12 citation statements)
References 69 publications (78 reference statements)

“…As a new type of therapy, music therapy takes the function of music as the basis and intervenes the performers through various forms of music, so as to relieve the performers' artistic psychological intervention. This paper studies and analyzes the psychological intervention of music therapy in vocal music performance students' artistic psychological intervention [ 4 6 ].…”
Section: Introduction (mentioning)
confidence: 99%
“…Among its advantages is having a proportional number of files in each emotion, which avoids the problems caused by training algorithms with unbalanced data. In addition, RAVDESS is a reference dataset in the research community that has been used in several works [23][24][25][26][27][28][29][30][31][32][33][34][35][36][37][38][39].…”
Section: Database (mentioning)
confidence: 99%
“…Luna-Jimenez and Kleinlein [36] proposed an automatic emotion-recognizer system that had a speech emotion recognizer (SER) and a facial emotion recognizer (FER). Eight emotions were classified, and they achieved 86.70% accuracy on the RAVDESS dataset using a subject-wise 5-CV evaluation.…”
Section: Review Of Related Literature (mentioning)
confidence: 99%
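
The "subject-wise 5-CV evaluation" mentioned in the statement above means that all recordings of a given speaker fall into the same fold, so no speaker is seen in both training and testing. The following is a minimal sketch of that splitting idea only, not the authors' code: the function name `subject_wise_folds`, the round-robin assignment of speakers to folds, and the toy data (24 speakers with 3 clips each) are assumptions made for illustration.

```python
def subject_wise_folds(subject_ids, n_folds=5):
    """Yield (train_idx, test_idx) pairs in which no subject is on both sides.

    subject_ids: one subject label per sample, in sample order.
    Assumption: subjects are assigned to folds round-robin in sorted order;
    the paper's actual fold assignment is not given in this excerpt.
    """
    subjects = sorted(set(subject_ids))
    fold_of = {s: i % n_folds for i, s in enumerate(subjects)}
    for k in range(n_folds):
        # Samples whose subject belongs to fold k form the test set.
        test_idx = [i for i, s in enumerate(subject_ids) if fold_of[s] == k]
        train_idx = [i for i, s in enumerate(subject_ids) if fold_of[s] != k]
        yield train_idx, test_idx


# Hypothetical toy data: 24 speakers, 3 clips per speaker.
subject_ids = [actor for actor in range(1, 25) for _ in range(3)]
folds = list(subject_wise_folds(subject_ids, n_folds=5))

for train_idx, test_idx in folds:
    train_subjects = {subject_ids[i] for i in train_idx}
    test_subjects = {subject_ids[i] for i in test_idx}
    # Speaker independence: the two subject sets never overlap.
    assert train_subjects.isdisjoint(test_subjects)
```

Reported accuracy under such a scheme is typically the average over the five held-out folds, which estimates performance on speakers the model has never heard.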