2020
DOI: 10.3390/electronics9101725
|View full text |Cite
|
Sign up to set email alerts
|

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

Abstract: In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation only. The assumption is that the speech audio signal carries sufficient emotional in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 27 publications
(15 citation statements)
references
References 47 publications
(55 reference statements)
0
15
0
Order By: Relevance
“…As for cross-linguistic studies, Rajoo and Aun [ 37 ] proved the strong language-dependent nature of SER, which was further explored by Fu et al [ 38 ], who trained algorithms with combinations of three languages, obtaining accuracies which, preliminarily, outlined the possibility of a cross-language model for German and Chinese, while Italian was not recognized as successfully—possibly due to the unbalanced dataset. Li and Akagi [ 39 ] obtained interesting results, merging widely known existing datasets, whereas Tamulevičius et al [ 40 ] obtained high accuracies with a CNN-based approach. However, their dataset is highly unbalanced, and the emotions have been acted by non-professionals.…”
Section: Introductionmentioning
confidence: 99%
“…As for cross-linguistic studies, Rajoo and Aun [ 37 ] proved the strong language-dependent nature of SER, which was further explored by Fu et al [ 38 ], who trained algorithms with combinations of three languages, obtaining accuracies which, preliminarily, outlined the possibility of a cross-language model for German and Chinese, while Italian was not recognized as successfully—possibly due to the unbalanced dataset. Li and Akagi [ 39 ] obtained interesting results, merging widely known existing datasets, whereas Tamulevičius et al [ 40 ] obtained high accuracies with a CNN-based approach. However, their dataset is highly unbalanced, and the emotions have been acted by non-professionals.…”
Section: Introductionmentioning
confidence: 99%
“…Various kinds of emotions based on cross‐linguistic speech were established by Tamulevicius et al 30 The information regarding different emotions was gathered from several languages, including Spanish, English, German, Polish, Lithuanian, and Serbian. The emotions to cross‐linguistic speech dataset had attained a size greater than 10,000 clips of emotions.…”
Section: Related Workmentioning
confidence: 99%
“…This speaker-independent approach cannot be employed if there are several speakers. The researchers (Tamulevičius et al, 2020) have performed emotion recognition from the speech data on the various emotional speech-based databases belonging to multiple languages. As per the analysis done by the researchers (Khalil et al, 2019), it is easy and efficient if we extract the emotions from the speech to determine the sentiment.…”
Section: Positivementioning
confidence: 99%