Interspeech 2016 2016
DOI: 10.21437/interspeech.2016-333
|View full text |Cite
|
Sign up to set email alerts
|

Enhancing Multilingual Recognition of Emotion in Speech by Language Identification

Abstract: We investigate, for the first time, if applying model selection based on automatic language identification (LID) can improve multilingual recognition of emotion in speech. Six emotional speech corpora from three language families (Germanic, Romance, Sino-Tibetan) are evaluated. The emotions are represented by the quadrants in the arousal/valence plane, i. e., positive/negative arousal/valence. Four selection approaches for choosing an optimal training set depending on the current language are compared: within … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
3
3

Relationship

1
8

Authors

Journals

citations
Cited by 26 publications
(16 citation statements)
references
References 19 publications
0
16
0
Order By: Relevance
“…Most of the existing approaches are tailored toward specific databases, which could be one of the main factors making this task tough to solve. While the system is trained on a particular database, it faces the issues of different subjects, their ethnicity, appearance, culture, sex and age [31], contextual meaning of sentences, and the background noise [32]. Consequently, the algorithm does not work well when dealing with natural environment [33].…”
Section: Introductionmentioning
confidence: 99%
“…Most of the existing approaches are tailored toward specific databases, which could be one of the main factors making this task tough to solve. While the system is trained on a particular database, it faces the issues of different subjects, their ethnicity, appearance, culture, sex and age [31], contextual meaning of sentences, and the background noise [32]. Consequently, the algorithm does not work well when dealing with natural environment [33].…”
Section: Introductionmentioning
confidence: 99%
“…An approach to multilingual emotion classification using language identification and model selection is presented in [7]. In contrast to this work where language-dependent models are trained and then selected accordingly, we examine the performance of one model trained on multiple languages.…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, speech linguistic features carry information about the culture and the way emotions are expressed or perceived. In [5], a model selection technique based on language identification is proposed to improve speech emotion recognition accuracy. As per the census, tribal people make up about 8.2 percent of the India's total population.…”
Section: Motivationmentioning
confidence: 99%