2021
DOI: 10.1007/s12559-021-09865-2
|View full text |Cite
|
Sign up to set email alerts
|

When Old Meets New: Emotion Recognition from Speech Signals

Abstract: Speech is one of the most natural communication channels for expressing human emotions. Therefore, speech emotion recognition (SER) has been an active area of research with an extensive range of applications that can be found in several domains, such as biomedical diagnostics in healthcare and human–machine interactions. Recent works in SER have been focused on end-to-end deep neural networks (DNNs). However, the scarcity of emotion-labeled speech datasets inhibits the full potential of training a deep network… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(3 citation statements)
references
References 60 publications
0
3
0
Order By: Relevance
“…As illustrated, IMEMD-CRNN consists of three modules: IMEMD-based emotional speech signal decomposition, extraction of time-frequency features from IMFs, and speech emotion recognition based on CRNN. Arano et al (2021) show that effective hand-crafted features, compared to sophisticated deep-learning feature sets, can still have better performance. Therefore, we combine IMEMD-based features with CRNN network in order to improve the robustness and accuracy of the speech emotion recognition system.…”
Section: Methodsmentioning
confidence: 99%
“…As illustrated, IMEMD-CRNN consists of three modules: IMEMD-based emotional speech signal decomposition, extraction of time-frequency features from IMFs, and speech emotion recognition based on CRNN. Arano et al (2021) show that effective hand-crafted features, compared to sophisticated deep-learning feature sets, can still have better performance. Therefore, we combine IMEMD-based features with CRNN network in order to improve the robustness and accuracy of the speech emotion recognition system.…”
Section: Methodsmentioning
confidence: 99%
“…The expression by different people is not exactly the same, so it is difficult to obtain a unified and recognized emotion description in the field of speech emotion recognition [11] . Most of the current studies used for speech emotion recognition are based on the discrete emotions.…”
Section: Discrete Model For Speech Emotionmentioning
confidence: 99%
“…The classification task in machine learning is normally performed using a single classifier, hierarchical classifier, or classifier ensemble approach. Araño et al [6] utilized a hybrid set of features for classifying emotions from speech consisting of MFCCs and image features extracted from spectrograms. The 1 MFCCs features along with the long short-term memory (LSTM) network performed better as compared to the SVM classifier.…”
Section: Introductionmentioning
confidence: 99%