2021
DOI: 10.1007/s40747-021-00295-z
|View full text |Cite|
|
Sign up to set email alerts
|

Emotion classification from speech signal based on empirical mode decomposition and non-linear features

Abstract: Emotion recognition system from speech signal is a widely researched topic in the design of the Human–Computer Interface (HCI) models, since it provides insights into the mental states of human beings. Often, it is required to identify the emotional condition of the humans as cognitive feedback in the HCI. In this paper, an attempt to recognize seven emotional states from speech signals, known as sad, angry, disgust, happy, surprise, pleasant, and neutral sentiment, is investigated. The proposed method employs… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 54 publications
(15 citation statements)
references
References 46 publications
0
11
0
Order By: Relevance
“…The sentiment and emotion analysis of multi-modal dialogue such as video, audio, image attracts more and more attention [7,[22][23][24]. To improve the performance of the classifier, the image-text pair in the existing unimodal data is usually used as a multi-modal comprehensive data set [25].…”
Section: Multimodal Sentiment Analysismentioning
confidence: 99%
“…The sentiment and emotion analysis of multi-modal dialogue such as video, audio, image attracts more and more attention [7,[22][23][24]. To improve the performance of the classifier, the image-text pair in the existing unimodal data is usually used as a multi-modal comprehensive data set [25].…”
Section: Multimodal Sentiment Analysismentioning
confidence: 99%
“…In addition, this preliminary work adds another path for future development and applications of IA and IF in different domains where time frequency analysis is required. The EMD method has been used in speech recognition systems [ 30 , 31 , 32 ] and human emotion recognition system [ 33 ]. The EMD method has also been used to perform classification of respiratory sound in conjunction with FFT to extract features [ 34 ].…”
Section: Introductionmentioning
confidence: 99%
“…Extract features from these five frequency bands. These features usually include power spectral density (PSD) [ 11 ], differential entropy (DE) [ 12 , 13 ], differential asymmetry (DASM) [ 14 ], and rational asymmetry (RASM) [ 15 , 16 ]. EEG signal is a non-stationary and non-linear random signal [ 17 ].…”
Section: Introductionmentioning
confidence: 99%