2018
DOI: 10.11591/ijeei.v6i1.409
|View full text |Cite
|
Sign up to set email alerts
|

A Review on Emotion Recognition Algorithms using Speech Analysis

Abstract: In recent years, there is a growing interest in speech emotion recognition (SER) by analyzing input speech. SER can be considered as simply pattern recognition task which includes features extraction, classifier, and speech emotion database. The objective of this paper is to provide a comprehensive review on various literature available on SER. Several audio features are available, including linear predictive coding coefficients (LPCC), Melfrequency cepstral coefficients (MFCC), and Teager energy based feature… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
5

Relationship

2
8

Authors

Journals

citations
Cited by 18 publications
(8 citation statements)
references
References 19 publications
0
8
0
Order By: Relevance
“…The scope for future improvements is very appealing in this field. Different multimodel deep learning techniques can be used along with different architectures to improve the performance parameters [20][21][22][23][24][25][26][27]. Apart from recognizing the emotions only, there can be further addition of intensity scale.…”
Section: Resultsmentioning
confidence: 99%
“…The scope for future improvements is very appealing in this field. Different multimodel deep learning techniques can be used along with different architectures to improve the performance parameters [20][21][22][23][24][25][26][27]. Apart from recognizing the emotions only, there can be further addition of intensity scale.…”
Section: Resultsmentioning
confidence: 99%
“…While such end-to-end DNNs may provide the valuable results, the limitation due to the scarcity of emotion-labeled speech datasets hinders the training of DNNs from scratch. A relevant number of studies, therefore, still employ traditional handcrafted speech features, particularly MFCCs, which are reportedly one of the most conventional and effective feature sets [3,41]. In [42], for example, MFCCs achieved notable performance on the Audio Video Emotion Challenge 2016.…”
Section: Speech Emotion Recognitionmentioning
confidence: 99%
“…For examples, the sadness, happy, joy, neutral etc., all are considered to be non-violent and only the anger is defined as violence. In earlier there are so many approaches are developed to detect the emotion from the audio signals [23]. With the help of MFCCs of speech signals, S. Demircan and H. Kahramanl [24] developed an emotion recognition system with unsupervised learning.…”
Section: Literature Surveymentioning
confidence: 99%