2014
DOI: 10.5120/17775-8913
|View full text |Cite
|
Sign up to set email alerts
|

Emotion Recognition from Speech using Discriminative Features

Abstract: Creating an accurate Speech Emotion Recognition (SER) system depends on extracting features relevant to that of emotions from speech. In this paper, the features that are extracted from the speech samples include Mel Frequency Cepstral Coefficients (MFCC), energy, pitch, spectral flux, spectral roll-off and spectral stationarity. In order to avoid the 'curse of dimensionality', statistical parameters, i.e. mean, variance, median, maximum, minimum, and index of dispersion have been applied on the extracted feat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 22 publications
(19 reference statements)
0
1
0
Order By: Relevance
“…The following parameters were selected for the speech signal: f 0 (fundamental frequency), MFCC (Mel Frequency Cepstral Coefficients), jitter and shimmer. It was determined that the first 13 MFCC coefficients describing the frequency parameters of speech would be used for emotion recognition because they contain most of the information regarding the emotion to be recognized [ 77 ]. The fundamental frequency f 0 , on the other hand, contains information about the pitch of the voice, and therefore allows us to take, e.g., gender into account, without the need for additional determination.…”
Section: Methodsmentioning
confidence: 99%
“…The following parameters were selected for the speech signal: f 0 (fundamental frequency), MFCC (Mel Frequency Cepstral Coefficients), jitter and shimmer. It was determined that the first 13 MFCC coefficients describing the frequency parameters of speech would be used for emotion recognition because they contain most of the information regarding the emotion to be recognized [ 77 ]. The fundamental frequency f 0 , on the other hand, contains information about the pitch of the voice, and therefore allows us to take, e.g., gender into account, without the need for additional determination.…”
Section: Methodsmentioning
confidence: 99%