2014
DOI: 10.1007/978-3-319-07491-7_32
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning for Emotional Speech Recognition

Abstract: Abstract. Emotional speech recognition is a multidisciplinary research area that has received increasing attention over the last few years. The present paper considers the application of restricted Boltzmann machines (RBM) and deep belief networks (DBN) to the difficult task of automatic Spanish emotional speech recognition. The principal motivation lies in the success reported in a growing body of work employing these techniques as alternatives to traditional methods in speech processing and speech recognitio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0
2

Year Published

2014
2014
2021
2021

Publication Types

Select...
6
2
2

Relationship

0
10

Authors

Journals

citations
Cited by 22 publications
(11 citation statements)
references
References 16 publications
0
9
0
2
Order By: Relevance
“…Deep Neural Networks (DNNs) are large scale version of feedforward neural networks that have been successfully validated for learning the complex functional relations. Use of DNN for detecting emotions from speech and music have been well accounted in literature [35]. Each feature dimension was normalized to have zero mean and unit variance before feeding it to DNN.…”
Section: Resultsmentioning
confidence: 99%
“…Deep Neural Networks (DNNs) are large scale version of feedforward neural networks that have been successfully validated for learning the complex functional relations. Use of DNN for detecting emotions from speech and music have been well accounted in literature [35]. Each feature dimension was normalized to have zero mean and unit variance before feeding it to DNN.…”
Section: Resultsmentioning
confidence: 99%
“…In the works of Brueckner et al further related speaker states and traits from the ISCA Interspeech Computational Paralinguistics Challenges have been considered -often outperforming the best results obtained in those [ [3,4,5,6]]. Further examples for emotional speech recognition include [ [7,25,26,1,21,29]]. In a related way, deep learning has also been successfully applied to emotion recognition in music [ [31]].…”
Section: Deep Learningmentioning
confidence: 98%
“…Layered (i.e. stacked) RBMs provide a vetted system for using probabilistic models to infer relationships between features in a variety of fields [11,13,26,27]. RBMs also have an impressive ability to provide contextual inference in noisy datasets, however an alternative is to use Generative Stochastic Networks (GSNs).…”
Section: Future Workmentioning
confidence: 99%