ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
DOI: 10.1109/icassp.2019.8682283
|View full text |Cite
|
Sign up to set email alerts
|

Speech Emotion Recognition Using Deep Neural Network Considering Verbal and Nonverbal Speech Sounds

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
43
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 86 publications
(44 citation statements)
references
References 18 publications
1
43
0
Order By: Relevance
“…Huang's approach, CNN was trained both on verbal and nonverbal segments of speech and CNN learned features were used by LSTM for recognizing speech emotions [219].…”
Section: Cnns Based Speech Recognitionmentioning
confidence: 99%
“…Huang's approach, CNN was trained both on verbal and nonverbal segments of speech and CNN learned features were used by LSTM for recognizing speech emotions [219].…”
Section: Cnns Based Speech Recognitionmentioning
confidence: 99%
“…The late-fusion of facial expressions, speech signals, and text information reached the third-best result, with a CCC of 0.27 for arousal and 0.35 for valence. The complex attention-based network proposed by Huang et al [10] was able to achieve a CCC of 0.31 in arousal and 0.45 in valence, using only visual information.…”
Section: Omg-emotionmentioning
confidence: 99%
“…Deep learning models usually learn how to represent affective features by updating filters based on a large number of data samples, using strongly supervised learning methods [9][10][11][12][13][14]. As a result, these models can extract facial features for a collection of different individuals, which contributes to their generalization of expression representations enabling a universal and automatic FER machine.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Since the output value of the nerve cell exceeds the appropriate range of the activation function itself before passing the activation function, the nerve cell may fail to work [30]. In order to solve this problem, this article introduces the bn [31], the batch normalization method is as follows:…”
Section: B Lstm-based Emotion Fusion and Recognitionmentioning
confidence: 99%