2021
DOI: 10.1109/access.2021.3069818
|View full text |Cite
|
Sign up to set email alerts
|

Autoencoder With Emotion Embedding for Speech Emotion Recognition

Abstract: An important part of the human-computer interaction process is speech emotion recognition (SER), which has been receiving more attention in recent years. However, although a wide diversity of methods has been proposed in SER, these approaches still cannot improve the performance. A key issue in the low performance of the SER system is how to effectively extract emotion-oriented features. In this paper, we propose a novel algorithm, an autoencoder with emotion embedding, to extract deep emotion features. Unlike… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 37 publications
(12 citation statements)
references
References 53 publications
0
7
0
Order By: Relevance
“…Zhang et al [61] proposed a novel methodology for recognizing emotions using speech. The authors used autoencoders along with emotion embedding to extract the deep emotion features.…”
Section: Related Workmentioning
confidence: 99%
“…Zhang et al [61] proposed a novel methodology for recognizing emotions using speech. The authors used autoencoders along with emotion embedding to extract the deep emotion features.…”
Section: Related Workmentioning
confidence: 99%
“…In [ 21 ], the authors proposed a Focal Loss-based Convolutional Recurrent Neural Networks (FL-CRNN) deep learning model with variable input length for speech emotion recognition. In [ 22 ], the authors proposed an Automatic Encoder with Emotion Embedding (AEEE) to extract deep emotional features. In [ 23 ], Ozseven proposed a Statistical Feature Selection method based on the change of emotion on acoustic features (SFS-AF).…”
Section: Related Workmentioning
confidence: 99%
“…Khattak et al ( 16 ) proposed an efficient deep learning technology, which used convolution neural networks to classify emotions from facial images and effectively detect age and gender from facial expressions. Zhang and Zhang ( 17 ) proposed an automatic encoder with emotion embedding to extract deep emotional features. Panahi et al ( 18 ) studied the effectiveness of fractional Fourier transform as a new feature extraction method in improving the accuracy of emotion recognition in physiological signals.…”
Section: Related Workmentioning
confidence: 99%