2021
DOI: 10.7717/peerj-cs.766
|View full text |Cite
|
Sign up to set email alerts
|

Effect on speech emotion classification of a feature selection approach using a convolutional neural network

Abstract: Speech emotion recognition (SER) is a challenging issue because it is not clear which features are effective for classification. Emotionally related features are always extracted from speech signals for emotional classification. Handcrafted features are mainly used for emotional identification from audio signals. However, these features are not sufficient to correctly identify the emotional state of the speaker. The advantages of a deep convolutional neural network (DCNN) are investigated in the proposed work.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
1
1

Relationship

1
9

Authors

Journals

citations
Cited by 24 publications
(10 citation statements)
references
References 60 publications
0
10
0
Order By: Relevance
“…is, however, has restricted the diversified development of music education [13][14][15]. Due to the multiculturalism need and the pluralistic society, teachers should use sufficient excellent traditional music knowledge and OES on the Internet to supplement students outside the classroom.…”
Section: Development and Utilization Of Music Oermentioning
confidence: 99%
“…is, however, has restricted the diversified development of music education [13][14][15]. Due to the multiculturalism need and the pluralistic society, teachers should use sufficient excellent traditional music knowledge and OES on the Internet to supplement students outside the classroom.…”
Section: Development and Utilization Of Music Oermentioning
confidence: 99%
“…Accordingly, researchers use random forest (RF) and ANN classifiers' performance scores as metrics of the collected emotional frequencies' efficacy. Identifying the most essential and discriminative characteristics for each SER was done using the feature selection (FS) method [20].…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, as recommended in [39], the experiments were performed using speaker-independent Leave-One-Speaker-Out (LOSO) or Leave-One-Speaker-Group-Out (LOSGO) techniques too. As we know, emotions are deeply linked to the identity of the speaker.…”
Section: Resultsmentioning
confidence: 99%