2022
DOI: 10.1007/978-3-031-05936-0_31
|View full text |Cite
|
Sign up to set email alerts
|

Leveraged Mel Spectrograms Using Harmonic and Percussive Components in Speech Emotion Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 28 publications
0
2
0
Order By: Relevance
“…In another work, Hason Rudd et al (2022) [10] first converted the speech signals into the mel spectrogram representation. Then, they applied a VGG16 model to extract feature maps with various dimensions and signal sampling ratios.…”
Section: Related Workmentioning
confidence: 99%
“…In another work, Hason Rudd et al (2022) [10] first converted the speech signals into the mel spectrogram representation. Then, they applied a VGG16 model to extract feature maps with various dimensions and signal sampling ratios.…”
Section: Related Workmentioning
confidence: 99%
“…Al-onazi et al [39] also proposed to augment a combination of first and second delta MFCCs, chroma grams, tonnetz, and spectral contrast which were used as input to a transformer-based SER model. In [40], harmonic and percussive components were extracted from the mel spectrograms and later concatenated with the mel spectrograms to form augmented input to a pre-trained VGG16 model for SER. They also experimented with a combination of MFCCs, mel spectrograms and chroma grams.…”
Section: Related Workmentioning
confidence: 99%