2022
DOI: 10.1109/access.2021.3136251
|View full text |Cite
|
Sign up to set email alerts
|

Bangla Speech Emotion Recognition and Cross-Lingual Study Using Deep CNN and BLSTM Networks

Abstract: In this study, we have presented a deep learning-based implementation for speech emotion recognition (SER). The system combines a deep convolutional neural network (DCNN) and a bidirectional long-short term memory (BLSTM) network with a time-distributed flatten (TDF) layer. The proposed model has been applied for the recently built audio-only Bangla emotional speech corpus SUBESCO. A series of experiments were carried out to analyze all the models discussed in this paper for baseline, cross-lingual, and multil… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 37 publications
(16 citation statements)
references
References 52 publications
0
16
0
Order By: Relevance
“…Te model implemented for the RAVDESS database outperforms the state-of-the-art accuracy giving 97.14% highest recognition rate [78]. Combining DCNN and BLSTM, the model proposed by Sultana et al [3] obtained state-of-the-art efciency with 82.7% and 86.9% accuracy for the RAVDESS and SUBESCO databases for English and Bangla languages, respectively.…”
Section: Speech Emotion Recognition For Indo-aryan and Dravidian Lang...mentioning
confidence: 95%
See 3 more Smart Citations
“…Te model implemented for the RAVDESS database outperforms the state-of-the-art accuracy giving 97.14% highest recognition rate [78]. Combining DCNN and BLSTM, the model proposed by Sultana et al [3] obtained state-of-the-art efciency with 82.7% and 86.9% accuracy for the RAVDESS and SUBESCO databases for English and Bangla languages, respectively.…”
Section: Speech Emotion Recognition For Indo-aryan and Dravidian Lang...mentioning
confidence: 95%
“…Terefore, deep learning approaches have become more In recent times, multitask learning and attention mechanism are also being used for improved performance [58,59]. For cross-corpus and cross-lingual speech emotion recognition, the transfer learning technique is being widely used [3,60,61].…”
Section: Speech Emotion Recognition Trendsmentioning
confidence: 99%
See 2 more Smart Citations
“…Apart from CNNs [ 13 , 14 , 15 ], Long Short-term Memory Networks (LSTM) [ 16 ], DNN [ 17 , 18 ] and CNN-LSTM hybrids have also shown promising results in the field of emotion recognition. LSTMs are more advanced Recurrent Neural Networks (RNN) optimized to use gates to control information flow.…”
Section: Introductionmentioning
confidence: 99%