2022
DOI: 10.1016/j.dib.2022.108091

BanglaSER: A speech emotion recognition dataset for the Bangla language

Cited by 14 publications
(7 citation statements)
References 15 publications
“…Neural network-based DL models have been investigated in recent SER studies. Among different DL models, CNN [1,4], and LSTM network [5] are the base of the proposed SER model. CNN is the most well-known DL architecture motivated by natural creatures' basic visual attention mechanism [4].…”
Section: Conflicts Of Interest
Type: mentioning
confidence: 99%
“…The LSTM is a kind of RNN made up of recurrently associated memory blocks, including memory cells with self-connections that record the network's temporal states [5]. It is mainly effective in learning sequential data in the form of time steps.…”
Section: Conflicts Of Interest
Type: mentioning
confidence: 99%
“…As mentioned, the dataset is a Bengali dataset. With the advancement of natural language processing, some other research has been conducted on Bengali datasets for NLP purposes [4].…”
Section: Data Description
Type: mentioning
confidence: 99%
“…Except laboratory curated ones, material sources of EmoFilm [6], VESUS [7] and EmoSpeech [8] are film or in wild. Only a few laboratory curated datasets are available for Bangla language, such as SUBESCO [9] and BanglaSER [10]. As an example, the 7000 samples of popular Bangla SUBESCO [9] dataset are developed with only 10 speech dialogs repeatedly reading by 20 actors.…”
Section: Data Description
Type: mentioning
confidence: 99%