The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2022
DOI: 10.1016/j.specom.2021.12.004
|View full text |Cite
|
Sign up to set email alerts
|

Bangladeshi Bangla speech corpus for automatic speech recognition research

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 4 publications
0
8
0
Order By: Relevance
“…Developed in 2016, Bangla Babel is likewise a telephone conversation-based speech corpus, comprising 215 hours of speech [18]. However, Babel has West Bengal accented speech that is distinct from the Bangladeshi Bangla accent [7]. Ahmed et al prepared 960 hours of broadcast Bangla speech corpus by transcribing speech data in an automated way with pre-trained ASR models [19].…”
Section: Related Work In Banglamentioning
confidence: 99%
See 2 more Smart Citations
“…Developed in 2016, Bangla Babel is likewise a telephone conversation-based speech corpus, comprising 215 hours of speech [18]. However, Babel has West Bengal accented speech that is distinct from the Bangladeshi Bangla accent [7]. Ahmed et al prepared 960 hours of broadcast Bangla speech corpus by transcribing speech data in an automated way with pre-trained ASR models [19].…”
Section: Related Work In Banglamentioning
confidence: 99%
“…Bangla is a morphologically rich language from the Indo-Aryan language sub-group. Kibria et al developed SUBAK.KO, an annotated speech corpus for speech recognition research comprising 241 hours of Bangladeshi Bangla speech data, to address the dearth of annotated speech datasets in Bangla [7]. SUBAK.KO contains 229 hours of clean read speech and 12 hours of broadcast speech utterances.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The output state of the hidden layer of the network is limited, so that the nodes of the hidden layer enter the sparse state, and the average output of the nodes of the hidden layer is equal to 0. In this way, the proportion of active nodes is relatively small, and the homogeneity of 3 Wireless Communications and Mobile Computing the characteristics of the nodes of the hidden layer will not occur [16]. The loss function of sparse autoencoder is shown in…”
Section: Encoder Categorymentioning
confidence: 99%
“…Almost 200 million people worldwide say Bangla as their first language which is the 4th among all over the world [11]. Bangla natural language processing (BNLP) [12] resources are very less compared to the English language but it is growing very rapidly in some field such as speech recognition [13]- [16], Bangla offensive word recognition [17], but there are more less work for Bangla music and statistics regarding the classification of musical genres have been provided in literature. Now we are aware of improvements made thus far in both this sector and Bangla music.…”
Section: Introductionmentioning
confidence: 99%