SUST Bangla Emotional Speech Corpus (SUBESCO): An audio-only emotional speech corpus for Bangla

Sultana, Sadia; Rahman, M. Shahidur; Selim, Mohammad Reza; Iqbal, M. Zafar

doi:10.1371/journal.pone.0250173

Cited by 36 publications

(18 citation statements)

References 36 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To a degree, the neutral emotional class is mistaken for the sad class. Note that, similar confusion has been seen in prior studies too [ 2 , 11 ]. Recognition rates are highest for angry followed by neutral, happy, sad, and surprise emotions.…”

Section: Experimental Design Materials and Methodssupporting

confidence: 82%

“…There are several SER datasets available in literature covering English, Greek, Korean, and German languages, such as IEMOCAP [5] , RAVDESS [6] , MSP-IMPROV [2] , AESDD [7] , SAVEE [8] , CADKES [9] , and EMO-DB [10] . However, there is only one dataset for the SER task in the Bangla language, namely SUBESCO [11] . A simple comparison of these public SER datasets with BanglaSER is shown in Table 1 .…”

Section: Data Descriptionmentioning

confidence: 99%

See 1 more Smart Citation

BanglaSER: A speech emotion recognition dataset for the Bangla language

et al. 2022

View full text Add to dashboard Cite

Section: Experimental Design Materials and Methodssupporting

confidence: 82%

Section: Data Descriptionmentioning

confidence: 99%

BanglaSER: A speech emotion recognition dataset for the Bangla language

et al. 2022

View full text Add to dashboard Cite

“…Most recently, another database, whose verification is based on perception tests and statistical analyses, came from the Bangla language. The SUST Bangla Emotional Speech Corpus (SUBESCO) [16] involves 20 actors portraying 7 emotions.…”

Section: Existing Databasesmentioning

confidence: 99%

“…However, the relationship was not significant (p = . 16). The identification accuracy-naturalness relationship is captured in Fig.…”

Section: Identification Accuracy and Naturalness Relationshipmentioning

confidence: 99%

“…In the study of speech emotion recognition, validated emotional speech databases constitute a crucial building block for developing and evaluating speech emotion recognizers [13]. To date, numerous emotional speech databases have been created in many languages, including Arabic [14], [15], Bangla [16], Mandarin Chinese [17], [18], Danish [19], English [20]- [22], German [23], [24], Italian [25], and Persian [26]. However, the majority of the databases come from highresource languages such as English, Mandarin Chinese, and German [27].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Chung-Ang Auditory Database of Korean Emotional Speech: A Validated Set of Vocal Expressions With Different Intensities

Nam

Lee

2022

IEEE Access

View full text Add to dashboard Cite

A growing body of evidence indicates that intensity plays a role in emotion perception. However, only a few databases have been explicitly designed to provide emotional stimuli that are expressed at varying intensities. We developed and validated a Korean audio-only database of emotional expressions. Eighteen actors were recorded using twenty-five sentences with strong and moderate intensities for "neutral," "happiness," "sadness," "anger," "fear," and "boredom" emotions. Twenty-five native Korean-speaking adults completed the emotion identification and naturalness rating tasks. All listeners were presented with the full set of 5400 recordings in a six-alternative forced-choice paradigm, yielding 135000 judgements for identification and naturalness, respectively. Raw and unbiased hit rates were calculated, with identification responses significantly above chance level for every emotion at both intensities. The overall raw hit rates reached 87% and 78% for the strong and moderate stimuli, respectively, indicating that strong emotional expressions were more accurately identified than their moderate counterparts. Similarly, a recognition advantage for strong intensity over moderate intensity was observed for each emotion at both intensities. High inter-and intra-rater reliabilities were found in listeners' identifying emotion categories and assigning naturalness ratings, respectively. Further, there was a strong association between identification accuracy and the degree of naturalness; more natural variants of an emotion were more accurately identified than its less natural counterparts. These results confirm that the proposed database will serve as a valuable source for emotion research.

show abstract

EmoBone: A Multinational Audio Dataset of Emotional Bone Conducted Speech

Hosain,

Sugiura,

Rahman

et al. 2024

IEEJ Transactions Elec Engng

View full text Add to dashboard Cite

This paper introduces EmoBone, a comprehensive audio‐only emotional bone‐conducted speech dataset featuring speakers from various countries. The dataset comprises speeches from 28 individuals representing 10 different nations, with each participant delivering 10 sentences designed to evoke distinct emotions. In addition to an air‐conducted microphone, the recordings utilized bone conduction technology, transmitting sound directly to the speakers' inner ears, ensuring high‐quality emotional speech recordings. To assess the validity of the dataset, 80 university students from Bangladesh listened to the recordings and successfully identified the expressed emotions with an accuracy exceeding 76%. Statistical methods were also employed to evaluate the reliability of the dataset, revealing a high level of agreement among raters. EmoBone, with a cumulative duration surpassing 19 h and 15 680 unique utterances, stands as the most extensive emotional speech dataset available. This makes it a valuable tool for studying how emotional speech varies across cultures. Furthermore, due to its utilization of bone conduction technology, EmoBone facilitates the study of acoustic features in emotional speech from diverse dimensions. The data that supports the findings of this study is available upon reasonable request. © 2024 Institute of Electrical Engineer of Japan and Wiley Periodicals LLC.

show abstract

SUST Bangla Emotional Speech Corpus (SUBESCO): An audio-only emotional speech corpus for Bangla

Cited by 36 publications

References 36 publications

BanglaSER: A speech emotion recognition dataset for the Bangla language

BanglaSER: A speech emotion recognition dataset for the Bangla language

Chung-Ang Auditory Database of Korean Emotional Speech: A Validated Set of Vocal Expressions With Different Intensities

EmoBone: A Multinational Audio Dataset of Emotional Bone Conducted Speech

Contact Info

Product

Resources

About