2018
DOI: 10.1109/access.2018.2881096
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of an Arabic Speech Corpus of Emotions: A Perceptual and Statistical Analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
16
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 28 publications
(19 citation statements)
references
References 20 publications
2
16
0
1
Order By: Relevance
“…From the results, one can see that the average accuracy is slightly better for males at 98.04% compared to females at 96.77%. This result is similar to those presented in our previous work [49]. Table 7 presents the normalized confusion matrix for English using the EPST corpus, which includes five females and three males (CC, MF, and CL).…”
Section: Resultssupporting
confidence: 87%
See 1 more Smart Citation
“…From the results, one can see that the average accuracy is slightly better for males at 98.04% compared to females at 96.77%. This result is similar to those presented in our previous work [49]. Table 7 presents the normalized confusion matrix for English using the EPST corpus, which includes five females and three males (CC, MF, and CL).…”
Section: Resultssupporting
confidence: 87%
“…The sampling rate was set to 16,000 Hz. For evaluation purposes, a blind human perceptual test was conducted in both phases [49]. In Phase 1, the total duration of recording was 2 h and 55 min.…”
Section: Selected Speech Corpora a Ksu Emotions Corpusmentioning
confidence: 99%
“…Overall raw hit rate for Phase 1 was 71%, and for Phase 2, it was 80%. If we look at the perceived hit rates for other relevant audio-only datasets including: Arabic database: 80% for 800 sentences [ 21 ], EMOVO: 80% for 588 files [ 13 ], German database: 85% for 800 sentences [ 17 ], MES-P: 86.54% for 5376 stimuli [ 23 ], Indonesian speech corpus: 62% for 1357 audios [ 24 ], Montreal affective voices: 69% for 90 stimuli [ 54 ], Portuguese dataset: 75% for 190 sentences [ 55 ], RAVDESS: 62.5% for 1440 audio-only speech [ 17 ]; these results confirm that the perceptual hit rate of SUBESCO was comparable to existing emotional speech sets. Unbiased hit rates were also reported along with raw hit rates to address false alarms.…”
Section: Discussionmentioning
confidence: 99%
“…Since the last decade, deep learning has arisen as a new attractive area of machine learning, and ever since has been examined and utilized in a range of different research topics [1]. Deep learning consists of a multiple of machine learning algorithms fed with inputs in the form of multiple layered models.…”
Section: Introductionmentioning
confidence: 99%