2022
DOI: 10.3390/s22072461
|View full text |Cite
|
Sign up to set email alerts
|

The Emotion Probe: On the Universality of Cross-Linguistic and Cross-Gender Speech Emotion Recognition via Machine Learning

Abstract: Machine Learning (ML) algorithms within a human–computer framework are the leading force in speech emotion recognition (SER). However, few studies explore cross-corpora aspects of SER; this work aims to explore the feasibility and characteristics of a cross-linguistic, cross-gender SER. Three ML classifiers (SVM, Naïve Bayes and MLP) are applied to acoustic features, obtained through a procedure based on Kononenko’s discretization and correlation-based feature selection. The system encompasses five emotions (d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
15
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 27 publications
(20 citation statements)
references
References 64 publications
1
15
0
Order By: Relevance
“…First of all, traditional ML algorithms could provide reliable results in the case of small-to-medium size of training data [51][52][53]. Second, SVM and MLP classifiers often demonstrate better performance than others [22,23,54]. Some experiments have shown a supercity of SVM and MLP for emotion recognition over classical Random Forest, K-NN, etc.…”
Section: Classifiersmentioning
confidence: 99%
See 2 more Smart Citations
“…First of all, traditional ML algorithms could provide reliable results in the case of small-to-medium size of training data [51][52][53]. Second, SVM and MLP classifiers often demonstrate better performance than others [22,23,54]. Some experiments have shown a supercity of SVM and MLP for emotion recognition over classical Random Forest, K-NN, etc.…”
Section: Classifiersmentioning
confidence: 99%
“…MLP is the "basic" example of NN. It is stated in [22] that MLP is the most effective speech emotion classifier, with accuracies higher than 90% for single-language approaches, followed closely by SVM. The results show that MLP outperforms SVM in overall emotion classification performance, and even though SVM training is faster compared to MLP, the ultimate accuracy of MLP is higher than that of SVM [57].…”
Section: Classifiersmentioning
confidence: 99%
See 1 more Smart Citation
“…Properly validated AI tools can reduce the possible subjectivity bias and “enrich” the scale when applied to the vocal test (even enabling daily evaluations), as different studies have already demonstrated [ 11 , 12 , 20 , 21 , 22 ]. However, the human voice can be potentially influenced by other issues ranging from environmental conditions to subject-specific characteristics [ 23 , 24 , 25 ], so that other forms of evidence are mandatory. In particular for PD, the effect of medication on speech production is still poorly addressed, with results ranging from no effects [ 26 ] to meaningful ones [ 27 ], while the differences can even depend on the specific phonemes investigated [ 23 , 28 , 29 ].…”
Section: Introductionmentioning
confidence: 99%
“…Other than being a reliable means to non-empirically quantify voice impairment in diseases that affect phonatory production, voice analysis is also a completely non-invasive, low-cost and pseudo-real-time solution for deploying telemedicine assessments. Voice-based AI solutions have been successfully experimentally investigated and employed in other medical fields such as dysphonia [ 31 , 32 , 33 ], COVID-19 and pulmonary diseases [ 20 , 22 , 34 , 35 ], and even emotion and stress recognition [ 24 , 36 ].…”
Section: Introductionmentioning
confidence: 99%