Proceedings of the 30th ACM International Conference on Multimedia 2022
DOI: 10.1145/3503161.3551591
|View full text |Cite
|
Sign up to set email alerts
|

The ACM Multimedia 2022 Computational Paralinguistics Challenge

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 27 publications
(14 citation statements)
references
References 42 publications
0
14
0
Order By: Relevance
“…Comparing the current models that used 3-s KSoF data with Schuller et al (2022) showed that our models performed less well. Schuller et al (2022) reported only unweighted average recall (UAR), achieving a 37.6 UAR in test using a set of one hundred principal components from a 6,373-feature set. In comparison, using a feature set of 1,136 on the KSoF intervals, the G-SVM yielded a UAR 25.47.…”
Section: Discussionmentioning
confidence: 84%
See 2 more Smart Citations
“…Comparing the current models that used 3-s KSoF data with Schuller et al (2022) showed that our models performed less well. Schuller et al (2022) reported only unweighted average recall (UAR), achieving a 37.6 UAR in test using a set of one hundred principal components from a 6,373-feature set. In comparison, using a feature set of 1,136 on the KSoF intervals, the G-SVM yielded a UAR 25.47.…”
Section: Discussionmentioning
confidence: 84%
“…This does not invalidate the conclusion that event-based approaches lead to better machine learning models since the UAR of the UCLASS event-based G-SVM (UAR = 40.66) outperformed Schuller’s reference. Rather, models can be further improved by: (a) Supplying a richer feature set as demonstrated by Schuller et al (2022) ; and (b) Using event-based segmentation methods.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Collecting a balanced dataset is difficult and expensive for the stuttering detection task. Other datasets such as Kassel State of Fluency (KSoF) (not publicly accessible) [36], Flu-encyBank [28] also suffer from this issue. Over the years, the class imbalance problem is one of the main concerns due to its prevalence, especially in the biomedical domain.…”
Section: A Class Imbalancementioning
confidence: 99%
“…To evaluate the model performance, we use the following metrics: macro F1-score and accuracy which are the standard and are widely used in the stuttered speech domain [27], [28], [31], [36], [67]. The macro F1-score (F 1 ) (which combines the advantages of both precision and recall in a single metric unlike unweighted average recall which only takes recall into account) from equation ( 3) is often used in class imbalance scenarios with the intention to give equal importance to frequent and infrequent classes, and also is more robust towards the error type distribution [68].…”
Section: Evaluation Metricsmentioning
confidence: 99%