Interspeech 2011 2011
DOI: 10.21437/interspeech.2011-801
|View full text |Cite
|
Sign up to set email alerts
|

The INTERSPEECH 2011 speaker state challenge

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0
4

Year Published

2013
2013
2022
2022

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 143 publications
(51 citation statements)
references
References 14 publications
0
44
0
4
Order By: Relevance
“…To make a fair comparison with the current benchmark and future studies based on MSC-COVID-19, we use the unweighted average recall (UAR) as the main evaluation metric (e.g., to optimize the models’ hyperparameters on the dev set). UAR takes the data imbalance characteristics into account [85] , which can avoid an overly optimistic evaluation by using the weighted average recall (WAR), i.e., accuracy. Its value is defined as \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}\begin{equation*} \text {UAR}=\frac {\sum \nolimits _{i=1}^{N_{\mathrm{class}}}{\mathrm{ Recall}}_{i}}{N_{\mathrm{class}}}\tag{1}\end{equation*} \end{document} where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}${\mathrm{ Recall}}_{i}$ \end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$N_{\mathrm{class}}$ \end{document} are the Recall of the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$i$ \end{document} th class and the number of classes, respectively.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…To make a fair comparison with the current benchmark and future studies based on MSC-COVID-19, we use the unweighted average recall (UAR) as the main evaluation metric (e.g., to optimize the models’ hyperparameters on the dev set). UAR takes the data imbalance characteristics into account [85] , which can avoid an overly optimistic evaluation by using the weighted average recall (WAR), i.e., accuracy. Its value is defined as \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}\begin{equation*} \text {UAR}=\frac {\sum \nolimits _{i=1}^{N_{\mathrm{class}}}{\mathrm{ Recall}}_{i}}{N_{\mathrm{class}}}\tag{1}\end{equation*} \end{document} where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}${\mathrm{ Recall}}_{i}$ \end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$N_{\mathrm{class}}$ \end{document} are the Recall of the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$i$ \end{document} th class and the number of classes, respectively.…”
Section: Methodsmentioning
confidence: 99%
“…It is also found that COVID-19 patients should have a lack of appetite [29] , which can be detected via the eating behavior analysis while speaking [30] . Sleepiness assessment can be implemented in both a binary classification task [31] and a regression estimation task (with Karolinska sleepiness scale) [32] . Considering the high mortality risk among the elderly group (a slightly higher mortality rate in male individuals) [24] , age and gender information could be of interest to be identified by speech [33] , [34] .…”
Section: Background and Motivationmentioning
confidence: 99%
“…The OpenFace (Baltrušaitis et al, 2015(Baltrušaitis et al, , 2018 software package was used to extract 17 different Ekman and Friesen (1976) facial action units (FACs) defined by per video frame. Audeering's openS-MILE (Schuller et al, 2009;Eyben et al, 2010) toolkit was used to extract 34 different audio features, including pitch, intensity, speech rate, and MFCCs per frame. Finally, Amazon Transcribe Using the output of Amazon Transcribe, each participant's text was divided into a sequential list of tokens, where a token could be a spoken word or a period of silence.…”
Section: Feature Extractionmentioning
confidence: 99%
“…The OpenFace (Baltrušaitis et al, 2015(Baltrušaitis et al, , 2018 software package was used to extract 17 different Ekman and Friesen (1976) facial action units (FACs) defined by per video frame. Audeering's openS-MILE (Schuller et al, 2009;Eyben et al, 2010) toolkit was used to extract 34 different audio features, including pitch, intensity, speech rate, and MFCCs per frame. Finally, Amazon Transcribe Using the output of Amazon Transcribe, each participant's text was divided into a sequential list of tokens, where a token could be a spoken word or a period of silence.…”
Section: Feature Extractionmentioning
confidence: 99%