2011
DOI: 10.1007/978-3-642-24571-8_53
|View full text |Cite
|
Sign up to set email alerts
|

AVEC 2011–The First International Audio/Visual Emotion Challenge

Abstract: Abstract. The Audio/Visual Emotion Challenge and Workshop (AVEC 2011) is the first competition event aimed at comparison of multimedia processing and machine learning methods for automatic audio, visual and audiovisual emotion analysis, with all participants competing under strictly the same conditions. This paper first describes the challenge participation conditions. Next follows the data used -the SEMAINE corpus -and its partitioning into train, development, and test partitions for the challenge with label… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
258
0

Year Published

2011
2011
2015
2015

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 208 publications
(260 citation statements)
references
References 13 publications
2
258
0
Order By: Relevance
“…The problem with this approach is that the feature space will end up extremely large (5900 dimensions of visual and 1941 of audio features in the case of Schuller et al [21]). This high dimensionality issue can be partially solved by performing dimensionality reduction or feature selection.…”
Section: Approachmentioning
confidence: 99%
See 2 more Smart Citations
“…The problem with this approach is that the feature space will end up extremely large (5900 dimensions of visual and 1941 of audio features in the case of Schuller et al [21]). This high dimensionality issue can be partially solved by performing dimensionality reduction or feature selection.…”
Section: Approachmentioning
confidence: 99%
“…For all our experiments we used the dataset provided by Schuller et al [21]. The dataset consist of 95 video and audio recorded dyadic interaction sessions between human participants and a virtual agent operated by a human.…”
Section: Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…For the audio dataset, each uttered word is described by a vector of 1941 features and a set of four labels representing the level of activation, valence, expectation and power. The detailed description of the features can be found in [13]. In order to reduce the dimension of the feature vector, PCA was used and only 100 principle components were selected in the following experiments as they covered most of the variance.…”
Section: Dataset and Featuresmentioning
confidence: 99%
“…Naturalistic expressions, differently from acted ones, change slowly as a person interacts with the environment. The AVEC challenge [13] provides a unique dataset of naturalistic audio and facial expressions to help address this issue. These data have been recorded at a high sampling rate making it possible to capture and analyze the slow transition between affective expressions.…”
Section: Introductionmentioning
confidence: 99%