AVEC 2011–The First International Audio/Visual Emotion Challenge

Schuller, Björn; Valstar, Michel; Eyben, Florian; McKeown, Gary; Cowie, Roddy; Pantić, Maja

doi:10.1007/978-3-642-24571-8_53

Cited by 208 publications

(260 citation statements)

References 13 publications

Supporting

Mentioning

258

Contrasting

Order By: Relevance

“…The problem with this approach is that the feature space will end up extremely large (5900 dimensions of visual and 1941 of audio features in the case of Schuller et al [21]). This high dimensionality issue can be partially solved by performing dimensionality reduction or feature selection.…”

Section: Approachmentioning

confidence: 99%

“…For all our experiments we used the dataset provided by Schuller et al [21]. The dataset consist of 95 video and audio recorded dyadic interaction sessions between human participants and a virtual agent operated by a human.…”

Section: Datasetmentioning

confidence: 99%

“…Our model is evaluated on the First International Audio/Visual Emotion Challenge (AVEC 2011) dataset. A complete description of the challenge and the dataset can be found in Shuller et al [21]. For the challenge the originally continuous dimensions were redefined as binary ones based on whether they were above or below average, this reduced a regression problem into a classification one.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Modeling Latent Discriminative Dynamic of Multi-dimensional Affective Signals

Ramirez

Baltrušaitis

Morency

2011

Affective Computing and Intelligent Interaction

View full text Add to dashboard Cite

Abstract. During face-to-face communication, people continuously exchange para-linguistic information such as their emotional state through facial expressions, posture shifts, gaze patterns and prosody. These affective signals are subtle and complex. In this paper, we propose to explicitly model the interaction between the high level perceptual features using Latent-Dynamic Conditional Random Fields. This approach has the advantage of explicitly learning the sub-structure of the affective signals as well as the extrinsic dynamic between emotional labels. We evaluate our approach on the Audio-Visual Emotion Challenge (AVEC 2011) dataset. By using visual features easily computable using off-theshelf sensing software (vertical and horizontal eye gaze, head tilt and smile intensity), we show that our approach based on LDCRF model outperforms previously published baselines for all four affective dimensions. By integrating audio features, our approach also outperforms the audio-visual baseline.

show abstract

Section: Approachmentioning

confidence: 99%

Section: Datasetmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Modeling Latent Discriminative Dynamic of Multi-dimensional Affective Signals

Ramirez

Baltrušaitis

Morency

2011

Affective Computing and Intelligent Interaction

View full text Add to dashboard Cite

show abstract

“…For the audio dataset, each uttered word is described by a vector of 1941 features and a set of four labels representing the level of activation, valence, expectation and power. The detailed description of the features can be found in [13]. In order to reduce the dimension of the feature vector, PCA was used and only 100 principle components were selected in the following experiments as they covered most of the variance.…”

Section: Dataset and Featuresmentioning

confidence: 99%

“…Naturalistic expressions, differently from acted ones, change slowly as a person interacts with the environment. The AVEC challenge [13] provides a unique dataset of naturalistic audio and facial expressions to help address this issue. These data have been recorded at a high sampling rate making it possible to capture and analyze the slow transition between affective expressions.…”

Section: Introductionmentioning

confidence: 99%

Naturalistic Affective Expression Classification by a Multi-stage Approach Based on Hidden Markov Models

Meng

Bianchi-Berthouze

2011

Affective Computing and Intelligent Interaction

View full text Add to dashboard Cite

Abstract. In naturalistic behaviour, the affective states of a person change at a rate much slower than the typical rate at which video or audio is recorded (e.g. 25fps for video). Hence, there is a high probability that consecutive recorded instants of expressions represent a same affective content. In this paper, a multi-stage automatic affective expression recognition system is proposed which uses Hidden Markov Models (HMMs) to take into account this temporal relationship and finalize the classification process. The hidden states of the HMMs are associated with the levels of affective dimensions to convert the classification problem into a best path finding problem in HMM. The system was tested on the audio data of the Audio/Visual Emotion Challenge (AVEC) datasets showing performance significantly above that of a one-stage classification system that does not take into account the temporal relationship, as well as above the baseline set provided by this Challenge. Due to the generality of the approach, this system could be applied to other types of affective modalities.

show abstract

Appendix

2013

Computational Paralinguistics

View full text Add to dashboard Cite

AVEC 2011–The First International Audio/Visual Emotion Challenge

Cited by 208 publications

References 13 publications

Modeling Latent Discriminative Dynamic of Multi-dimensional Affective Signals

Modeling Latent Discriminative Dynamic of Multi-dimensional Affective Signals

Naturalistic Affective Expression Classification by a Multi-stage Approach Based on Hidden Markov Models

Appendix

Contact Info

Product

Resources

About