2010
DOI: 10.1109/tmm.2010.2052239

A 3-D Audio-Visual Corpus of Affective Communication

Abstract: Communication between humans deeply relies on the capability of expressing and recognizing feelings. For this reason, research on human-machine interaction needs to focus on the recognition and simulation of emotional states, a prerequisite of which is the collection of affective corpora. Currently available datasets still represent a bottleneck due to the difficulties arising during the acquisition and labeling of affective data. In this work, we present a new audio-visual corpus for possibly the two most importan…

Cited by 104 publications (57 citation statements)
References 32 publications
“…We further performed 5-fold, subject-independent cross validations on the B3D(AC)² database [24], and the BU-3DFE database rendered both in frontal pose and with random rotations added. Table 4 shows mean and standard deviation of the errors in millimeters for all the analyzed facial features.…”
Section: Methods
confidence: 99%
“…As a first dataset, we chose B3D(AC)² [24]. This is a relatively naturalistic and large set of high quality, dynamic facial scans, with subjects recorded using the 3D scanner of [66] while pronouncing a set of 40 predefined sentences both in a neutral and in an induced emotional state.…”
Section: Datasets
confidence: 99%
“…Some of our experiments use the Biwi 3D Audiovisual Corpus of Affective Communication, Fanelli et al. [2010a]. This re-acted corpus comprises depth-scanned data captured using a novel method allowing for 3D data capture using relatively inexpensive equipment (a digital projector and three cameras), by employing phase shifting and stereo unwrapping (see Weise et al. [2007] for a detailed description).…”
Section: Biwi 3D Audiovisual Corpus of Affective Communication
confidence: 99%
“…Experiments were conducted on the BIWI 3D Audiovisual Corpus of Affective Communication [40] comprising a total of 1109 sentences (4.67 seconds long on average) uttered by 14 native English speakers (6 males and 8 females). The dense dynamic face scans were acquired at 25 frames per second by a real-time 3D scanner and the voice signal was captured by a professional microphone at a sampling rate of 16 kHz.…”
Section: Audio-Visual Affective Speech
confidence: 99%
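The figures quoted in this statement (1109 sentences, 4.67 s average length, 25 fps scans, 16 kHz audio) imply a rough overall scale for the corpus; a minimal back-of-the-envelope sketch, using only those quoted numbers:

```python
# Corpus-scale arithmetic from the figures quoted above.
n_sentences = 1109        # total sentences in the corpus
avg_dur_s = 4.67          # average sentence length in seconds
scan_fps = 25             # 3D scanner frame rate
audio_hz = 16_000         # microphone sampling rate

total_s = n_sentences * avg_dur_s       # ~5179 s of affective speech
total_scans = total_s * scan_fps        # ~130 thousand dynamic face scans
total_samples = total_s * audio_hz      # ~83 million audio samples

print(f"{total_s / 3600:.2f} hours of speech")
print(f"~{total_scans:,.0f} 3D scans, ~{total_samples:,.0f} audio samples")
```

So the corpus amounts to roughly an hour and a half of densely scanned, emotionally labeled speech.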
“…In the offline version, the 3D BIWI dataset [40] is used to train the GMM model between any two speakers. The training is done on 40 utterances performed in a neutral manner by both speakers.…”
Section: Audio-Visual Face Retargeting
confidence: 99%
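The GMM mapping between two speakers mentioned here is typically realized as a joint-density model (in the style of GMM-based voice conversion): fit a GMM on stacked source/target feature pairs, then map a new source frame to the conditional mean of the target given the source. A minimal sketch under that assumption, with synthetic stand-in features (the actual feature extraction, time alignment, and the 40 training utterances are assumed, not shown):

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)

# Stand-ins for time-aligned per-frame features of two speakers
# (hypothetical placeholders, not the paper's features); shape (n_frames, d).
d = 4
X = rng.normal(size=(500, d))                                      # source
Y = X @ rng.normal(size=(d, d)) + 0.1 * rng.normal(size=(500, d))  # target

# Fit a GMM on the joint features z = [x; y].
gmm = GaussianMixture(n_components=4, covariance_type="full",
                      random_state=0).fit(np.hstack([X, Y]))

def gauss_pdf(x, mu, cov):
    """Multivariate normal density with full covariance."""
    diff = x - mu
    expo = -0.5 * diff @ np.linalg.solve(cov, diff)
    norm = np.sqrt((2 * np.pi) ** len(mu) * np.linalg.det(cov))
    return np.exp(expo) / norm

def convert(x):
    """Map a source frame to E[y | x] under the joint GMM."""
    mx, my = gmm.means_[:, :d], gmm.means_[:, d:]
    cxx = gmm.covariances_[:, :d, :d]   # source-source covariance blocks
    cyx = gmm.covariances_[:, d:, :d]   # target-source covariance blocks
    # Responsibilities p(k | x) from the marginal GMM over x.
    w = np.array([gmm.weights_[k] * gauss_pdf(x, mx[k], cxx[k])
                  for k in range(gmm.n_components)])
    w /= w.sum()
    # Mixture of per-component conditional means.
    return sum(w[k] * (my[k] + cyx[k] @ np.linalg.solve(cxx[k], x - mx[k]))
               for k in range(gmm.n_components))

y_hat = convert(X[0])  # predicted target-speaker frame for one source frame
```

With 40 neutral utterances from each speaker, as quoted above, the same fit-then-condition procedure would be applied to their aligned feature sequences.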