Abstract. In this work, we synthesise facial animation by modelling the mapping between facial motion and speech with a shared Gaussian process latent variable model. The audio and visual data are processed separately and then coupled to yield a shared latent space, in which a dynamical model captures coarticulation. To synthesise novel animation, we first infer intermediate latent points from the audio data and then use a Gaussian process mapping to predict the corresponding visual features. Statistical evaluation of the generated visual features against ground-truth data compares favourably with established speech-animation methods, and the generated videos show correct synchronisation with the audio and exhibit plausible facial dynamics.
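The final step described above, predicting visual features from latent points via a Gaussian process mapping, can be sketched with plain GP regression. This is a minimal illustration, not the paper's implementation: the kernel hyperparameters, dimensionalities, and toy "visual features" below are all assumptions.

```python
import numpy as np

def rbf_kernel(A, B, lengthscale=1.0, variance=1.0):
    """Squared-exponential kernel between the rows of A and B."""
    d2 = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2 * A @ B.T
    return variance * np.exp(-0.5 * d2 / lengthscale**2)

def gp_predict(X_train, Y_train, X_new, noise=1e-4):
    """GP posterior mean: maps new latent points to visual feature vectors."""
    K = rbf_kernel(X_train, X_train) + noise * np.eye(len(X_train))
    K_star = rbf_kernel(X_new, X_train)
    return K_star @ np.linalg.solve(K, Y_train)

# Toy stand-in data: 2-D latent points paired with 4-D "visual features".
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 2))                       # shared latent points
Y = np.column_stack([np.sin(X[:, 0]), np.cos(X[:, 1]),
                     X[:, 0] * X[:, 1], X.sum(1)]) # hypothetical visual features
Y_pred = gp_predict(X, Y, X[:3])                   # predict at known latent points
```

With near-zero observation noise the posterior mean interpolates the training targets, so predictions at training latent points recover the corresponding visual features almost exactly.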
We present a novel approach to speech-driven facial animation using a non-parametric switching state-space model based on Gaussian processes. The model extends the shared Gaussian process dynamical model with switching states. Audio and visual data from a talking-head corpus are jointly modelled using the proposed method, with the switching states identified by variable-length Markov models trained on labelled phonetic data. We also propose a synthesis technique that takes into account both previous and future phonetic context, thus accounting for coarticulatory effects in speech.
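The variable-length Markov models mentioned above select a switching state from a phonetic context by matching the longest context suffix stored in the model. A minimal sketch of that lookup follows; the context tree, phone labels, and state distributions are hypothetical placeholders, not the corpus-trained model from the paper.

```python
def vlmm_state(context, model):
    """Return the next-state distribution for the longest suffix of `context`
    present in `model` (a dict mapping phone-label tuples to distributions)."""
    for k in range(len(context), -1, -1):
        suffix = tuple(context[len(context) - k:])
        if suffix in model:
            return model[suffix]
    raise KeyError("model must contain the empty context ()")

# Hypothetical context tree over phone labels.
model = {
    (): {"s1": 0.5, "s2": 0.5},            # back-off for unseen contexts
    ("ah",): {"s1": 0.8, "s2": 0.2},
    ("b", "ah"): {"s1": 0.1, "s2": 0.9},   # longer context refines the state
}

dist = vlmm_state(["k", "b", "ah"], model)  # matches the suffix ("b", "ah")
```

Longer suffixes take precedence, so the model uses as much phonetic context as it has evidence for and backs off to shorter contexts otherwise.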