2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
DOI: 10.1109/cvpr.2015.7298776

Unconstrained realtime facial performance capture

Abstract: We introduce a realtime facial tracking system specifically designed for performance capture in unconstrained settings using a consumer-level RGB-D sensor. Our framework provides uninterrupted 3D facial tracking, even in the presence of extreme occlusions such as those caused by hair, hand-to-face gestures, and wearable accessories. Anyone's face can be instantly tracked, and users can be switched without an extra calibration step. During tracking, we explicitly segment face regions from any occluding parts…

Cited by 102 publications (68 citation statements)
References 41 publications
“…Generic blendshape models are used by some face tracking methods from monocular RGB video [Garrido et al. 2013] or RGB-D video [Weise et al. 2011], but they need to be deformed to match a static face scan or a set of scanned static expressions of an actor prior to tracking. Such generic blendshape adaptation fails to capture person-specific expression details, which is why some recent approaches estimate identity and blendshape parameters from captured face animations, as well as person-specific correctives on top of this generic face model [Bouaziz et al. 2013; Li et al. 2013; Hsieh et al. 2015]. However, all these approaches require RGB-D camera input.…”
Section: Related Work
confidence: 94%
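The statement above contrasts a generic blendshape rig with approaches that estimate person-specific correctives on top of it. A minimal numpy sketch of that model structure follows; all names and array shapes are illustrative assumptions, not the cited authors' code:

```python
# Sketch of a linear blendshape face model with optional person-specific
# corrective offsets (cf. the generic-rig-plus-correctives idea above).
import numpy as np

def blendshape_face(neutral, deltas, weights, correctives=None):
    """Evaluate a blendshape rig.

    neutral:     (V, 3) neutral face mesh vertices
    deltas:      (K, V, 3) expression offsets b_k - b_0 of a generic rig
    weights:     (K,) expression activations in [0, 1]
    correctives: optional (V, 3) person-specific offsets learned on top
                 of the generic model
    """
    # b_0 + sum_k w_k * (b_k - b_0)
    face = neutral + np.tensordot(weights, deltas, axes=1)
    if correctives is not None:
        # person-specific detail the generic rig cannot express
        face = face + correctives
    return face

# Toy usage: 4 expressions on a 1000-vertex mesh.
V, K = 1000, 4
b0 = np.random.rand(V, 3)
d = np.random.rand(K, V, 3) * 0.01
w = np.array([0.8, 0.0, 0.3, 0.0])
mesh = blendshape_face(b0, d, w)
```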
“…They use the depth and color information of an RGB-D camera to robustly segment the face region. [Figure caption residue: image taken from [HMYL15]; Figure 8 shows the occlusion-robust RGB face tracker of [SLL16].]…”
Section: Handling Occlusions
confidence: 99%
“…In general, they are based on a segmentation mask that disables the data-fitting terms in occluded regions. Hsieh et al. [HMYL15] use the depth and color information of an RGB-D camera to robustly segment the visible face region (see Fig. 7).…”
Section: Handling Occlusions
confidence: 99%
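The mask-based occlusion handling described in the statement above can be summarized in a short sketch: the visibility mask simply removes occluded vertices from the data-fitting term. This is a minimal sketch assuming a linear point-to-point data term and hypothetical shapes; the cited papers' actual energies are richer:

```python
# Least-squares blendshape weights fit using only visible vertices, so
# occluded regions contribute nothing to the data term.
import numpy as np

def fit_weights_masked(neutral, deltas, target, visible):
    """neutral: (V, 3) neutral mesh
    deltas:  (K, V, 3) expression offsets
    target:  (V, 3) observed points corresponding to mesh vertices
    visible: (V,) boolean mask, False where the face is occluded
    """
    K, V, _ = deltas.shape
    A = deltas.reshape(K, V * 3).T         # (3V, K) linearized model
    b = (target - neutral).reshape(V * 3)  # (3V,) residual to explain
    rows = np.repeat(visible, 3)           # expand mask to x, y, z rows
    # Solve only over visible rows; occluded data is disabled entirely.
    w, *_ = np.linalg.lstsq(A[rows], b[rows], rcond=None)
    return w
```

Because the occluded rows are dropped rather than down-weighted, outliers from hair or hands cannot bias the solve, which is the core of the segmentation-mask idea.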
“…An adaptive scheme was proposed in [22] to capture more detail with point-to-point deformation on top of blendshapes. To explicitly deal with outliers caused by occlusions, a method was proposed in [19] to segment the face and complete the occluded parts based on the blendshape model; it was later extended to RGB input in [28]. Binocular stereo systems, on the other hand, can provide higher resolution and work outdoors directly under sunlight, but are more prone to suffering from lighting variation.…”
Section: Expression Clustering
confidence: 99%
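The point-to-point deformation layered on top of blendshapes, as mentioned in the statement above, admits an equally small sketch. The damping parameter `lam` and all names here are illustrative assumptions, not the scheme from [22] itself:

```python
# Per-vertex corrective offsets pulling a blendshape fit toward the
# observations; occluded vertices stay on the blendshape prior.
import numpy as np

def corrective_offsets(fitted, target, visible, lam=0.5):
    """fitted:  (V, 3) mesh produced by the blendshape fit
    target:  (V, 3) observed points
    visible: (V,) boolean mask; False where the face is occluded
    lam:     damping in [0, 1]; 0 disables correctives, 1 snaps to data
    """
    offsets = np.zeros_like(fitted)
    offsets[visible] = lam * (target[visible] - fitted[visible])
    return fitted + offsets
```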