“…For the intrinsic decomposition task, self-supervision is obtained via dense pixel correspondences across multiple views [54], training on sequences of multi-lit images or video streams [25,42], model-based shape reconstruction [46], or through a reconstruction loss (imposing consistency between the original images and the ones re-rendered from the estimated intrinsic components) while training on a mix of labeled and unlabeled datasets [37,36,16]. Here, we introduce a new self-supervised loss term that 1) reduces the need for pseudo-labels and multi-stage training [37], 2) does not require a sequence of images as input during training [25,42], and 3) does not rely on the strong priors imposed in [54,25,46] for training under limited supervision (no labels for albedo and normals), where intrinsic decomposition from a single image is highly ambiguous. Furthermore, compared to [28], which proposes an unsupervised intrinsic decomposition technique given multi-lit images at training time, we further disentangle the lighting component from the normals, thus facilitating relighting and light transfer between a source and a target image pair.…”
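The reconstruction loss mentioned above can be sketched as follows, assuming a simple Lambertian image-formation model (image = albedo ⊙ shading); the function name, array shapes, and NumPy implementation are illustrative, not the paper's actual training code.

```python
import numpy as np

def reconstruction_loss(image, albedo, shading):
    """Mean-squared error between the input image and its re-rendering
    from estimated intrinsic components, assuming I = A * S (Lambertian).
    All arrays are H x W x 3 with values in [0, 1]."""
    recon = albedo * shading  # re-render from the estimated components
    return float(np.mean((image - recon) ** 2))

# Toy check: a perfect decomposition reconstructs the image exactly,
# so the self-supervised loss vanishes without any ground-truth labels.
rng = np.random.default_rng(0)
albedo = rng.uniform(0.2, 0.9, size=(4, 4, 3))
shading = rng.uniform(0.1, 1.0, size=(4, 4, 3))
image = albedo * shading
print(reconstruction_loss(image, albedo, shading))  # → 0.0
```

Because the loss is computed purely from the input image, it can supervise training on unlabeled data; the ambiguity noted in the text arises because many (albedo, shading) pairs reconstruct the same image, which is why additional priors or loss terms are needed.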