“…Instead of representing facial shapes and appearances as a linear combination of basis vectors, these models are formulated implicitly as decoders using neural networks where the 3D faces are generated directly from latent vectors. Some of these methods use fully connected layers or 2D convolutions in image space [64,4,22,63,45], while others use decoders in the mesh domain to represent local geome-tries [49,54,19,26,50,43,47]. With the help of differentiable renderers [61,27,55], several methods [64,63,43] have demonstrated high-fidelity 3D face reconstructions using non-linear morphable face models using fully unsupervised or weakly supervised learning, which is possible using massive amounts of images in the wild.…”