HeadGAN: One-shot Neural Head Synthesis and Editing

Doukas, Michail Christos; Zafeiriou, Stefanos; Sharmanska, Viktoriia

doi:10.1109/iccv48922.2021.01413

Cited by 47 publications

(18 citation statements)

References 44 publications

“…X2Face [22]: 256; Bi-layer [68]: 256; FOMM [40]: 256; HeadGAN [15]: 256; face-Vid2Vid [58]: 512; HDTF [72]: 512; PC-AVS [74]: 224; wav2lip [37]: 96; PIRenderer [38]: 256; Ours: 1024. … called W space, which is constructed by mapping a normal distribution to a new distribution via a multi-layer perceptron (MLP).…”

Section: Feature Resolution Video Audio Intuitive Attribute Driven Dr... (mentioning)

confidence: 99%

“…NS-PVD [30] extends DVP by a novel target-style preserving recurrent GAN. Recent 3D model-based methods [15,17,18,38] can also do a good job for subject-agnostic face synthesis. HeadGAN [15] pre-processes the 3D mesh as input of the network.…”

Section: Feature Resolution Video Audio Intuitive Attribute Driven Dr... (mentioning)

confidence: 99%

“…Recent 3D model-based methods [15,17,18,38] can also do a good job for subject-agnostic face synthesis. HeadGAN [15] pre-processes the 3D mesh as input of the network. PIRenderer [38] predicts a flow field for feature warping.…”

Section: Feature Resolution Video Audio Intuitive Attribute Driven Dr... (mentioning)

confidence: 99%

“…To achieve accurate and intuitive motion control, semantic medium plays an important role in the generation process. Following previous works [15,38], we take advantage of the 3DMM [7] parameters for motion modeling. In 3DMM, the 3D shape 𝑺 of a face can be decoupled as:…”

Section: Video-driven Motion Generator (mentioning)

confidence: 99%

“…There are many attempts to drive a static portrait with a video or audio from different perspectives in recent literature. A set of methods [15,38,65,72] take advantage of 3D Morphable Models (3DMMs), a parametric model that decomposes expression, pose, and identity, to transfer facial motions. For the audio-driven case, the audio features are always projected to the parameter space of 3DMM [65,67,70].…”

Section: Introduction (mentioning)

confidence: 99%
