2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00339
|View full text |Cite
|
Sign up to set email alerts
|

Depth-Aware Generative Adversarial Network for Talking Head Video Generation

Abstract: Predominant techniques on talking head generation largely depend on 2D information, including facial appearances and motions from input face images. Nevertheless, dense 3D facial geometry, such as pixel-wise depth, plays a critical role in constructing accurate 3D facial structures and suppressing complex background noises for generation. However, dense 3D annotations for facial videos is prohibitively costly to obtain. In this work, firstly, we present a novel self-supervised method for learning dense 3D faci… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
36
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 66 publications
(36 citation statements)
references
References 105 publications
0
36
0
Order By: Relevance
“…Talking head generation works can be broadly classified in three categories based on the type of input they use to generate a talking head: Text-driven [16,33,36], Audio-driven [9,13,18,31,37,43,45], and Video-driven [12,27,29,39,44] Talking Head Generation.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Talking head generation works can be broadly classified in three categories based on the type of input they use to generate a talking head: Text-driven [16,33,36], Audio-driven [9,13,18,31,37,43,45], and Video-driven [12,27,29,39,44] Talking Head Generation.…”
Section: Related Workmentioning
confidence: 99%
“…The motion field was used to calculate dense flow and warp the source frame in a latent space. Several other works [39,12] followed the same principle and added supplementary components to improve the quality. Face-vid2vid [39] used keypoint information in a 3D space, taking care of head rotation, among other things.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations