2022
DOI: 10.1007/978-3-031-19784-0_21
|View full text |Cite
|
Sign up to set email alerts
|

Temporally Consistent Semantic Video Editing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 47 publications
0
10
0
Order By: Relevance
“…Other methods combine the optimization and encoder approaches and propose hybrid strategies by using the encoder for initialization and refining the latent code by optimization [35,95]. Recent 2D GAN inversion methods achieve faithful reconstruction with high editing capabilities and have been extended for video editing [2,75,88]. However, editing 3D-related attributes such as camera parameters and head pose remains inconsistent and prone to severe flickering as the pre-trained generator is unaware of the 3D structure.…”
Section: Gan Inversion Gan Inversion Maps a Real Image Backmentioning
confidence: 99%
“…Other methods combine the optimization and encoder approaches and propose hybrid strategies by using the encoder for initialization and refining the latent code by optimization [35,95]. Recent 2D GAN inversion methods achieve faithful reconstruction with high editing capabilities and have been extended for video editing [2,75,88]. However, editing 3D-related attributes such as camera parameters and head pose remains inconsistent and prone to severe flickering as the pre-trained generator is unaware of the 3D structure.…”
Section: Gan Inversion Gan Inversion Maps a Real Image Backmentioning
confidence: 99%
“…Generative models have demonstrated remarkable ability in synthesizing photorealistic images, including human faces [27]. Recent work has extended these models to add intuitive semantic editing, such as synthesis of glasses on faces [20,30,35,70,73]. Fader Networks [30] disentangle the salient image information, and then generate different images by varying attribute values, including glasses on faces.…”
Section: Related Workmentioning
confidence: 99%
“…Subsequent work has proposed two decoders for modeling latent representations and facial attributes [20], selective transfer units [35], and geometryaware flow [76] to further improve editing fidelity. Yao et al [73] extend facial attribute editing to video sequences via latent transformation and a identity preservation loss, which is further improved by Xu et al [70], incorporating flow-based consistency. More recent works propose 3Daware generative models to achieve view-consistent synthesis [8,10,49,55,63,67,71].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations