2022
DOI: 10.1145/3544777
|View full text |Cite
|
Sign up to set email alerts
|

Pivotal Tuning for Latent-based Editing of Real Images

Abstract: Recently, numerous facial editing techniques have been proposed that leverage the generative power of a pretrained StyleGAN. To successfully edit an image this way, one must first project (or invert) the image into the pretrained generator’s domain. As it turns out, StyleGAN’s latent space induces an inherent tradeoff between distortion and editability, i.e., between maintaining the original appearance and convincingly altering its attributes. Hence, it remains challenging to apply ID-preserving edits to real … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
139
0
2

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 241 publications
(141 citation statements)
references
References 41 publications
0
139
0
2
Order By: Relevance
“…Editing a real image requires finding an initial noise vector that produces the given input image when fed into the diffusion process. This process, known as inversion, has recently drawn considerable attention for GANs, e.g., [51,1,3,35,50,43,45,47], but has not yet been fully addressed for text-guided diffusion models.…”
Section: Applicationsmentioning
confidence: 99%
“…Editing a real image requires finding an initial noise vector that produces the given input image when fed into the diffusion process. This process, known as inversion, has recently drawn considerable attention for GANs, e.g., [51,1,3,35,50,43,45,47], but has not yet been fully addressed for text-guided diffusion models.…”
Section: Applicationsmentioning
confidence: 99%
“…Lin et al [137] (Mar 2022) propose a method for multi-view consistent video editing and animation based on 3D GAN inversion. They invert the video frames into the latent space of a pi-GAN by using pivotal tuning inversion (PTI) [146] and edit face attributes by using StyleFlow [97]. IDE-3D [61] adopts a hybrid GAN inversion approach.…”
Section: Conditional 3d Generative Modelsmentioning
confidence: 99%
“…Recent works [55, 19,3,45,56,67,48] have demonstrated semantic manipulation, especially for facial attributes, by analyzing the manifold and finding meaningful direction or mapping. Combining with GAN inversion [1,73,2,52,64,53,4,5], the applications of 2D GANs have been extended to real image editing. Alternatively, there have been studies [11,27,36,25] that discover and disentangle latent embeddings into interpretable dimensions during training of the generator.…”
Section: Related Workmentioning
confidence: 99%
“…despite their capability of multi-view consistency. Recently proposed EG3D [8] has shown experiments of novel view synthesis and presented outstanding results, but it requires iterative optimization for latent code and fine-tuning of the generator [53] for each target image.…”
Section: Related Workmentioning
confidence: 99%