2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00232
|View full text |Cite
|
Sign up to set email alerts
|

Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation

Abstract: Figure 1. New "pets" generated using ConceptLab. Each pair depicts a learned concept that was optimized to be unique and distinct from existing members of the pet category. Our method can generate a variety of novel concepts from a single broad category.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
590
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 793 publications
(592 citation statements)
references
References 48 publications
(51 reference statements)
1
590
0
1
Order By: Relevance
“…To manipulate real images, it is first necessary to invert them into their latent code representations. This is typically done via per-image optimization [63,38,8,12,1,2,28,54,48] or by training an encoder to learn a direct mapping from a given image to its corresponding latent code [63,45,62,46,47,55,5,9]. For a comprehensive survey on GAN inversion, we refer the reader to Xia et al [60].…”
Section: Real Image Editingmentioning
confidence: 99%
“…To manipulate real images, it is first necessary to invert them into their latent code representations. This is typically done via per-image optimization [63,38,8,12,1,2,28,54,48] or by training an encoder to learn a direct mapping from a given image to its corresponding latent code [63,45,62,46,47,55,5,9]. For a comprehensive survey on GAN inversion, we refer the reader to Xia et al [60].…”
Section: Real Image Editingmentioning
confidence: 99%
“…Recently, many works have been developed for the task of GAN inversion, i.e., reversing a given image back to a latent code with a pre-trained GAN model. Existing methods either optimize the latent code [1] or learn an extra encoder to project the image space back to the latent space [12,38]. Abdal et al [1] embedded images into an extended latent space of StyleGAN, allowing further semantic image editing operations.…”
Section: Related Workmentioning
confidence: 99%
“…These optimization-based methods, however, are slow and improper for real-world applications. To address this issue, Pixel2Style2Pixel (pSp) [38] embeds real images into extended latent space without additional optimization, which can be used in a wide range of imageto-image translation tasks. Menon et al [34] proposed a self-supervised approach that traverses the HR natural image manifold, searching for images that can downscale to the original LR image.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations