2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.00209
|View full text |Cite
|
Sign up to set email alerts
|

StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
439
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
2

Relationship

2
6

Authors

Journals

citations
Cited by 710 publications
(520 citation statements)
references
References 22 publications
1
439
0
1
Order By: Relevance
“…Similarly, [1] uses conditional continuous normalization flows to perform supervised attribute processing in the latent space of StyleGAN2. Recently, text-based manipulation methods have been proposed [15,21], which use CLIP [24] to perform fine-grained and disentangled manipulations of images.…”
Section: Latent Space Manipulationmentioning
confidence: 99%
“…Similarly, [1] uses conditional continuous normalization flows to perform supervised attribute processing in the latent space of StyleGAN2. Recently, text-based manipulation methods have been proposed [15,21], which use CLIP [24] to perform fine-grained and disentangled manipulations of images.…”
Section: Latent Space Manipulationmentioning
confidence: 99%
“…Shen et al [51] perform eigenvalue decomposition on the affine transformation layers of StyleGAN2 generators [28] to learn versatile manipulation directions. Xia et al [59] and Patashnik and Wu et al [44] manipulate images using a human-understandable text prompt providing a more intuitive image editing interface.…”
Section: Gan-based Image Editingmentioning
confidence: 99%
“…Instead, one may view our work as complementing these existing approaches. For example, as shall be shown, pairing StyleFusion with GANSpace [17] or StyleCLIP [44] leverages their diverse manipulations while ensuring that the resulting edits alter only the desired semantic regions. [50], Style-CLIP [44]) results in more precise image manipulations.…”
Section: Gan-based Image Editingmentioning
confidence: 99%
See 1 more Smart Citation
“…Our method consists of four steps: First, we find a linear editing direction responsible for a faulty attribute which we wish to enhance. Such directions can be found with weak supervision [32], in a zero-shot manner [24] or even in an un-supervised fashion [16,33]. Second, we build on prior observations that latent space distances are linearly correlated with semantic attribute strengths [22].…”
Section: Introductionmentioning
confidence: 99%