2019
DOI: 10.48550/arxiv.1907.07171
Preprint

On the "steerability" of generative adversarial networks

Abstract: Figure 1: Learned latent space trajectories in generative adversarial networks correspond to visual transformations like camera shift and zoom. These transformations can change the distribution of generated data, but only so much: biases in the data, like centered objects, reveal themselves as objects get "stuck" at the image borders when we try to shift them out of frame. Take the "steering wheel", drive in the latent space, and explore the natural image manifold via generative transformations!
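The "drive in the latent space" in the abstract amounts to moving a latent code along a direction vector. A minimal sketch of such a linear walk, with a random placeholder standing in for the learned direction (the paper learns it against a target transformation; `steer` and the dimensions here are illustrative, not the authors' code):

```python
import numpy as np

def steer(z, w, alpha):
    """Return the steered latent code z + alpha * w."""
    return z + alpha * w

rng = np.random.default_rng(0)
z = rng.standard_normal(128)   # a sampled latent code
w = rng.standard_normal(128)   # placeholder for a learned walk direction
w /= np.linalg.norm(w)         # unit-norm direction

# Sweeping alpha from negative to positive traverses the transformation
# (e.g. shifting the object left/right); the images would be G(steer(z, w, a))
# for a generator G. At alpha = 0 the code is unchanged.
trajectory = [steer(z, w, a) for a in np.linspace(-3, 3, 7)]
```

Objects getting "stuck" at image borders corresponds to the generator saturating along this walk for large |alpha|, rather than the walk itself failing.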

Cited by 57 publications (81 citation statements)
References 20 publications
“…The approaches of semantic image editing with StyleGAN can be roughly divided into two groups, i.e., supervised approaches and unsupervised approaches. The supervised approaches [3,9,13,14,19,25] introduce pretrained classifiers to find the directions that alter the output of the classifiers. For example, InterfaceGAN [1] trains linear support vector machines (SVMs) using the attribute annotations labeled by the off-the-shelf classifiers and finds hyperplanes in the latent space serving as the separation boundary.…”
Section: Related Work
confidence: 99%
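The InterfaceGAN-style recipe quoted above (label latent codes with an off-the-shelf classifier, fit a linear boundary, use its normal as the edit direction) can be sketched in a self-contained way. For brevity this sketch uses a toy linear "classifier" and the normalized difference of class means as a stand-in for the SVM normal; on separable isotropic data both recover essentially the same direction. All names here are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 16
true_dir = rng.standard_normal(d)          # attribute axis the toy classifier uses
true_dir /= np.linalg.norm(true_dir)

Z = rng.standard_normal((1000, d))         # sampled latent codes
labels = Z @ true_dir > 0                  # "off-the-shelf classifier" labels (toy)

# Separation-boundary normal: difference of the two class means, normalized.
n = Z[labels].mean(axis=0) - Z[~labels].mean(axis=0)
n /= np.linalg.norm(n)

# Moving a latent code along +n raises the attribute score, which is the
# editing operation the quoted approaches perform.
z = rng.standard_normal(d)
score = lambda v: float(v @ true_dir)
before, after = score(z), score(z + 2.0 * n)
```

With a real GAN, `Z` would be sampled latent codes, `labels` would come from a pretrained attribute classifier run on the generated images, and `n` would be the SVM hyperplane normal.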
“…GAN Dissection [5] found that causal feature maps are specialized to synthesize specific visual concepts in generated images. Some previous works [9,10,14,32] find that rich semantic information is encoded in the latent space of StyleGAN, e.g., the Z or W space, and that various semantic manipulations can be achieved by moving the latent code along a direction in the latent space. But modifications to the latent code in the Z or W space are spatially entangled.…”
Section: Introduction
confidence: 99%
“…[Table comparing supervision of editing methods. Latent space traversal: Zhu et al. [17] unsupervised; [19,14] self-supervised; InterFaceGAN [9], GANALYZE [13], StyleSpace [16], and LELSD (ours) supervised. Image composition: Bau et al. [20], Chai et al. [21], and Editing in Style [15] unsupervised; Zhang et al. [7] and Barbershop [22] supervised.] …not only agnostic to the GAN architecture, but also able to effectively disentangle the semantic attributes. To this end, we propose Locally Effective Latent Space Directions (LELSD), a framework to find the latent space directions that affect local regions of the output image.…”
Section: Latent Space Traversal
confidence: 99%
“…To solve this problem, [9,13,14] use external supervision and find latent space directions that yield the desired change in the generated images. This is done by finding the latent space direction that maximizes a designed objective function.…”
Section: Related Work
confidence: 99%
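The "maximize a designed objective function" step quoted above is, in the simplest case, gradient ascent on the direction vector. A toy sketch, assuming a differentiable objective f(w) = a·w − λ‖w‖² that stands in for a classifier's response to edits along w (the names `a`, `lam`, and the objective itself are illustrative, not from any cited paper):

```python
import numpy as np

rng = np.random.default_rng(1)
a = rng.standard_normal(8)   # toy gradient of the attribute score w.r.t. w
lam = 0.5                    # penalty keeping the direction bounded

# Plain gradient ascent on f(w) = a.w - lam * ||w||^2, whose gradient is
# a - 2 * lam * w; the closed-form maximizer is a / (2 * lam).
w = np.zeros(8)
for _ in range(200):
    grad = a - 2.0 * lam * w
    w += 0.1 * grad
```

In the actual methods, the objective would be evaluated through the generator and a classifier, and the gradient obtained by backpropagation rather than in closed form.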