Yotam Nitzan scite author profile

Recently, there has been a surge of diverse methods for performing image editing by employing pre-trained unconditional generators. Applying these methods on real images, however, remains a challenge, as it necessarily requires the inversion of the images into their latent space. To successfully invert a real image, one needs to find a latent code that reconstructs the input image accurately, and more importantly, allows for its meaningful manipulation. In this paper, we carefully study the latent space of StyleGAN, the state-of-the-art unconditional generator. We identify and analyze the existence of a distortion-editability tradeoff and a distortion-perception tradeoff within the StyleGAN latent space. We then suggest two principles for designing encoders in a manner that allows one to control the proximity of the inversions to regions that StyleGAN was originally trained on. We present an encoder based on our two principles that is specifically designed for facilitating editing on real images by balancing these tradeoffs. By evaluating its performance qualitatively and quantitatively on numerous challenging domains, including cars and horses, we show that our inversion method, followed by common editing techniques, achieves superior real-image editing quality, with only a small reconstruction accuracy drop.

show abstract

Designing an Encoder for StyleGAN Image Manipulation

Tov¹,

Alaluf²,

Nitzan³

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

Face identity disentanglement via latent space mapping

et al. 2020

View full text Add to dashboard Cite

Learning disentangled representations of data is a fundamental problem in artificial intelligence. Specifically, disentangled latent representations allow generative models to control and compose the disentangled factors in the synthesis process. Current methods, however, require extensive supervision and training, or instead, noticeably compromise quality. In this paper, we present a method that learns how to represent data in a disentangled way, with minimal supervision, manifested solely using available pre-trained networks. Our key insight is to decouple the processes of disentanglement and synthesis, by employing a leading pre-trained unconditional image generator, such as StyleGAN. By learning to map into its latent space, we leverage both its state-of-the-art quality, and its rich and expressive latent space, without the burden of training it. We demonstrate our approach on the complex and high dimensional domain of human heads. We evaluate our method qualitatively and quantitatively, and exhibit its success with de-identification operations and with temporal identity coherency in image sequences. Through extensive experimentation, we show that our method successfully disentangles identity from other facial attributes, surpassing existing methods, even though they require more training and supervision.

show abstract

A severe citrus tristeza virus isolate causing the collapse of trees of sour orange before virus is detectable throughout the canopy§

Ben-Ze’ev

Bar‐Joseph

Nitzan³

et al. 1989

Annals of Applied Biology

View full text Add to dashboard Cite

A rapidly spreading decline of 'Minneola' tangelos, 'Shamouti' and 'Valencia' sweet oranges grafted on sour orange rootstock in the Morasha area, in the coastal plain of Israel, was found to be caused by a severe 'seedling yellows' strain of the citrus tristeza virus (CTV). Repeated ELISA tests revealed great variation in distribution of CTV throughout the canopies, even in declining trees. In a substantial number of the declining trees, samples of up to 10 twigs per tree were not always sufficient for CTV detection. The ELISA values (O.D. 405 nm) in the parts found infected were high, whereas in most of the twigs showing negative ELISA results the virus was absent as indicated by biological indexing. The Morasha strain of CTV was also characterised by rapid annual spread rates. The ratio D/E (the proportion of Declining trees found among ELISA-positive ones) is proposed as a simple index of strain severity. The epidemiological consequences of the uneven distribution of CTV and rapid decline are discussed.

show abstract

Face Identity Disentanglement via Latent Space Mapping

Nitzan¹,

Bermano²,

Li³

et al. 2020

Preprint

View full text Add to dashboard Cite

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yotam Nitzan

Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation

Designing an encoder for StyleGAN image manipulation

Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation

Designing an encoder for StyleGAN image manipulation

Designing an Encoder for StyleGAN Image Manipulation

Face identity disentanglement via latent space mapping

A severe citrus tristeza virus isolate causing the collapse of trees of sour orange before virus is detectable throughout the canopy§

Face Identity Disentanglement via Latent Space Mapping

Contact Info

Product

Resources

About