Network Bending: Expressive Manipulation of Deep Generative Models

Broad, Terence; Leymarie, Frederic Fol; Grierson, Mick

doi:10.1007/978-3-030-72914-1_2

Cited by 9 publications

(14 citation statements)

References 34 publications

(64 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In this paper, we have demonstrated our network bending framework in both the image and audio domains. For the image domain we have used StyleGAN2 [ 4 ], the state of the art generative model for unconditional image generation, in the audio domain we have built our own custom generative model to demonstrate how the same principles of clustering features and applying transformations to clustered features first presented in [ 1 ] can be applied directly to another domain. The generative model for audio we have presented is building on a much smaller body of research, and has more room for improvement in terms of the fidelity of the generated outputs, however it is still adequate and demonstrates that our clustering algorithm is capable of discovering semantically meaningful components of the signal ( Figure 5 ).…”

Section: Discussionmentioning

confidence: 99%

“…This process is one that could be particularly useful for music production, where an artist may want to create multiple variations of recordings they have created, that can later be layered into a music composition. An alternative use-case of this process used in the image domain is given in [ 1 ], where the chaining of multiple stochastic layers was used in the production of a series of five EP (extended play record) artworks that shared a common aesthetic theme.…”

Section: Manipulation Pipelinementioning

confidence: 99%

“…The network bending framework [ 1 ], allows for the direct and expressive manipulation of deep generative models. First demonstrated solely for generative models in the image domain, this paper presents how network bending can be used in both the image and audio domains.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Network Bending: Expressive Manipulation of Generative Models in Multiple Domains

Broad

Leymarie

Grierson

2021

Entropy

Self Cite

View full text Add to dashboard Cite

This paper presents the network bending framework, a new approach for manipulating and interacting with deep generative models. We present a comprehensive set of deterministic transformations that can be inserted as distinct layers into the computational graph of a trained generative neural network and applied during inference. In addition, we present a novel algorithm for analysing the deep generative model and clustering features based on their spatial activation maps. This allows features to be grouped together based on spatial similarity in an unsupervised fashion. This results in the meaningful manipulation of sets of features that correspond to the generation of a broad array of semantically significant features of the generated results. We outline this framework, demonstrating our results on deep generative models for both image and audio domains. We show how it allows for the direct manipulation of semantically meaningful aspects of the generative process as well as allowing for a broad range of expressive outcomes.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Manipulation Pipelinementioning

confidence: 99%

See 1 more Smart Citation

Network Bending: Expressive Manipulation of Generative Models in Multiple Domains

Broad

Leymarie

Grierson

2021

Entropy

Self Cite

View full text Add to dashboard Cite

show abstract

“…Interestingly, [29] showed that one can perform arithmetic in the latent space that affects predictable changes in image space. Since these works, a host of methods have been proposed to explore the latent structure in these generators by imposing structure at training-time [4,24] or more recently in the pre-trained generators themselves [1,12,31,32,34,36]. However, of the approaches that decompose the intermediate features directly (such as [12]), a linear decomposition is applied-where we argue a multilinear one can be more suitable in providing an ability to locate different categories of transformation.…”

Section: Related Workmentioning

confidence: 99%

Tensor Component Analysis for Interpreting the Latent Space of GANs

Oldfield¹,

Georgopoulos²,

Panagakis³

et al. 2021

Preprint

View full text Add to dashboard Cite

This paper addresses the problem of finding interpretable directions in the latent space of pre-trained Generative Adversarial Networks (GANs) to facilitate controllable image synthesis. Such interpretable directions correspond to transformations that can affect both the style and geometry of the synthetic images. However, existing approaches that utilise linear techniques to find these transformations often fail to provide an intuitive way to separate these two sources of variation. To address this, we propose to a) perform a multilinear decomposition of the tensor of intermediate representations, and b) use a tensor-based regression to map directions found using this decomposition to the latent space. Our scheme allows for both linear edits corresponding to the individual modes of the tensor, and non-linear ones that model the multiplicative interactions between them. We show experimentally that we can utilise the former to better separate style-from geometry-based transformations, and the latter to generate an extended set of possible transformations in comparison to prior works. We demonstrate our approach's efficacy both quantitatively and qualitatively compared to the current state-of-the-art.

show abstract

“…Liu et al [16] propose a GAN that involves semantic conditional information of the input by embedding facial attribute vectors in both the generator and discriminator, so that the model could be guided to output elderly face images with attributes faithful to each corresponding input. Broad et al [17] introduce network bending model that allows for the direct manipulation of semantically meaningful aspects of the generative process. In exploring the limit of how far human expressions can be captured, in this article we have train GANs using a collection of portraits of detained individuals, portraits of dead people who died of violent causes and people whose portraits were taken during an orgasm.…”

Section: Generative Adversarial Networkmentioning

confidence: 99%

GANs and Artificial Facial Expressions in Synthetic Portraits

Rosado

Fernandez

Reverter

2021

BDCC

View full text Add to dashboard Cite

Generative adversarial networks (GANs) provide powerful architectures for deep generative learning. GANs have enabled us to achieve an unprecedented degree of realism in the creation of synthetic images of human faces, landscapes, and buildings, among others. Not only image generation, but also image manipulation is possible with GANs. Generative deep learning models are inherently limited in their creative abilities because of a focus on learning for perfection. We investigated the potential of GAN’s latent spaces to encode human expressions, highlighting creative interest for suboptimal solutions rather than perfect reproductions, in pursuit of the artistic concept. We have trained Deep Convolutional GAN (DCGAN) and StyleGAN using a collection of portraits of detained persons, portraits of dead people who died of violent causes, and people whose portraits were taken during an orgasm. We present results which diverge from standard usage of GANs with the specific intention of producing portraits that may assist us in the representation and recognition of otherness in contemporary identity construction.

show abstract

Network Bending: Expressive Manipulation of Deep Generative Models

Cited by 9 publications

References 34 publications

Network Bending: Expressive Manipulation of Generative Models in Multiple Domains

Network Bending: Expressive Manipulation of Generative Models in Multiple Domains

Tensor Component Analysis for Interpreting the Latent Space of GANs

GANs and Artificial Facial Expressions in Synthetic Portraits

Contact Info

Product

Resources

About