“…Multiple works have studied inversion in the context of StyleGAN. They either directly optimize the latent vector to reproduce a specific image [1,2,6,15,33,46,56], or train an efficient encoder over a large collection of images [4,5,16,19,23,29,32,40,53]. Typically, direct optimization is more accurate, but encoders are faster at inference.…”
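
The contrast between the two inversion strategies can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the "generator" is a small fixed tanh layer in NumPy rather than StyleGAN, the encoder is a plain least-squares linear map, and the names `generator`, `invert_by_optimization`, and `train_encoder` are hypothetical. It is not the method of any of the cited works.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for a generator (NOT StyleGAN): a fixed linear map followed by tanh.
LATENT_DIM, IMAGE_DIM = 8, 64
W = rng.normal(size=(IMAGE_DIM, LATENT_DIM)) / np.sqrt(LATENT_DIM)


def generator(w):
    """Map a latent vector (or a batch of them) to a flat toy 'image'."""
    return np.tanh(w @ W.T)


def invert_by_optimization(target, steps=500, lr=0.05):
    """Per-image inversion: gradient descent on the latent for one target."""
    w = np.zeros(LATENT_DIM)
    for _ in range(steps):
        out = generator(w)
        # Gradient of 0.5 * ||tanh(W w) - target||^2 with respect to w.
        grad = ((out - target) * (1.0 - out**2)) @ W
        w -= lr * grad
    return w


def train_encoder(num_samples=2000):
    """Fit a linear image-to-latent map once, over many generated pairs."""
    latents = rng.normal(size=(num_samples, LATENT_DIM))
    images = generator(latents)
    E, *_ = np.linalg.lstsq(images, latents, rcond=None)
    return E  # shape (IMAGE_DIM, LATENT_DIM)


# One target image produced from a latent the inverters do not see.
true_w = rng.normal(size=LATENT_DIM)
target = generator(true_w)

w_opt = invert_by_optimization(target)  # many iterations for this one image
E = train_encoder()                     # one-off training cost
w_enc = target @ E                      # single matrix product at inference

err_opt = np.linalg.norm(generator(w_opt) - target)
err_enc = np.linalg.norm(generator(w_enc) - target)
print(f"optimization error: {err_opt:.2e}, encoder error: {err_enc:.2e}")
```

In this toy setting the optimizer spends hundreds of iterations on a single image, while the encoder needs one matrix product per image after its one-time fit. That mirrors the accuracy versus speed trade-off described above, although real StyleGAN inversion uses perceptual losses and far larger networks.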