2021
DOI: 10.48550/arxiv.2104.06954
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Aligning Latent and Image Spaces to Connect the Unconnectable

Abstract: Figure 1: Our method can generate infinite images of diverse and complex scenes that transition naturally from one into another. It does so without any conditioning and trains without any supervision from a dataset of unrelated square images.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 6 publications
(14 citation statements)
references
References 53 publications
0
14
0
Order By: Relevance
“…Further, observe that the face identity is well-preserved for unrelated edits and that local edits, such as those changing hairstyle and expression, do not alter unrelated image regions (e.g., expression is consistent across the "gender", "hi-top fade", and "tanned" edits). Notably, this disentanglement holds for other domains such as animal faces (AFHQv2 [13]) and landscapes (Landscapes HQ [63]). When editing animals, fur color, pose, and backgrounds are well-preserved under the various edits.…”
Section: Editing Via Non-linear Latent Pathsmentioning
confidence: 97%
See 1 more Smart Citation
“…Further, observe that the face identity is well-preserved for unrelated edits and that local edits, such as those changing hairstyle and expression, do not alter unrelated image regions (e.g., expression is consistent across the "gender", "hi-top fade", and "tanned" edits). Notably, this disentanglement holds for other domains such as animal faces (AFHQv2 [13]) and landscapes (Landscapes HQ [63]). When editing animals, fur color, pose, and backgrounds are well-preserved under the various edits.…”
Section: Editing Via Non-linear Latent Pathsmentioning
confidence: 97%
“…7. Editing in S. We edit synthetic images using the Style-CLIP [48] global directions technique using StyleGAN3 generators trained on the FFHQ [35], AFHQv2 [13,34], and Landscapes HQ [63] datasets.…”
Section: Editing Via Non-linear Latent Pathsmentioning
confidence: 99%
“…It is mostly popular for 3D reconstruction and geometry processing tasks (e.g., [35,39,43,45,50]), including video-based reconstruc- tion [33,46,51,79]. Several recent projects explored the task of building generative models over such representations to synthesize images (e.g., [4,62,63]), 3D objects (e.g., [11,32,58]) or multi-modal signals (e.g., [15,16]), and our work extends this line of research to video generation.…”
Section: Related Workmentioning
confidence: 99%
“…It is noticeable on datasets where new content appears during a video, like Sky Timelapse or Rainbow Jelly. We believe it can be resolved using ideas similar to ALIS [63].…”
Section: A Limitations and Potential Negative Impact A1 Limitationsmentioning
confidence: 99%
See 1 more Smart Citation