2020
DOI: 10.48550/arxiv.2006.12226
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Hierarchical Patch VAE-GAN: Generating Diverse Videos from a Single Sample

Abstract: We consider the task of generating diverse and novel videos from a single video sample. Recently, new hierarchical patch-GAN based approaches were proposed for generating diverse images, given only a single sample at training time. Moving to videos, these approaches fail to generate diverse samples, and often collapse into generating samples similar to the training video. We introduce a novel patchbased variational autoencoder (VAE) which allows for a much greater diversity in generation. Using this tool, a ne… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 7 publications
(24 citation statements)
references
References 28 publications
0
24
0
Order By: Relevance
“…To evaluate the realism of generated frames, we use the FID metric [13] over each one. For temporal consistency, we adopt the recently proposed SVFID score introduced by Gur et al [12]. SVFID is an extension of FID for a single video, evaluating how the generated samples capture the temporal statistics of a single video, by using features from a pretrained action recognition network.…”
Section: Resultsmentioning
confidence: 99%
“…To evaluate the realism of generated frames, we use the FID metric [13] over each one. For temporal consistency, we adopt the recently proposed SVFID score introduced by Gur et al [12]. SVFID is an extension of FID for a single video, evaluating how the generated samples capture the temporal statistics of a single video, by using features from a pretrained action recognition network.…”
Section: Resultsmentioning
confidence: 99%
“…Throughout the experiments, we consider the following set of baselines: SinGAN [31], ConSin-GAN [13] and HP-VAE-GAN [8]. To evaluate image generation, we use the single-image FID metric [31].…”
Section: Methodsmentioning
confidence: 99%
“…Previous methods [31,13,8] freeze each intermediate generator g i except for the current training scale, ensuring each g i to be independent 1 . In our case, we freeze the projection of all previous scales, except the current scale.…”
Section: Reconstruction Loss Functionmentioning
confidence: 99%
See 2 more Smart Citations