2021
DOI: 10.48550/arxiv.2103.01678
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Wasserstein GANs Work Because They Fail (to Approximate the Wasserstein Distance)

Abstract: Wasserstein GANs are based on the idea of minimising the Wasserstein distance between a real and a generated distribution. We provide an indepth mathematical analysis of differences between the theoretical setup and the reality of training Wasserstein GANs. In this work, we gather both theoretical and empirical evidence that the WGAN loss is not a meaningful approximation of the Wasserstein distance. Moreover, we argue that the Wasserstein distance is not even a desirable loss function for deep generative mode… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

2
14
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(16 citation statements)
references
References 6 publications
2
14
0
Order By: Relevance
“…This is in contrast to the standard Wasserstein 1 framework, where the gradients of a solution u to (1) determine only the direction of optimal mass transport, and not the speed of that transport or the amount of mass transported. This insight provides more evidence to support the hypothesis in Stanczuk et al (2021) that WGAN-GP might perform well because they fail to compute W 1 (µ, ν).…”
Section: Introductionsupporting
confidence: 57%
See 4 more Smart Citations
“…This is in contrast to the standard Wasserstein 1 framework, where the gradients of a solution u to (1) determine only the direction of optimal mass transport, and not the speed of that transport or the amount of mass transported. This insight provides more evidence to support the hypothesis in Stanczuk et al (2021) that WGAN-GP might perform well because they fail to compute W 1 (µ, ν).…”
Section: Introductionsupporting
confidence: 57%
“…This is in contrast to the gradients of Kantorovich potentials for the Wasserstein 1 distance, which only determine the normalized direction of flow. This may explain, in support of Stanczuk et al (2021), the success of WGAN-GP, since the training of the generator is based on these gradients.…”
mentioning
confidence: 81%
See 3 more Smart Citations