2020
DOI: 10.48550/arxiv.2003.07756
Preprint

Characterizing and Avoiding Problematic Global Optima of Variational Autoencoders

Authors: Yacoby, Pan, Doshi-Velez

Abstract (excerpt): …$\phi$ such that $g_\phi(x) = \eta(x)$; we denote the variational distribution $g_\phi(x)$ by $q_\phi(z \mid x)$. Thus, maximization of the ELBO can be expressed as the Posterior Matching (PM) Objective…
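The excerpt refers to the standard ELBO decomposition. As a minimal sketch (a textbook identity, stated with notation assumed to match the paper's $q_\phi(z \mid x)$ and generative model $p_\theta$, not quoted from the abstract):

$$
\mathrm{ELBO}(\theta, \phi; x) \;=\; \log p_\theta(x) \;-\; \mathrm{KL}\big( q_\phi(z \mid x) \,\|\, p_\theta(z \mid x) \big)
$$

Since only the KL term depends on $\phi$, maximizing the ELBO over $\phi$ matches the variational posterior to the true posterior, which is consistent with the excerpt's "Posterior Matching (PM) Objective" naming.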

Citation report: cited by 1 publication (2 citation statements, year published: 2020); references 9 publications.
“…In contrast, there has been little work to characterize pathologies at the global optima of the MFG-VAE training objective. [52] shows that, when the decoder's capacity is restricted, posterior collapse and a mismatch between the aggregated posterior and the prior can occur at global optima of the training objective. In contrast to existing work, we focus on global optima of the MFG-VAE objective in fully general settings: with fully flexible generative and inference models, both with and without learned observation noise.…”
Section: Related Work
confidence: 99%
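For context, the two pathologies named in this statement can be stated as short definitions (standard usage in the VAE literature, not quoted from [52]). Posterior collapse: $q_\phi(z \mid x) \approx p(z)$ for (almost) all $x$, so the latent code carries no information about the input. Aggregated posterior/prior mismatch: the marginal $q_\phi(z) = \mathbb{E}_{x \sim p_{\mathrm{data}}}\!\left[ q_\phi(z \mid x) \right]$ differs from the prior $p(z)$, which degrades samples drawn via $z \sim p(z)$.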
“…[33,47]). Recent work [52] attributes a number of these pathologies to properties of the training objective; in particular, the objective may compromise learning a good generative model in order to learn a good inference model; in other words, the inference model over-regularizes the generative model. While this pathology has been noted in the literature [4,53,6], no prior work characterizes the conditions under which the MFG-VAE objective compromises learning a good generative model in order to learn a good inference model; more worrisome, no prior work relates MFG-VAE pathologies to the performance of MFG-VAEs on downstream tasks.…”
Section: Introduction
confidence: 99%
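To make the objective under discussion concrete, below is a minimal mean-field Gaussian VAE (MFG-VAE) ELBO sketch in PyTorch. All names, layer sizes, and the unit-variance Gaussian likelihood are illustrative assumptions, not the cited paper's implementation; the closed-form KL term at the end is the regularizer that, per the citing paper, can over-regularize the generative model.

# Minimal mean-field Gaussian VAE (MFG-VAE) ELBO sketch.
# Illustrative only: network sizes and names are assumptions,
# not the cited paper's implementation.
import torch
import torch.nn as nn

class MFGVAE(nn.Module):
    def __init__(self, x_dim=784, z_dim=8, h_dim=128):
        super().__init__()
        # Inference model q_phi(z|x): mean-field (diagonal) Gaussian.
        self.enc = nn.Sequential(nn.Linear(x_dim, h_dim), nn.ReLU())
        self.enc_mu = nn.Linear(h_dim, z_dim)
        self.enc_logvar = nn.Linear(h_dim, z_dim)
        # Generative model p_theta(x|z): Gaussian with fixed unit variance.
        self.dec = nn.Sequential(nn.Linear(z_dim, h_dim), nn.ReLU(),
                                 nn.Linear(h_dim, x_dim))

    def elbo(self, x):
        h = self.enc(x)
        mu, logvar = self.enc_mu(h), self.enc_logvar(h)
        # Reparameterized sample z ~ q_phi(z|x).
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)
        # Reconstruction term E_q[log p_theta(x|z)] for a unit-variance
        # Gaussian likelihood, up to an additive constant.
        recon = -0.5 * ((x - self.dec(z)) ** 2).sum(dim=1)
        # Closed-form KL(q_phi(z|x) || p(z)) with standard normal prior.
        # This is the term that regularizes the inference model toward the
        # prior and, per the citing paper, can over-regularize the
        # generative model.
        kl = 0.5 * (mu ** 2 + logvar.exp() - logvar - 1).sum(dim=1)
        return (recon - kl).mean()

# Usage: maximize the ELBO by minimizing its negative.
model = MFGVAE()
x = torch.randn(32, 784)  # stand-in batch
loss = -model.elbo(x)
loss.backward()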