“…However, as its performance in terms of test likelihood and the quality of generated samples fell short of what was desired, many modifications were proposed to improve its performance on high-dimensional data such as natural images. In general, one can obtain a tighter lower bound, and thus a more powerful and flexible model, by improving any of the following three elements: the encoder (Rezende et al., 2014; van den Berg et al., 2018; Hoogeboom et al., 2020; Maaløe et al., 2016), the prior (or marginal over latents) (Chen et al., 2016; Habibian et al., 2019; Lavda et al., 2020; Lin & Clark, 2020; Tomczak & Welling, 2017), and the decoder (Gulrajani et al., 2016). Nevertheless, recent studies have shown that, by employing deep hierarchical architectures and by carefully designing the building blocks of the neural networks, VAEs can successfully model large high-dimensional data and reach state-of-the-art test likelihoods (Zhao et al., 2017; Maaløe et al., 2019; Vahdat & Kautz, 2020).…”
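For context, the lower bound referred to above is the standard evidence lower bound (ELBO); a minimal sketch in conventional VAE notation (the symbols $\theta$, $\phi$, $z$ are not defined in the excerpt and are assumed here) shows where each of the three components enters:

```latex
\log p_\theta(x) \;\ge\; \mathcal{L}(\theta,\phi;x)
\;=\; \underbrace{\mathbb{E}_{q_\phi(z\mid x)}\!\big[\log p_\theta(x\mid z)\big]}_{\text{decoder term}}
\;-\; \underbrace{\mathrm{KL}\!\big(q_\phi(z\mid x)\,\big\|\,p(z)\big)}_{\text{encoder vs.\ prior}}
```

A more expressive encoder $q_\phi(z\mid x)$ tightens the bound by shrinking the gap $\mathrm{KL}(q_\phi(z\mid x)\,\|\,p_\theta(z\mid x))$, while richer priors $p(z)$ and decoders $p_\theta(x\mid z)$ increase the likelihood the model can represent, which motivates the three lines of work cited above.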