2018
DOI: 10.48550/arxiv.1810.06943
Preprint

The Deep Weight Prior

Abstract: Bayesian inference is known to provide a general framework for incorporating prior knowledge or specific properties into machine learning models via carefully choosing a prior distribution. In this work, we propose a new type of prior distributions for convolutional neural networks, deep weight prior (dwp), that exploit generative models to encourage a specific structure of trained convolutional filters, e.g., spatial correlations. We define dwp in the form of an implicit distribution and propose a method for variational inference with such implicit priors.
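As a rough illustration only (this is not the authors' code; the decoder architecture, latent dimensionality, and kernel size below are assumptions), the following PyTorch sketch shows the basic mechanics of such a prior: a generative model is assumed to have been trained on convolutional kernels harvested from networks trained on source tasks, and samples from its decoder are used to set the filters of a convolutional layer in a new network.

# Minimal sketch, assuming PyTorch; not the authors' released code.
# A decoder of a generative model (e.g. the decoder of a VAE trained on
# convolutional kernels from source-task networks) acts as an implicit
# prior over filters; samples from it are copied into a new conv layer.
import torch
import torch.nn as nn

latent_dim = 2      # assumed latent size of the kernel generative model
kernel_size = 7     # assumed spatial size of the learned filters

# Stand-in decoder: maps a latent code z to one kernel_size x kernel_size filter.
decoder = nn.Sequential(
    nn.Linear(latent_dim, 64),
    nn.ReLU(),
    nn.Linear(64, kernel_size * kernel_size),
)

def sample_kernels(n_kernels: int) -> torch.Tensor:
    """Draw filters from the implicit prior p(w) = E_{z ~ N(0, I)} [p(w | z)]."""
    z = torch.randn(n_kernels, latent_dim)
    with torch.no_grad():
        w = decoder(z).view(n_kernels, kernel_size, kernel_size)
    return w

# Use samples from the prior to set the weights of a fresh convolutional layer.
conv = nn.Conv2d(in_channels=3, out_channels=16, kernel_size=kernel_size, bias=False)
with torch.no_grad():
    kernels = sample_kernels(conv.out_channels * conv.in_channels)
    conv.weight.copy_(kernels.view_as(conv.weight))

In the full method the decoder would instead enter the variational objective as part of the prior; the snippet above only illustrates drawing filters from it.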

Cited by 6 publications (11 citation statements). References 8 publications.
“…However, most modern applications of BNNs still relied on simple Gaussian priors. Although a few different priors have been proposed for BNNs, these were mostly designed for specific tasks (Atanov et al., 2018; Ghosh & Doshi-Velez, 2017; Overweg et al., 2019; Nalisnick, 2018; Cui et al., 2020; Hafner et al., 2020) or relied heavily on non-standard inference methods (Sun et al., 2019; Ma et al., 2019; Karaletsos & Bui, 2020; Pearce et al., 2020). Moreover, while many interesting distributions have been proposed as variational posteriors for BNNs (Louizos & Welling, 2017; Swiatkowski et al., 2020; Dusenberry et al., 2020; Aitchison et al., 2020), these approaches have still used Gaussian priors.…”
Section: Related Work (citation type: mentioning)
confidence: 99%
“…BNN priors. Finally, previous work has investigated the performance implications of neural network priors chosen without reference to the empirical distributions of SGD-trained networks (Ghosh & Doshi-Velez, 2017; Wu et al., 2018; Atanov et al., 2018; Nalisnick, 2018; Overweg et al., 2019; Farquhar et al., 2019; Cui et al., 2020; Rothfuss et al., 2020; Hafner et al., 2020; Matsubara et al., 2020; Tran et al., 2020; Garriga-Alonso & van der Wilk, 2021). While these priors might in certain circumstances offer performance improvements, they did not offer a recipe for finding potentially valuable features to incorporate into the weight priors.…”
Section: Related Work (citation type: mentioning)
confidence: 99%
“…Bayesian inference facilitates a general framework for incorporating specific properties or prior knowledge into machine learning techniques by carefully selecting a prior distribution. Atanov et al. [387] presented a novel type of prior distribution for CNNs, the deep weight prior (DWP), which uses generative models to encourage a certain structure in trained convolutional filters. They devised a technique for VI with implicit priors and defined DWP in the form of an implicit distribution.…”
Section: Other UQ Techniques (citation type: mentioning)
confidence: 99%
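As a hedged sketch of what variational inference with an implicit, latent-variable prior p(w) = ∫ p(w|z) p(z) dz can look like (standard auxiliary-variable notation, not necessarily the exact objective from the paper): since log p(w) is intractable, it can be lower-bounded with an auxiliary model r(z|w), and that bound can replace log p(w) inside the usual evidence lower bound:

\[
\log p(w) \;\ge\; \mathbb{E}_{r(z \mid w)}\!\left[\log p(w \mid z) + \log p(z) - \log r(z \mid w)\right],
\]
\[
\mathbb{E}_{q(w)}\!\left[\log p(\mathcal{D} \mid w)\right] - \mathrm{KL}\!\left(q(w) \,\|\, p(w)\right)
\;\ge\;
\mathbb{E}_{q(w)}\!\left[\log p(\mathcal{D} \mid w) - \log q(w) + \mathbb{E}_{r(z \mid w)}\!\left[\log p(w \mid z) + \log p(z) - \log r(z \mid w)\right]\right].
\]

If q(w), p(w|z), and r(z|w) are reparameterizable (e.g., Gaussian), both expectations can be estimated with Monte Carlo samples and the bound optimized with stochastic gradients.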
“…The Bayesian brain hypothesis [4] was widely used to model sensorimotor behavior [5,6], perceptual decision making [7], object perception in visual cortex [8,9], and even cognition [10]. Bayesian inference is also a principled computational framework in deep-learning-based machine learning [11][12][13]. When prior beliefs are taken into account, unsupervised learning meets Bayesian inference.…”
Citation type: mentioning
confidence: 99%
“…Thus incorporating prior beliefs from past experiences into unsupervised learning is a common and fundamental characteristic of computation in the brain and in artificial neural networks. However, current studies have mostly focused on neural implementations of Bayesian inference [4,14] or on designing scalable Bayesian learning algorithms for deep networks [12,13], leaving a scientific understanding of unsupervised learning with prior knowledge lagging far behind its neural implementations and engineering applications.…”
Citation type: mentioning
confidence: 99%