2019
DOI: 10.1088/1751-8121/ab3f3f

Minimal model of permutation symmetry in unsupervised learning

Abstract: Permuting any two hidden units leaves the properties of typical deep generative neural networks invariant. This permutation symmetry plays an important role in understanding the computational performance of a broad class of neural networks with two or more hidden units. However, a theoretical study of permutation symmetry is still lacking. Here, we propose a minimal model, a restricted Boltzmann machine with only two hidden units, which aims to address how the permutation symmetry affects the critical learning…
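The permutation symmetry described in the abstract can be checked directly on a small instance. The following Python sketch (a hypothetical illustration, not the authors' code; the weights and sizes are arbitrary assumptions) builds the energy of a binary RBM with two hidden units and verifies that swapping the two hidden units' weight columns and biases leaves the marginal distribution over the visible units unchanged:

```python
import itertools

import numpy as np

def rbm_energy(v, h, W, b, c):
    """Binary RBM energy: E(v, h) = -v·W·h - b·v - c·h."""
    return -(v @ W @ h + b @ v + c @ h)

def visible_marginal(W, b, c, n_vis, n_hid):
    """Unnormalized marginal p(v) by brute-force summation over h."""
    return {
        v: sum(
            np.exp(-rbm_energy(np.array(v), np.array(h), W, b, c))
            for h in itertools.product([0, 1], repeat=n_hid)
        )
        for v in itertools.product([0, 1], repeat=n_vis)
    }

rng = np.random.default_rng(0)
n_vis, n_hid = 4, 2                  # minimal model: two hidden units
W = rng.normal(size=(n_vis, n_hid))  # arbitrary example couplings
b = rng.normal(size=n_vis)           # visible biases
c = rng.normal(size=n_hid)           # hidden biases

# Permute the two hidden units: swap weight columns and hidden biases.
W_perm, c_perm = W[:, ::-1], c[::-1]

p_orig = visible_marginal(W, b, c, n_vis, n_hid)
p_perm = visible_marginal(W_perm, b, c_perm, n_vis, n_hid)
assert all(np.isclose(p_orig[v], p_perm[v]) for v in p_orig)
print("Swapping the two hidden units leaves p(v) invariant.")
```

Because the two hidden units enter the partition sum symmetrically, any learned solution comes in permutation-related pairs, which is what makes the two-hidden-unit RBM a clean minimal setting for studying this symmetry.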

Cited by 14 publications (11 citation statements); references 30 publications. Citing publications span 2019 to 2023.

Citation statements (ordered by relevance):
“…Now we calculate the free energy of the model for the extensive-load case α = P/N ∼ O(1). To derive a typical behavior of the model, we need to perform a disorder average of ln Z, which can be tackled by the replica method: $-\beta f = \lim_{n \to 0,\, N \to \infty} \frac{\ln \overline{Z^n}}{nN}$ (e.g., see [24,25]). In essence, n copies of the original system are introduced.…”
Section: arXiv:2103.14317v1 [cond-mat.dis-nn], 26 Mar 2021 (mentioning)
confidence: 99%
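For context on the formula quoted above: the replica method evaluates the quenched average of ln Z through integer moments of Z, followed by an analytic continuation n → 0. The standard identity behind the quoted limit (textbook material, not specific to this paper) is:

```latex
% Replica identity: quenched free energy from moments of Z.
% For integer n, \overline{Z^n} is the partition function of n
% coupled copies (replicas) of the system; the result is then
% analytically continued to n -> 0.
\overline{\ln Z}
  = \lim_{n \to 0} \frac{\overline{Z^n} - 1}{n}
  = \lim_{n \to 0} \frac{\ln \overline{Z^n}}{n},
\qquad\text{hence}\qquad
-\beta f
  = \lim_{\substack{n \to 0 \\ N \to \infty}}
    \frac{\ln \overline{Z^n}}{nN}.
```

Both equalities follow from the expansion $\overline{Z^n} = \overline{e^{n \ln Z}} = 1 + n\,\overline{\ln Z} + O(n^2)$ for small n.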
“…By asking for the minimal data size needed to trigger learning in a two-layer neural network, namely a restricted Boltzmann machine (RBM) [15], a recent study claimed that sensory inputs (or data streams) are able to drive a series of phase transitions related to broken inherent symmetries of the model [16]. However, this model does not assume any prior knowledge during learning; therefore, the impact of priors on learning remains unexplained.…”
(mentioning)
confidence: 99%
“…From a neural network perspective, the interplay between the prior and the likelihood function of the data can be captured by synaptic weights. These synaptic weights are modeled by feedforward connections in an RBM [15,16,18]. More precisely, the RBM is a two-layer neural network with no intra-layer connections.…”
(mentioning)
confidence: 99%
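The bipartite structure mentioned in this statement (feedforward visible–hidden couplings only, no intra-layer connections) is what makes each layer of an RBM conditionally independent given the other. A minimal Python sketch of the resulting conditional distributions, under the usual binary-unit convention (an illustration under assumed conventions, not code from the cited works):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def p_h_given_v(v, W, c):
    """With no hidden-hidden connections, the hidden units are
    conditionally independent given the visible layer:
    p(h_j = 1 | v) = sigmoid(sum_i v_i W_ij + c_j)."""
    return sigmoid(v @ W + c)

def p_v_given_h(h, W, b):
    """Symmetrically, with no visible-visible connections,
    p(v_i = 1 | h) = sigmoid(sum_j W_ij h_j + b_i)."""
    return sigmoid(W @ h + b)

rng = np.random.default_rng(1)
n_vis, n_hid = 6, 2
W = rng.normal(scale=0.1, size=(n_vis, n_hid))  # feedforward couplings
b, c = np.zeros(n_vis), np.zeros(n_hid)         # biases

v = rng.integers(0, 2, size=n_vis).astype(float)
print("p(h=1 | v) =", p_h_given_v(v, W, c))  # factorizes over hidden units
```

This layer-wise conditional independence is the property that makes block Gibbs sampling and learning tractable in RBMs, which is why the quoted works use them to model the prior–likelihood interplay.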