2023
DOI: 10.1038/s42005-023-01416-5
|View full text |Cite
|
Sign up to set email alerts
|

The autoregressive neural network architecture of the Boltzmann distribution of pairwise interacting spins systems

Indaco Biazzo

Abstract: Autoregressive Neural Networks (ARNNs) have shown exceptional results in generation tasks across image, language, and scientific domains. Despite their success, ARNN architectures often operate as black boxes without a clear connection to underlying physics or statistical models. This research derives an exact mapping of the Boltzmann distribution of binary pairwise interacting systems in autoregressive form. The parameters of the ARNN are directly related to the Hamiltonian’s couplings and external fields, an… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 42 publications
0
0
0
Order By: Relevance
“…The ARNN architectures used are typically adapted from those developed for data-driven problems in computer science, and prior knowledge on theoretical properties of the physical system has rarely been utilized to customize the ARNN architecture, or only been exploited for specific physical systems (Biał as et al 2022, Biazzo et al 2022). Recently, a direct mapping was established between the Boltzmann distribution and the corresponding ARNN architecture (Biazzo 2023). The derived general architecture was not directly usable, but thanks to the analytic derivation, two specific architectures for two well-known fully connected spin models, the Curie-Weiss model and the Sherringhton-Kirkpatrick model, were derived.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The ARNN architectures used are typically adapted from those developed for data-driven problems in computer science, and prior knowledge on theoretical properties of the physical system has rarely been utilized to customize the ARNN architecture, or only been exploited for specific physical systems (Biał as et al 2022, Biazzo et al 2022). Recently, a direct mapping was established between the Boltzmann distribution and the corresponding ARNN architecture (Biazzo 2023). The derived general architecture was not directly usable, but thanks to the analytic derivation, two specific architectures for two well-known fully connected spin models, the Curie-Weiss model and the Sherringhton-Kirkpatrick model, were derived.…”
Section: Introductionmentioning
confidence: 99%
“…In Biazzo (2023), the functions ρ i were approximated with specific architectures derived for the fully connected Curie-Weiss and Sherrington-Kirkpatrick models. In the present work, we propose a more general approach to approximate ρ i using feedforward neural networks, whose inputs are the variables ξ i , and to take advantage of the sparsity of interactions among the spins.…”
mentioning
confidence: 99%