Learning action-oriented models through active inference

Tschantz, Alexander; Seth, Anil K.; Buckley, Christopher L.

doi:10.1101/764969

Cited by 44 publications

(69 citation statements)

References 65 publications

“…In other words, it promotes agents to sample data in order to resolve uncertainty about the hidden state of the environment. This term is formally equivalent to a number of established quantities, such as (expected) Bayesian surprise, mutual information, and the expected reduction in posterior entropy [11], [31], and has been used to describe various epistemic foraging behaviors, such as saccades [32]-[35] and sentence comprehension [15]. In the current paper, we conduct experiments in fully observed environments, and as such, do not consider the state information gain term in our analysis.…”

Section: E Expected Free Energy (mentioning)

confidence: 99%

Scaling Active Inference

Tschantz, Baltieri, Seth, et al. 2020

2020 International Joint Conference on Neural Networks (IJCNN)

Self Cite

In reinforcement learning (RL), agents often operate in partially observed and uncertain environments. Model-based RL suggests that this is best achieved by learning and exploiting a probabilistic model of the world. 'Active inference' is an emerging normative framework in cognitive and computational neuroscience that offers a unifying account of how biological agents achieve this. On this framework, inference, learning and action emerge from a single imperative to maximize the Bayesian evidence for a niched model of the world. However, implementations of this process have thus far been restricted to low-dimensional and idealized situations. Here, we present a working implementation of active inference that applies to high-dimensional tasks, with proof-of-principle results demonstrating efficient exploration and an order of magnitude increase in sample efficiency over strong model-free baselines. Our results demonstrate the feasibility of applying active inference at scale and highlight the operational homologies between active inference and current model-based approaches to RL.
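
The citation statement above says the state information gain term is formally equivalent to expected Bayesian surprise, mutual information, and the expected reduction in posterior entropy. The following is a minimal numerical sketch of that equivalence for a small discrete model. It is not taken from either paper: the function name, the two-state example, and the assumption of strictly positive probabilities (to avoid log of zero) are all illustrative choices.

```python
import numpy as np

def information_gain_three_ways(prior, likelihood):
    """Compute state information gain in three equivalent forms.

    prior:      shape (S,), p(s), assumed strictly positive (illustrative).
    likelihood: shape (O, S), p(o | s), columns sum to 1, strictly positive.
    Returns (expected Bayesian surprise, mutual information,
             expected reduction in posterior entropy), all in nats.
    """
    prior = np.asarray(prior, dtype=float)
    likelihood = np.asarray(likelihood, dtype=float)

    joint = likelihood * prior           # p(o, s), shape (O, S)
    p_o = joint.sum(axis=1)              # marginal p(o), shape (O,)
    posterior = joint / p_o[:, None]     # p(s | o), shape (O, S)

    # 1. Expected Bayesian surprise: E_{p(o)}[ KL(p(s|o) || p(s)) ]
    kl_per_o = np.sum(posterior * np.log(posterior / prior), axis=1)
    bayesian_surprise = np.sum(p_o * kl_per_o)

    # 2. Mutual information: KL(p(o, s) || p(o) p(s))
    mutual_info = np.sum(joint * np.log(joint / np.outer(p_o, prior)))

    # 3. Expected entropy reduction: H[p(s)] - E_{p(o)}[ H[p(s|o)] ]
    h_prior = -np.sum(prior * np.log(prior))
    h_post_per_o = -np.sum(posterior * np.log(posterior), axis=1)
    entropy_reduction = h_prior - np.sum(p_o * h_post_per_o)

    return bayesian_surprise, mutual_info, entropy_reduction

if __name__ == "__main__":
    # Hypothetical two-state, two-observation model.
    values = information_gain_three_ways(
        prior=[0.5, 0.5],
        likelihood=[[0.9, 0.2],
                    [0.1, 0.8]],
    )
    print(values)  # the three numbers should agree up to rounding error
```

When the likelihood does not depend on the state, all three quantities are zero, which matches the intuition that such observations resolve no uncertainty about the hidden state.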

“…For instance, the position of a cup of coffee has potential consequences for vision, gustation, olfaction, and somatosensation. It may be that the data-generating process is of a form that requires some transformation of the x variables, or even that the generative model is not an accurate description of the data-generating process [68]. Regardless of whether the model is a ‘good’ model, the inferential interpretation is useful in thinking about modularity.…”

Section: Neuronal Message Passing (mentioning)

confidence: 99%

Modules or Mean-Fields?

Parr, Sajid, Friston 2020

Entropy

The segregation of neural processing into distinct streams has been interpreted by some as evidence in favour of a modular view of brain function. This implies a set of specialised ‘modules’, each of which performs a specific kind of computation in isolation of other brain systems, before sharing the result of this operation with other modules. In light of a modern understanding of stochastic non-equilibrium systems, like the brain, a simpler and more parsimonious explanation presents itself. Formulating the evolution of a non-equilibrium steady state system in terms of its density dynamics reveals that such systems appear on average to perform a gradient ascent on their steady state density. If this steady state implies a sufficiently sparse conditional independency structure, this endorses a mean-field dynamical formulation. This decomposes the density over all states in a system into the product of marginal probabilities for those states. This factorisation lends the system a modular appearance, in the sense that we can interpret the dynamics of each factor independently. However, the argument here is that it is factorisation, as opposed to modularisation, that gives rise to the functional anatomy of the brain or, indeed, any sentient system. In the following, we briefly overview mean-field theory and its applications to stochastic dynamical systems. We then unpack the consequences of this factorisation through simple numerical simulations and highlight the implications for neuronal message passing and the computational architecture of sentience.

“…performing variational inference in computational models that have parameters corresponding to beliefs about actions, one can make specific predictions about epistemic actions such as eye movements [5]. By incorporating learning, one can also make predictions about the biases that may accrue to an agent's beliefs about the world, as it attempts to minimise expected free energy [6]. By reconstruing goals and rewards as prior expectations, these models can also make fine-grained predictions about the dynamics of reinforcement learning [7].…”

Section: Extending Free Energy Into the Future (mentioning)

confidence: 99%

“…By incorporating learning, one can also make predictions about the biases that may accrue to an agent's beliefs about the world, as it attempts to minimise expected free energy [6]. By reconstruing goals and rewards as prior expectations, these models can also make fine-grained predictions about the dynamics of reinforcement learning [7].…”

Section: Extending Free Energy Into the Future (mentioning)

confidence: 99%