2019
DOI: 10.1101/764969
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning action-oriented models through active inference

Abstract: Converging theories suggest that organisms learn and exploit probabilistic models of their environment. However, it remains unclear how such models can be learned in practice. The open-ended complexity of natural environments means that it is generally infeasible for organisms to model their environment comprehensively. Alternatively, action-oriented models attempt to encode a parsimonious representation of adaptive agent-environment interactions. One approach to learning action-oriented models is to l… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
69
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
8
1

Relationship

3
6

Authors

Journals

citations
Cited by 44 publications
(69 citation statements)
references
References 65 publications
0
69
0
Order By: Relevance
“…In other words, it promotes agents to sample data in order to resolve uncertainty about the hidden state of the environment. This term is formally equivalent to a number of established quantities, such as (expected) Bayesian surprise, mutual information, and the expected reduction in posterior entropy [11], [31], and has been used to describe various epistemic foraging behaviors, such as saccades [32]- [35] and sentence comprehension [15]. In the current paper, we conduct experiments in fully observed environments, and as such, do not consider the state information gain term in our analysis.…”
Section: E Expected Free Energymentioning
confidence: 99%
“…In other words, it promotes agents to sample data in order to resolve uncertainty about the hidden state of the environment. This term is formally equivalent to a number of established quantities, such as (expected) Bayesian surprise, mutual information, and the expected reduction in posterior entropy [11], [31], and has been used to describe various epistemic foraging behaviors, such as saccades [32]- [35] and sentence comprehension [15]. In the current paper, we conduct experiments in fully observed environments, and as such, do not consider the state information gain term in our analysis.…”
Section: E Expected Free Energymentioning
confidence: 99%
“…For instance, the position of a cup of coffee has potential consequences for vision, gustation, olfaction, and somatosensation. It may be that the data-generating process is of a form that requires some transformation of the x variables, or even that the generative model is not an accurate description of the data-generating process [ 68 ]. Regardless of whether the model is a ‘good’ model, the inferential interpretation is useful in thinking about modularity.…”
Section: Neuronal Message Passingmentioning
confidence: 99%
“…performing variational inference in computational models that have parameters corresponding to beliefs about actions, one can make specific predictions about epistemic actions such as eye movements [5]. By incorporating learning, one can also make predictions about the biases that may accrue to an agent's beliefs about the world, as it attempts to minimise expected free energy [6]. By reconstruing goals and rewards as prior expectations, these models can also make fine-grained predictions about the dynamics of reinforcement learning [7].…”
Section: Extending Free Energy Into the Futurementioning
confidence: 99%
“…By incorporating learning, one can also make predictions about the biases that may accrue to an agent's beliefs about the world, as it attempts to minimise expected free energy [6]. By reconstruing goals and rewards as prior expectations, these models can also make fine-grained predictions about the dynamics of reinforcement learning [7].…”
Section: Extending Free Energy Into the Futurementioning
confidence: 99%