2020
DOI: 10.48550/arxiv.2009.01791
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Action and Perception as Divergence Minimization

Abstract: We introduce a unified objective for action and perception of intelligent agents. Extending representation learning and control, we minimize the joint divergence between the combined system of agent and environment and a target distribution. Intuitively, such agents use perception to align their beliefs with the world, and use actions to align the world with their beliefs. Minimizing the joint divergence to an expressive target maximizes the mutual information between the agent's representations and inputs, th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 106 publications
0
7
0
Order By: Relevance
“…The derivation of alternative functionals that preserve the desirable epistemic behavior of EFE optimization is an active research area [41,8]. There have been several interesting proposals such as the Free Energy of the Expected Future [22,7,9] or Generalized Free Energy [5], as well as amortization strategies [42,43]. However, the approach for a majority of the alternative functionals is to facilitate epistemics by the same mutual information term utilized by EFE while finessing the remainder of the functional.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…The derivation of alternative functionals that preserve the desirable epistemic behavior of EFE optimization is an active research area [41,8]. There have been several interesting proposals such as the Free Energy of the Expected Future [22,7,9] or Generalized Free Energy [5], as well as amortization strategies [42,43]. However, the approach for a majority of the alternative functionals is to facilitate epistemics by the same mutual information term utilized by EFE while finessing the remainder of the functional.…”
Section: Discussionmentioning
confidence: 99%
“…The AIF literature describes multiple Free Energy (FE) objectives for policy planning, e.g., the Expected FE [4], Generalized FE [5] and Predicted (Bethe) FE [6] (among others, see e.g. [7,8,9]). Traditionally, the Expected Free Energy (EFE) is evaluated for a selection of policies, and a posterior distribution over policies is constructed from the corresponding EFEs.…”
Section: Introductionmentioning
confidence: 99%
“…Such loop-closures have a further functional significance in allowing experiences to be bound together into a unified representational system where updates can be propagated in a mutually-constrained wholistic fashion, so providing a basis for the rapid and flexible construction and refinement of knowledge structures in the form of cognitive schemas that have both graph-like and map-like properties. With further experience, these schemata can then be transferred to the neocortex in the form of more stable adaptive action and thought tendencies, so forming a powerful hybrid architecture for instantiating robust causal world models (Hafner et al, 2020;Safron, 2021b).…”
Section: Latentslam a Bio-inspired Slam Algorithmmentioning
confidence: 99%
“…As a final note, we highlight alternative approaches to task specification. Building on the Free Energy Principle [13,12], Hafner et al [17] consider a variety of task types in terms of minimization of distance to a desired target distribution [3]. Alternatively, Littman et al [30] and Li et al [28] propose variations of linear temporal logic (LTL) as a mechanism for specifying a task to RL agents, with related literature extending LTL to the multi-task [58] and multi-agent [18] settings, or using reward machines for capturing task structure [19].…”
Section: Other Perspectives On Rewardmentioning
confidence: 99%