2018
DOI: 10.1101/411272
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Computational mechanisms of curiosity and goal-directed exploration

Abstract: Successful behaviour depends on the right balance between maximising reward and soliciting information about the world. Here, we show how different types of information-gain emerge when casting behaviour as surprise minimisation. We present two distinct mechanisms for goal-directed exploration that express separable profiles of active sampling to reduce uncertainty. 'Hidden state' exploration motivates agents to sample unambiguous observations to accurately infer the (hidden) state of the world. Conversely, 'm… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
70
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 47 publications
(70 citation statements)
references
References 28 publications
0
70
0
Order By: Relevance
“…the term to be maximized) as: We provide a full derivation of Eq 4 in Appendix 4 and discuss its relationship to several established formalisms. Here, we have decomposed epistemic value into state epistemic value, or salience, and parameter epistemic value, or novelty [53]. State epistemic value quantifies the degree to which the expected observations o τ reduce the uncertainty in an agent's beliefs about the hidden states s τ .…”
Section: Plos Computational Biologymentioning
confidence: 99%
“…the term to be maximized) as: We provide a full derivation of Eq 4 in Appendix 4 and discuss its relationship to several established formalisms. Here, we have decomposed epistemic value into state epistemic value, or salience, and parameter epistemic value, or novelty [53]. State epistemic value quantifies the degree to which the expected observations o τ reduce the uncertainty in an agent's beliefs about the hidden states s τ .…”
Section: Plos Computational Biologymentioning
confidence: 99%
“…However, because interpreting EFE as a single psychological quantity is difficult, the intuitive understanding of how the information seeking behavior arises from EFE is less straightforward. In addition, how the epistemic value (i.e., curiosity; Friston, Lin, et al, 2017;Schwartenbeck et al, 2019) plays a role in minimizing (information) surprise is not obvious. PRS provides an intuitive account of how curiosity arises: informative action makes the hidden state less ambiguous, thus reducing RS.…”
Section: Discussionmentioning
confidence: 99%
“…The predictions of PRS and EFE can also be tested using an experimental design in an experience-based manner, which are applicable to animals other than humans. This can be achieved by extending the previous examples to explain the active inference based on EFE Friston et al (2015); FitzGerald, Schwartenbeck, Moutoussis, Dolan, and Friston (2015); Friston, FitzGerald, et al (2017); Schwartenbeck et al (2019). For example, Friston et al (2015) applied the active inference framework to a variant of T-maze task, in which visiting the tip of the center arm provides the subject with information on which of the other left or right arm is likely to offer a reward.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…The counterfactual agent is able to interpret such signals as evidence that the hidden dynamics underwriting its sensory flows may have changed in some significant way (e.g., finding oneself confronted by oncoming traffic), and can draw on alternative possible models to evaluate which parameterisation affords the best explanation for the data at hand (cf. parameter exploration; Schwartenbeck et al 2019). If the contingent relations structuring relevant environmental properties have indeed altered (e.g., realising one is visiting a country where people drive on the opposite side of the road), the agent will need to update its model in order to capture these novel conditions (see Sales et al 2019).…”
Section: Model 3: Counterfactual Active Inferencementioning
confidence: 99%