2013
DOI: 10.1007/978-3-642-39802-5_17
|View full text |Cite
|
Sign up to set email alerts
|

Learning Epistemic Actions in Model-Free Memory-Free Reinforcement Learning: Experiments with a Neuro-robotic Model

Abstract: Passive sensory processing is often insufficient to guide biological organisms in complex environments. Rather, behaviourally relevant information can be accessed by performing so-called epistemic actions that explicitly aim at unveiling hidden information. However, it is still unclear how an autonomous agent can learn epistemic actions and how it can use them adaptively. In this work, we propose a definition of epistemic actions for POMDPs that derive from their characterizations in cognitive science and clas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
1
1

Relationship

3
3

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 24 publications
0
4
0
Order By: Relevance
“…Finally, the vulnerability we have described can be seen as ideally contiguous with those associated with state identification errors [9,87,88,89,90]. Under conditions of the environment in which information about the states is either incomplete or inaccessible, the resulting interaction between state identification and value estimation can cause the creation of fictitious internal states, where addictive behaviours would always be considered as highly rewarding [9].…”
Section: Discussionmentioning
confidence: 99%
“…Finally, the vulnerability we have described can be seen as ideally contiguous with those associated with state identification errors [9,87,88,89,90]. Under conditions of the environment in which information about the states is either incomplete or inaccessible, the resulting interaction between state identification and value estimation can cause the creation of fictitious internal states, where addictive behaviours would always be considered as highly rewarding [9].…”
Section: Discussionmentioning
confidence: 99%
“…Thus, the work presented in [80] investigated the role of a bottom-up attention component for the adaptation of a top-down component (see also [68]), the one of [77] focused on the advantages of the actionbased memory component for visual search, the one of [81] investigated how an active-vision system can re-adapt to changing tasks, and the one of [78] focussed on the effects of using manipulation actions to gather information from the world (i.e., as epistemic actions [56]). The novelty of this paper with respect to these works is as follows: (a) the clear identification of the four problems illustrated above, stemming from the active vision solution, and their systematic use to analyse the system behaviour and learning processes; (b) the clear identification of the four architectural principles and the systematic study of how they contribute to solve the four problems; (c) a deeper understanding, supported by new data analyses, of how the interplay of the four principles leads to solve the four problems and also generates some interesting emergent properties of the system (for example the developmental history of the system).…”
Section: ) a Bottom-up Attention Component Guiding Attention Onmentioning
confidence: 99%
“…Terekhov et al [35] presents the design and performance of a block-modular neural network architecture allowing parts of the existing network to be re-used to solve novel tasks while retaining performance on the original task. Ognibene et al [36] presents theoretical insights into the unveiling of hidden information through epistemic actions and the experimental benefits of using this actively-gathered information in order to efficiently accomplish a seekand-reach task.…”
Section: Research Backgroundsmentioning
confidence: 99%