Imitation as a model-free process in human reinforcement learning

Najar, Anis; Bonnet, Emmanuelle; Bahrami, Bahador; Palminteri, Stefano

doi:10.1101/797407

Cited by 3 publications

(4 citation statements)

References 38 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The computational questions covered in this survey extend beyond the boundaries of Artificial Intelligence, as similar research questions regarding the computational implementation of social learning strategies are also raised in the field of Cognitive Neuroscience [10,87,96]. Thus we think this survey can be of interest for both communities.…”

Section: Discussionmentioning

confidence: 95%

Reinforcement learning with human advice: a survey

Najar¹,

Chétouani²

2020

Preprint

Self Cite

View full text Add to dashboard Cite

In this paper, we provide an overview of the existing methods for integrating human advice into a Reinforcement Learning process. We propose a taxonomy of different types of teaching signals, and present them according to three main aspects: how they can be provided to the learning agent, how they can be integrated into the learning process, and how they can be interpreted by the agent if their meaning is not determined beforehand. Finally, we compare the benefits and limitations of using each type of teaching signals, and propose a unified view of interactive learning methods.

show abstract

Section: Discussionmentioning

confidence: 95%

Reinforcement learning with human advice: a survey

Najar¹,

Chétouani²

2020

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…The future direction would be to extend the framework to include more neural populations, representing more complications of the ToM circuits in human brain, and a history of recent events. Given the recent debates on predictive coding and value shaping hypotheses in the context of ToM and social cognition (see 42,43 for instance), it is also of great importance to test if these hypotheses can be explained by an extended ToM-based ImRL framework.…”

Section: Discussionmentioning

confidence: 99%

On computational models of theory of mind and the imitative reinforcement learning in spiking neural networks

Gorgan Mohammadi,

Ganjtabesh

2024

Sci Rep

View full text Add to dashboard Cite

Theory of Mind is referred to the ability of inferring other’s mental states, and it plays a crucial role in social cognition and learning. Biological evidences indicate that complex circuits are involved in this ability, including the mirror neuron system. The mirror neuron system influences imitation abilities and action understanding, leading to learn through observing others. To simulate this imitative learning behavior, a Theory-of-Mind-based Imitative Reinforcement Learning (ToM-based ImRL) framework is proposed. Employing the bio-inspired spiking neural networks and the mechanisms of the mirror neuron system, ToM-based ImRL is a bio-inspired computational model which enables an agent to effectively learn how to act in an interactive environment through observing an expert, inferring its goals, and imitating its behaviors. The aim of this paper is to review some computational attempts in modeling ToM and to explain the proposed ToM-based ImRL framework which is tested in the environment of River Raid game from Atari 2600 series.

show abstract

“…The future direction would be to extend the framework to include more neural populations, representing more complications of the ToM circuits in human brain, and a history of recent events. Given the recent debates on predictive coding and value shaping hypotheses in the context of ToM and social cognition (see 40 and 41 for instance), it is also of great importance to test if these hypotheses can be explained by an extended ToM-based ImRL framework.…”

Section: Discussionmentioning

confidence: 99%

“…As a result, for each group of components, we respectively have a three-dimensional tensor of shape (n, h, d), where n, h, and d respectively denote the number of features (7 for obstacles, 4 for side margins, 3 for oriented margins, and 1 for shot distances), the height of the frame, and the maximum horizontal distance from the plane/shot. All together, there would be four independent input populations: one for the obstacles of shape (7, 101, 175), one for the side margins of shape (4,25,17), one for the oriented margins of shape (3,101,41), and one for the distance of obstacles from the shot of shape (1,101,21). Note that the lowest bar in the game frame is not considered in ToM-based agent's visual access to reduce the complexity of the network's input (see Figure 5).…”

Section: Network Architecturementioning

confidence: 99%

On Computational Models of Theory of Mind and the Imitative Reinforcement Learning in Spiking Neural Networks

Ganjtabesh,

Mohammadi

2023

Preprint

View full text Add to dashboard Cite

Theory of Mind is referred to the ability of inferring other's mental states, and it plays a crucial role in social cognition and learning. Biological evidences indicate that complex circuits are involved in this ability, including the mirror neuron system. The mirror neuron system influences imitation abilities and action understanding, leading to learn through observing others. To simulate this imitative learning behavior, a Theory-of-Mind-based Imitative Reinforcement Learning (ToM-based ImRL) framework is proposed. Employing the bio-inspired spiking neural networks and the mechanisms of the mirror neuron system, ToM-based ImRL is a bio-inspired computational model which enables an agent to effectively learn how to act in an interactive environment through observing an expert, inferring its goals, and imitating its behaviors. The aim of this paper is to review some computational attempts in modeling ToM and to explain the proposed ToM-based ImRL framework which is tested in the environment of River Raid game from Atari 2600 series.

show abstract

Imitation as a model-free process in human reinforcement learning

Cited by 3 publications

References 38 publications

Reinforcement learning with human advice: a survey

Reinforcement learning with human advice: a survey

On computational models of theory of mind and the imitative reinforcement learning in spiking neural networks

On Computational Models of Theory of Mind and the Imitative Reinforcement Learning in Spiking Neural Networks

Contact Info

Product

Resources

About