2021
DOI: 10.1016/j.tins.2021.07.007
|View full text |Cite
|
Sign up to set email alerts
|

Learning offline: memory replay in biological and artificial reinforcement learning

Abstract: Learning to act in an environment to maximise rewards is among the brain's key functions. This process has often been conceptualised within the framework of reinforcement learning, which has also gained prominence in machine learning and artificial intelligence (AI) as a way to optimise decision-making. A common aspect of both biological and machine reinforcement learning is the reactivation of previously experienced episodes, referred to as replay. Replay is important for memory consolidation in biological ne… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
11
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
6
2
2

Relationship

0
10

Authors

Journals

citations
Cited by 23 publications
(12 citation statements)
references
References 122 publications
(230 reference statements)
1
11
0
Order By: Relevance
“…In particular, the opportunity cost hypothesis holds that cognitive effort occurs when the reward value of the current task under execution is less than the potential reward value of an alternative task, thereby promoting a shift to the other task (Agrawal et al, 2022;Boureau et al, 2015;Inzlicht et al, 2014;Kurzban et al, 2013). Cognitive effort could also induce pauses in cognitive activity that allow new memories to be consolidated in cortex (Agrawal et al, 2022;Holroyd & Verguts, 2021), consistent with computational arguments (Roscow et al, 2021) and biological (Holroyd & Verguts, 2021) and behavioral (Gershman et al, 2014;Wamsley, 2022) evidence of this.…”
Section: Summary and Discussionmentioning
confidence: 59%
“…In particular, the opportunity cost hypothesis holds that cognitive effort occurs when the reward value of the current task under execution is less than the potential reward value of an alternative task, thereby promoting a shift to the other task (Agrawal et al, 2022;Boureau et al, 2015;Inzlicht et al, 2014;Kurzban et al, 2013). Cognitive effort could also induce pauses in cognitive activity that allow new memories to be consolidated in cortex (Agrawal et al, 2022;Holroyd & Verguts, 2021), consistent with computational arguments (Roscow et al, 2021) and biological (Holroyd & Verguts, 2021) and behavioral (Gershman et al, 2014;Wamsley, 2022) evidence of this.…”
Section: Summary and Discussionmentioning
confidence: 59%
“…Several studies show that awake replay is required for the acquisition of a task rule that requires short-term memory (Jadhav et al 2012; Igata, Ikegaya, and Sasaki 2021; Fernández-Ruiz et al 2019). By extension, several computational models that have virtual agents learn a similar foraging task have significantly increased the learning performance by incorporating replay like activity (Mattar and Daw 2018; Hayes et al 2021; Roscow et al 2021; Johnson and Redish 2005). Learning a new task rule, however, is a complex iterative behavior that requires different brain functions, such as information registration, pattern inference, revision and goal-directed modulations, to coordinate together at different time scales.…”
Section: Discussionmentioning
confidence: 99%
“…The procedure was identical to Experiment 2a, except that participants were required to rest for at least 7 minutes (enforced as countdown on the screen) following the learning stage of the superordinate task phase. The rest period was included to promote memory replay and integration (Roscow et al, 2021).…”
Section: Methodsmentioning
confidence: 99%