2018
DOI: 10.3389/fncir.2018.00053
|View full text |Cite
|
Sign up to set email alerts
|

Eligibility Traces and Plasticity on Behavioral Time Scales: Experimental Support of NeoHebbian Three-Factor Learning Rules

Abstract: Most elementary behaviors such as moving the arm to grasp an object or walking into the next room to explore a museum evolve on the time scale of seconds; in contrast, neuronal action potentials occur on the time scale of a few milliseconds. Learning rules of the brain must therefore bridge the gap between these two different time scales. Modern theories of synaptic plasticity have postulated that the co-activation of pre- and postsynaptic neurons sets a flag at the synapse, called an eligibility trace, that l… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

18
333
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 258 publications
(376 citation statements)
references
References 155 publications
18
333
0
Order By: Relevance
“…In random e-prop the values of the weights B jk are randomly chosen and remain fixed, similar to broadcast alignment for feedforward networks Nøkland, 2016). Resulting synaptic plasticity rules (see Methods) look very similar to previously proposed plasticity rules (Gerstner et al, 2018). In particular they involve postsynaptic depolarization as one of the factors, similarly as the data-based rule in Clopath et al (2010), see section S6 in the supplement for an analysis.…”
Section: Mathematical Basis For E-propsupporting
confidence: 68%
See 3 more Smart Citations
“…In random e-prop the values of the weights B jk are randomly chosen and remain fixed, similar to broadcast alignment for feedforward networks Nøkland, 2016). Resulting synaptic plasticity rules (see Methods) look very similar to previously proposed plasticity rules (Gerstner et al, 2018). In particular they involve postsynaptic depolarization as one of the factors, similarly as the data-based rule in Clopath et al (2010), see section S6 in the supplement for an analysis.…”
Section: Mathematical Basis For E-propsupporting
confidence: 68%
“…In contrast to the supervised case where the learning signals depend on the deviation from an external target signal, the learning signals here are emitted when an action is taken and they express here how much this action deviates from the action mean that is currently proposed by the network. We show in Methods that reward-based e-prop yields local reward-based rules for synaptic plasticity that are in many aspects similar to ones that have previously been discussed in the literature (Gerstner et al, 2018). But those previously proposed rules estimated gradients of the policy essentially by correlating the noisy output of network neurons with rewards, which is known to be inefficient due to noisy gradient estimates.…”
Section: Reward-based E-propmentioning
confidence: 55%
See 2 more Smart Citations
“…The elongation of synapses is likely slower because the processes needed for the integration of additional AMPARs, namely the growth of presynaptic boutons, the enlargement of the PSD, and protein synthesis (Lisman, 2017), take time. This late stage of LTP can be described as "neoHebbian," as in addition to the Hebbian requirement of synchronized activation of pre-and postsynaptic neurons, a third factor marking the salience, novelty, or reward of the stimuli and signaled through neuromodulators, such as dopamine, is also involved (Gerstner, Lehmann, Liakoni, Corneil, & Brea, 2018;Lisman, 2017). Indeed, dopamine has been reported to promote dendritic protein synthesis in the CA1 hippocampal region as well as the enlargement of dendritic spines in the NAc when paired with pre-and postsynaptic spikes (Smith, Starck, Roberts, & Schuman, 2005;Yagishita et al, 2014).…”
Section: Discussionmentioning
confidence: 99%