2002
DOI: 10.1016/s0303-2647(02)00077-1
|View full text |Cite
|
Sign up to set email alerts
|

Isotropic sequence order learning using a novel linear algorithm in a closed loop behavioural system

Abstract: In this article, we present an isotropic algorithm for sequence order learning. Its central goal is to learn the causal relation between two (or more) inputs in order to react to the earliest incoming signal after successful learning (like in typical classical conditioning situations). We implement this algorithm in a behaving system (a robot) thereby creating a closed loop situation where the learner's actions influence its own sensor inputs to the end of creating an autonomous agent. Autonomous behaviour imp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2003
2003
2021
2021

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…Rate-based models rely on the average firing rate of a neuron and not on its precise firing time (Sutton and Barto, 1981;Kosco, 1986;Klopf, 1986Klopf, , 1988Kempter et al, 2001b;Porr and Wörgötter, 2002;Kistler, 2002;Porr and Wörgötter, 2003a). As such they are at first sight unrelated to the spike-based models.…”
Section: Network Models Of Stdpmentioning
confidence: 99%
“…Rate-based models rely on the average firing rate of a neuron and not on its precise firing time (Sutton and Barto, 1981;Kosco, 1986;Klopf, 1986Klopf, , 1988Kempter et al, 2001b;Porr and Wörgötter, 2002;Kistler, 2002;Porr and Wörgötter, 2003a). As such they are at first sight unrelated to the spike-based models.…”
Section: Network Models Of Stdpmentioning
confidence: 99%
“…The learning terminates when A P fends off the disturbance at node 3 precisely and persistently. In this case, the reflex loop is no longer evoked, E remains at zero, the learner undergoes no further changes, and thus, the SaR network has successfully generated the forward model of the reflex, in a similar fashion to the model-based learning paradigms presented in the work by Porr and Wörgötter (2002, 2003.…”
Section: The Control Error (E)mentioning
confidence: 95%
“…This lead to the assumption that the brain resembles an actor / critic architecture where dopamine, as the reward prediction error, drives synaptic changes in the striatum (Humphries et al, 2006). An-other interpretation of the actor / critic architecture is a nested closed-loop platform where an inner reflex loop generates an error signal which tunes an actor in an outer loop to create anticipatory actions, in other words, the actor generates a forward model of the reflex (Porr and Wörgötter, 2002). Thus the actor / critic architecture can be used for both model and model free learning.…”
Section: Introductionmentioning
confidence: 99%
“…Characteristic waveforms of the system in response to δ-pulse inputs are shown in figure 2a. The weight change can be calculated analytically by correlating the two resonator responses of H 0 and H 1 in the Laplace space (Porr & Wörgötter 2002). Figure 2b shows how the synaptic weight ρ 1 of the conditioned input changes, assuming identical bandpass filters for two inputs which occur with a time difference of T between them.…”
Section: The Neuronal Circuitmentioning
confidence: 99%