1995
DOI: 10.1109/21.467711
|View full text |Cite
|
Sign up to set email alerts
|

Convergence of teams and hierarchies of learning automata in connectionist systems

Abstract: Learning algorithms for feedforward connectionist systems in a reinforcement learning environment are developed and analyzed in this paper. The connectionist system is made of units of groups of learning automata. The learning algorithm used is the LR-I and the asymptotic behavior of this algorithm is approximated by an Ordinary Differential Equation (ODE) for low values of the learning parameter. This is done using weak convergence techniques. The reinforcement learning model is used to pose the goal of the s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
13
0

Year Published

1999
1999
2018
2018

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(13 citation statements)
references
References 10 publications
0
13
0
Order By: Relevance
“…In -model environments, the reinforcement signal lies in the interval a, b . Learning automata can be classified into two main families [11,[13][14][15][16][17][18][19]: fixed structure learning automata and variable structure learning automata. Variable structure learning automata are represented by a triple , , where is the set of inputs, is the set of actions, and is learning algorithm.…”
Section: Stochastic Graphmentioning
confidence: 99%
“…In -model environments, the reinforcement signal lies in the interval a, b . Learning automata can be classified into two main families [11,[13][14][15][16][17][18][19]: fixed structure learning automata and variable structure learning automata. Variable structure learning automata are represented by a triple , , where is the set of inputs, is the set of actions, and is learning algorithm.…”
Section: Stochastic Graphmentioning
confidence: 99%
“…A learning automaton [15][16][17][18][19][20][21] is an adaptive decision-making unit that improves its performance by learning how to choose the optimal action from a finite set of allowed actions through repeated interactions with a random environment. The action is chosen at random based on a probability distribution kept over the action-set and at each instant the given action is served as the input to the random environment.…”
Section: Learning Automatamentioning
confidence: 99%
“…The environment can be described by a triple Learning automata can be classified into two main families [15][16][17][18][19][20]: fixed structure learning automata and variable structure learning automata. Variable structure learning automata are represented by a triple…”
Section: Learning Automatamentioning
confidence: 99%
“…A learning automaton [32][33][34][35][36][37][38] is an adaptive decision-making unit that improves its performance by learning how to choose the optimal action from a finite set of allowed actions through repeated interactions with a random environment. The action is chosen at random based on a probability distribution kept over the action set and at each instant the given action is served as the input to the random environment.…”
Section: Learning Automatamentioning
confidence: 99%