1987
DOI: 10.1109/tsmc.1987.6499323
|View full text |Cite
|
Sign up to set email alerts
|

Learning automata with changing number of actions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
70
0

Year Published

1991
1991
2019
2019

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 124 publications
(70 citation statements)
references
References 0 publications
0
70
0
Order By: Relevance
“…Finally, the probability vector of the actions of the chosen subset is rescaled as 1 ̂ 1 · , for all . The absolute expediency and ε-optimality of the method described above have been proved in [17].…”
Section: Variable Action Set Learning Automatamentioning
confidence: 99%
See 2 more Smart Citations
“…Finally, the probability vector of the actions of the chosen subset is rescaled as 1 ̂ 1 · , for all . The absolute expediency and ε-optimality of the method described above have been proved in [17].…”
Section: Variable Action Set Learning Automatamentioning
confidence: 99%
“…A learning automaton [16,17,18] is an adaptive decision-making unit that improves its performance by learning how to choose the optimal action from a finite set of allowed actions through repeated interactions with a random environment. Learning automaton has been shown to perform well in graph theory [21,23,25,26,28], networking [18,20,22,24,27,29,30,31], and some other areas.…”
Section: Learning Automatamentioning
confidence: 99%
See 1 more Smart Citation
“…A learning automaton [11,12,13] is an adaptive decision-making unit that improves its performance by learning how to choose the optimal action from a finite set of allowed actions through repeated interactions with a random environment. The action is chosen at random based on a probability distribution kept over the action-set and at each instant the given action is served as the input to the random environment.…”
Section: Learning Automata Theorymentioning
confidence: 99%
“…A learning automaton [17][18][19] is an adaptive decision-making unit that improves its performance by learning how to choose the optimal action from a finite set of allowed actions through repeated interactions with a random environment. The action is chosen at random based on a probability distribution kept over the action-set and at each instant the given action is served as the input to the random environment.…”
Section: Learning Automata Theorymentioning
confidence: 99%