1996
DOI: 10.1109/3477.517033
|View full text |Cite
|
Sign up to set email alerts
|

Finite time analysis of the pursuit algorithm for learning automata

Abstract: The problem of analyzing the finite time behavior of learning automata is considered. This problem involves the finite time analysis of the learning algorithm used by the learning automaton and is important in determining the rate of convergence of the automaton. In this paper, a general framework for analyzing the finite time behavior of the automaton learning algorithms is proposed. Using this framework, the finite time analysis of the Pursuit Algorithm is presented. We have considered both continuous and di… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
52
0

Year Published

2005
2005
2018
2018

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 40 publications
(52 citation statements)
references
References 11 publications
0
52
0
Order By: Relevance
“…In other words, the proofs of these results are already found in the literature, namely in [26] and in [23] respectively. The proofs are thus not repeated here.…”
Section: 1mentioning
confidence: 56%
See 1 more Smart Citation
“…In other words, the proofs of these results are already found in the literature, namely in [26] and in [23] respectively. The proofs are thus not repeated here.…”
Section: 1mentioning
confidence: 56%
“…The proofs for the convergence of PAs have been studied and reported for decades in [6], [23], [24], [25] [26], which, unfortunately, all have a common flaw that has been recently discovered by the authors of [27]. Further, the authors of [27] submitted a new proof for the convergence of the CPA which adequately rectified the flawed proofs.…”
Section: Proofs Of Pasmentioning
confidence: 99%
“…The formal "corrected" proof for the finite time analysis of the DPA [2] remains open. It is currently being investigated.…”
Section: Discussionmentioning
confidence: 99%
“…Hence the dilemma! Prior Proofs: The ε-optimality of the families of PAs have been presented in [2], [13], [10], [11], and [12]. The basic result stated in these papers is that by utilizing a sufficiently small value for the learning parameter (or resolution), both the CPA and the DPA will converge to the optimal action with an arbitrarily large probability.…”
Section: Proof Complexity For Easmentioning
confidence: 99%
See 1 more Smart Citation