1994
DOI: 10.1016/0016-0032(94)90039-6
|View full text |Cite
|
Sign up to set email alerts
|

Continuous action set learning automata for stochastic optimization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
45
0
1

Year Published

2003
2003
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 69 publications
(46 citation statements)
references
References 9 publications
0
45
0
1
Order By: Relevance
“…and its objective is find an action α that minimizes M (α). A CALA, whose action probability distribution at instant n is Gaussian with mean µ n and standard deviation σ n , is introduced in 33 . At instance n, the action chosen by CALA can be represented by a pair (α n , µ n ), where α n is chosen from distribution N (µ n , σ n ) and µ n is the mean of the Gaussian distribution.…”
Section: Stochastic Learning Automatamentioning
confidence: 99%
See 2 more Smart Citations
“…and its objective is find an action α that minimizes M (α). A CALA, whose action probability distribution at instant n is Gaussian with mean µ n and standard deviation σ n , is introduced in 33 . At instance n, the action chosen by CALA can be represented by a pair (α n , µ n ), where α n is chosen from distribution N (µ n , σ n ) and µ n is the mean of the Gaussian distribution.…”
Section: Stochastic Learning Automatamentioning
confidence: 99%
“…For this algorithm it is shown that with arbitrary large probability, µ n will converge close to α * and σ n will converge close to σ L , if we choose µ and σ L sufficiently small and K sufficiently large 33 . Beigy and Meybodi proposed a CALA whose action probability distribution at instant n is Gaussian with mean µ n and standard deviation σ n 34 .…”
Section: Stochastic Learning Automatamentioning
confidence: 99%
See 1 more Smart Citation
“…But unlike the solutions reported in [16], [17], [19], the solution here is far more consequential because the system does not rely on a Teacher or "Oracle" instructing the LA which way it should move. Thus, our solution will have applications in all the areas mentioned earlier for which the GG has found direct applications [9], [10], and for the areas where the entire field of LA and stochastic learning, has found uni-modal optimization applications from a finite or infinite action set [1]- [3], [5], [6], [18], [20].…”
Section: B Salient Aspects Of the Papermentioning
confidence: 99%
“…Age-old techniques resort to moving toward the optimal point by using the first derivative of the criterion function, and where these updates are, for example, either proportional to the derivative(s) or involve the second derivative(s). In a related manner, the SPL is also associated to the field of learning automata (LA) in which the action space has an infinite number of actions [52]. In this case, one would not resort to invoking the derivatives of the criterion, but rather to sampling the action space and moving appropriately within the space.…”
mentioning
confidence: 99%