1982
DOI: 10.1137/0320039
|View full text |Cite
|
Sign up to set email alerts
|

Learning Algorithms for Two-Person Zero-Sum Stochastic Games with Incomplete Information: A Unified Approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
41
0

Year Published

1986
1986
2019
2019

Publication Types

Select...
3
3
3

Relationship

0
9

Authors

Journals

citations
Cited by 57 publications
(43 citation statements)
references
References 15 publications
2
41
0
Order By: Relevance
“…Proof It has been proved in [13,17] that there always exists a 0 > x under which Equation (19) is satisfied, if under which the probability of converging Algorithm 1 to the spanning tree with the minimum expected weight is greater than ) 1 ( ε − and hence the proof of the theorem. ■…”
Section: Lemmamentioning
confidence: 95%
See 1 more Smart Citation
“…Proof It has been proved in [13,17] that there always exists a 0 > x under which Equation (19) is satisfied, if under which the probability of converging Algorithm 1 to the spanning tree with the minimum expected weight is greater than ) 1 ( ε − and hence the proof of the theorem. ■…”
Section: Lemmamentioning
confidence: 95%
“…In -model environments, the reinforcement signal lies in the interval a, b . Learning automata can be classified into two main families [11,[13][14][15][16][17][18][19]: fixed structure learning automata and variable structure learning automata. Variable structure learning automata are represented by a triple , , where is the set of inputs, is the set of actions, and is learning algorithm.…”
Section: Stochastic Graphmentioning
confidence: 99%
“…A learning automaton [11][12][13][14][15][16][17][18] is an adaptive decisionmaking unit that improves its performance by learning how to choose the optimal action from a finite set of allowed actions through repeated interactions with a random environment. Learning automata can be classified into two main families: fixed structure learning automata and variable structure learning automata.…”
Section: A Learning Automatamentioning
confidence: 99%
“…It can be shown that 36,48 for every error parameter ε ∈ (0, 1) there exists a value of xunder which the Eq. (19) is satisfied, and we have…”
Section: Theoremmentioning
confidence: 99%