1987
DOI: 10.1137/0325070
|View full text |Cite
|
Sign up to set email alerts
|

Asymptotic Properties of Distributed and Communicating Stochastic Approximation Algorithms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

1
100
0

Year Published

1995
1995
2013
2013

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 110 publications
(101 citation statements)
references
References 10 publications
1
100
0
Order By: Relevance
“…Therefore we have a stochastic, asynchronous updating pattern, where a subset of an iterative process similar to (1.1) can be updated many times before the remaining components are selected for a single update. Based on this idea extensions to the standard theory have been examined such as those by Kushner and Yin [17,18]. Here however we follow the extension to asynchronous stochastic approximation provided by Borkar [9] and Konda and Borkar [14].…”
mentioning
confidence: 99%
See 2 more Smart Citations
“…Therefore we have a stochastic, asynchronous updating pattern, where a subset of an iterative process similar to (1.1) can be updated many times before the remaining components are selected for a single update. Based on this idea extensions to the standard theory have been examined such as those by Kushner and Yin [17,18]. Here however we follow the extension to asynchronous stochastic approximation provided by Borkar [9] and Konda and Borkar [14].…”
mentioning
confidence: 99%
“…Studying the performance of these processes can be carried out using the asynchronous stochastic approximation framework. However, the previous work in this area has focused on continuous, single-valued updates as discussed in the literature (see for example [9,14,17,18,24]). Furthermore some of the assumptions which are typically used are challenging to verify.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…Convergence of this distributed learning scheme is currently under investigation. More complicated distributed learning scenarios that incorporate partial information sharing between users (see, e.g., Kushner and Yin 1987) might be similarly considered.…”
Section: Discussionmentioning
confidence: 99%
“…Under quite general conditions, constrained Q-learning algorithms were considered in Chap. 12 of [5] and the convergence was proved via stochastic approximation methods for distributed and asynchronous recursive procedures ( [6] and [7]). In particular, we demonstrated that how one may deal with constraints such as a hypercube [−B, B] r , where r is the dimension of the state variable.…”
Section: Q-learning Algorithmsmentioning
confidence: 99%