2007
DOI: 10.1007/s10107-007-0201-x
|View full text |Cite
|
Sign up to set email alerts
|

A stochastic gradient type algorithm for closed-loop problems

Abstract: We focus on solving closed-loop stochastic problems, and propose a perturbed gradient algorithm to achieve this goal. The main hurdle in such problems is the fact that the control variables are infinite dimensional, and have hence to be represented in a finite way in order to numerically solve the problem. In the same way, the gradient of the criterion is itself an infinite dimensional object. Our algorithm replaces this exact (and unknown) gradient by a perturbed one, which consists in the product of the true… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2007
2007
2022
2022

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 21 publications
0
6
0
Order By: Relevance
“…In fact, this setting has been investigated in detail in the context of stochastic Nash games [28]. Further examples for stochastic approximation schemes in a Hilbert-space setting obeying Assumption 6 are [7,8] and [38]. We now discuss an example that further clarifies the requirements on the estimator.…”
Section: Assumption 6 (Asymptotically Unbiased So)mentioning
confidence: 99%
“…In fact, this setting has been investigated in detail in the context of stochastic Nash games [28]. Further examples for stochastic approximation schemes in a Hilbert-space setting obeying Assumption 6 are [7,8] and [38]. We now discuss an example that further clarifies the requirements on the estimator.…”
Section: Assumption 6 (Asymptotically Unbiased So)mentioning
confidence: 99%
“…The natural extension of these techniques in the closed-loop stochastic case (see Barty et al (2009)) fails to provide decomposed state dependent strategies. Indeed, the optimal strategy of a subproblem depends on the state of the whole system, and not only on the local state.…”
Section: Decomposition Approachmentioning
confidence: 99%
“…when controls do not rely on any observation, Cohen and Culioli (1990) proposed to take advantage of both decomposition techniques and stochastic gradient algorithms. These techniques have been extended in the closed-loop stochastic case by Barty, Roy, and Strugarek (2009), but so far 1 In the case of power management, the state dimension is usually the number of power units. they fail to provide decomposed state dependent strategies in the Markovian case.…”
Section: Introductionmentioning
confidence: 99%
“…when controls do not rely on any observation, [CC90] proposed to take advantage of both decomposition techniques and stochastic gradient algorithms. These techniques have been extended in the closed-loop stochastic case by [BRS07], but so far they fail to provide decomposed state dependent strategies in the Markovian case. This is because a subproblem's optimal strategy depends on the state of the whole system, not only on the local state.…”
Section: Introductionmentioning
confidence: 99%