2019 18th European Control Conference (ECC) 2019
DOI: 10.23919/ecc.2019.8795922
|View full text |Cite
|
Sign up to set email alerts
|

Learning continuous $Q$-functions using generalized Benders cuts

Abstract: Q-functions are widely used in discrete-time learning and control to model future costs arising from a given control policy, when the initial state and input are given. Although some of their properties are understood, Q-functions generating optimal policies for continuous problems are usually hard to compute. Even when a system model is available, optimal control is generally difficult to achieve except in rare cases where an analytical solution happens to exist, or an explicit exact solution can be computed.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
4
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 16 publications
(35 reference statements)
0
4
0
Order By: Relevance
“…This results in an approximation which lower bounds Q (N ) everywhere. Following [9], [13] we choose the cuts to be linearly separable in their arguments:…”
Section: B Reformulated Benders' Cutsmentioning
confidence: 99%
See 1 more Smart Citation
“…This results in an approximation which lower bounds Q (N ) everywhere. Following [9], [13] we choose the cuts to be linearly separable in their arguments:…”
Section: B Reformulated Benders' Cutsmentioning
confidence: 99%
“…The proof is an adaptation of the proof of Lemma III.1 in [9] that uses the properties of the dual variables to prove the global lower bounding property for a cut added at any [13] carry over to the present setting and describe the conditions under which new cuts improve the approximation and by how much. We highlight that the addition of the new cut q I+1 at the parameter {s}…”
Section: B Reformulated Benders' Cutsmentioning
confidence: 99%
“…The proof is a simple adaptation of the proof of Lemma III.1 in [7]. Lemmas III.2-4 of [10] carry over to the present setting but are omitted in the interest of space. We highlight that the addition of the new cut q I+1 at the parameter {s} N -1 0 ∈ S (N ) (x 0 ) leads to an improvement of…”
Section: B Reformulated Benders' Cutsmentioning
confidence: 99%
“…This results in an approximation which lower bounds Q (N ) ⋆ everywhere. Following [7], [10] we choose the cuts to be linearly separable in their arguments:…”
Section: B Reformulated Benders' Cutsmentioning
confidence: 99%