2021
DOI: 10.48550/arxiv.2108.06740
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A fast iterative PDE-based algorithm for feedback controls of nonsmooth mean-field control problems

Abstract: A PDE-based accelerated gradient algorithm is proposed to seek optimal feedback controls of McKean-Vlasov dynamics subject to nonsmooth costs, whose coefficients involve mean-field interactions both on the state and action. It exploits a forward-backward splitting approach and iteratively refines the approximate controls based on the gradients of smooth costs, the proximal maps of nonsmooth costs, and dynamically updated momentum parameters. At each step, the state dynamics is realized via a particle approxima… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 6 publications
(10 citation statements)
references
References 48 publications
0
10
0
Order By: Relevance
“…In this case, (1.7) can be viewed as an infinite-dimensional extension of the iterative shrinkage-thresholding algorithm (see [5,32]).…”
Section: Standing Assumptions and Main Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…In this case, (1.7) can be viewed as an infinite-dimensional extension of the iterative shrinkage-thresholding algorithm (see [5,32]).…”
Section: Standing Assumptions and Main Resultsmentioning
confidence: 99%
“…) is the Fréchet derivative of the differentiable component of J(•; ξ 0 ) at the iterate α m (see [39,1]), while the function prox τ ℓ can be identified as the proximal map of the nonsmooth component of J(•; ξ 0 ). We refer the reader to [32] for a detailed derivation of the algorithm and to [21,34] for similar gradient-based algorithms without the nonsmooth term ℓ.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Among the numerical methods proposed for mean-field control problems, we refer to [21] for a policy gradient-type method where feedback controls are approximated by neural networks and optimised for a given objective function; to [32] and again to [21] for a mean-field FBSDE method, generalising the deep BSDE method to mean-field dependence and in the former case to delayed effects; to [48] for a hybrid model where the mean-field distribution is approximated by a particle system and the control is obtained by numerical approximation of a PDE; and to [6] for a survey of methods for the coupled PDE systems, mainly in the spirit of the seminal works [3,2,5]; see also a related semi-Lagrangian scheme in [16]; a gradient method and penalisation approach in [44]; and a recent analysis of policy iteration in [38].…”
Section: Numerics For Mean-field Control Problemsmentioning
confidence: 99%
“…with η ∈ R and δ ≥ 0 such that δ 2 ≤ η 2 . The effect of this modification is that in (48) Λt is replaced by (1 − δ) Λt and additionally the term…”
Section: Solution Via Riccati Odesmentioning
confidence: 99%