2023
DOI: 10.3390/e25020208
|View full text |Cite
|
Sign up to set email alerts
|

Forward-Backward Sweep Method for the System of HJB-FP Equations in Memory-Limited Partially Observable Stochastic Control

Abstract: Memory-limited partially observable stochastic control (ML-POSC) is the stochastic optimal control problem under incomplete information and memory limitation. To obtain the optimal control function of ML-POSC, a system of the forward Fokker–Planck (FP) equation and the backward Hamilton–Jacobi–Bellman (HJB) equation needs to be solved. In this work, we first show that the system of HJB-FP equations can be interpreted via Pontryagin’s minimum principle on the probability density function space. Based on this in… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(28 citation statements)
references
References 58 publications
0
1
0
Order By: Relevance
“…Future work could include expanding our formalism to noisy dynamics, noisy control [25,[65][66][67][68], imperfect information or to study the robustness of the control, similar to [40,69,70]. There is also the intriguing possibility of allowing individuals to directly influence government [71] in the same way that ε allows the government to influence individuals.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…Future work could include expanding our formalism to noisy dynamics, noisy control [25,[65][66][67][68], imperfect information or to study the robustness of the control, similar to [40,69,70]. There is also the intriguing possibility of allowing individuals to directly influence government [71] in the same way that ε allows the government to influence individuals.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…These effects can result in the existence of (multiple) stationary states [47,49]. This and other complexities could be included in our approach in principle, such as multiple agent types with different risk and behaviour profiles [19,[50][51][52][53], seasonal effects [54], stochastic control [13,[55][56][57][58], or decentralised optimal behaviour via interventions orchestrated by a benevolent social planner [27][28][29][30][31]. We reserve these developments for future studies.…”
Section: Introductionmentioning
confidence: 99%
“…In order to address these issues, we propose an alternative theoretical framework to DSC, memory-limited DSC (ML-DSC), which is the decentralized version of memorylimited POSC (ML-POSC) [23,24]. The two major difficulties in DSC originate from the ignorance of constraints over controllers when we derive the optimal estimation and control solution.…”
Section: Introductionmentioning
confidence: 99%
“…Specifically, we address the optimization problem by converting ML-DSC in the state space into the deterministic optimal control problem in the probability density function space. This technique has recently been used in mean-field stochastic control [25,26] and ML-POSC [23,24], and is also effective for ML-DSC. Following that, we can solve ML-DSC in a similar way to the deterministic optimal control problem on the probability density function space; the optimal control function of ML-DSC was obtained by jointly solving the Hamilton-Jacobi-Bellman (HJB) equation and the Fokker-Planck (FP) equation.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation