A Neural Network Approach for High-Dimensional Optimal Control Applied to Multi-Agent Path Finding

Onken, Derek; Nurbekyan, Levon; Li, Xingjian; Fung, Samy Wu; Osher, Stanley; Ruthotto, Lars

doi:10.48550/arxiv.2104.03270

Cited by 12 publications

(17 citation statements)

References 36 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…From the results in Section 5.3, we observe that SympOCnet can handle the path planning problem whose state space has dimension 512, and hence the method can potentially mitigate the CoD. In Section 5.4, we apply SympOCnet to a swarm path planning problem in [74], and demonstrate good performance and efficiency in path planning problems, where the agents move in a three-dimensional space.…”

Section: Penalty Methods In the Training Process Of Sympocnetmentioning

confidence: 95%

See 1 more Smart Citation

SympOCnet: Solving optimal control problems with applications to high-dimensional multi-agent path planning problems

Meng¹,

Zhang²,

Darbon³

et al. 2022

Preprint

View full text Add to dashboard Cite

Solving high-dimensional optimal control problems in real-time is an important but challenging problem, with applications to multi-agent path planning problems, which have drawn increased attention given the growing popularity of drones in recent years. In this paper, we propose a novel neural network method called SympOCnet that applies the Symplectic network to solve high-dimensional optimal control problems with state constraints. We present several numerical results on path planning problems in two-dimensional and three-dimensional spaces. Specifically, we demonstrate that our SympOCnet can solve a problem with more than 500 dimensions in 1.5 hours on a single GPU, which shows the effectiveness and efficiency of SympOCnet. The proposed method is scalable and has the potential to solve truly high-dimensional path planning problems in real-time.

show abstract

Section: Penalty Methods In the Training Process Of Sympocnetmentioning

confidence: 95%

“…Multiple drones with obstacle avoidance in a three-dimensional space. We consider the three-dimensional swarm path planning example in [74]. To be specific, we consider M = 100 drones with radius 0.18.…”

Section: 3mentioning

confidence: 99%

SympOCnet: Solving optimal control problems with applications to high-dimensional multi-agent path planning problems

Meng¹,

Zhang²,

Darbon³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Recently, deep neural networks have been applied to develop numerical methods that have demonstrated remarkable performance in overcoming the curse of dimensionality and solving high-dimensional HJB equation effectively [2,3,8,10,11,13]. [8] proposes to use traditional numerical methods to evaluate the solution of HJB equation at certain points, and then use these pre-computed data to train a neural network; relying on the generalization of neural networks, the numerical solution on the entire domain is then obtained.…”

Section: Introductionmentioning

confidence: 99%

“…However, because the nonlinear Feynman-Kac's lemma is involved in the reformulation procedure, Deep BSDE method can only handle some specific PDEs, while DGM is a more general approach. Some other works, e.g., [10] also parameterize the solution of HJB equation with a neural network, but the objective function to be optimized is directly chosen as the cost functional plus some regularization term. And the regularization term is then selected to be the deviation of the trial solution from the PDE and boundary conditions, which is in fact the objective function in DGM.…”

Section: Introductionmentioning

confidence: 99%

“…[11] solves several HJB equations of a class of nonlinear control systems with Deep BSDE method, but the form of FBSDEs has been modified so that the solution of the forward stochastic differential equation (FSDE) is no longer a trajectory driven by pure noise. To some extent, [10] and [11] can be viewed as the corresponding advanced works of DGM and Deep BSDE in optimal control problems. They no longer consider solving a general PDE, but combine some other properties of HJB eqaution that are inherited from optimal control theory.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Deep BSDE-ML Learning and Its Application to Model-Free Optimal Control

Wang¹,

Ni²

2022

Preprint

View full text Add to dashboard Cite

A modified Deep BSDE (backward differential equation) learning method with measurability loss, called Deep BSDE-ML method, is introduced in this paper to solve a kind of linear decoupled forward-backward stochastic differential equations (FBSDEs), which is encountered in the policy evaluation of learning the optimal feedback policies of a class of stochastic control problems. The measurability loss is characterized via the measurability of BSDE's state at the forward initial time, which differs from that related to terminal state of the known Deep BSDE method. Though the minima of the two loss functions are shown to be equal, this measurability loss is proved to be equal to the expected mean squared error between the true diffusion term of BSDE and its approximation. This crucial observation extends the application of the Deep BSDE methodapproximating the gradients of the solution of a partial differential equation (PDE) instead of the solution itself.Simultaneously, a learning-based framework is introduced to search an optimal feedback control of a deterministic nonlinear system. Specifically, by introducing Gaussian exploration noise, we are aiming to learn a robust optimal controller under this stochastic case. This reformulation sacrifices the optimality to some extent, but as suggested in reinforcement learning (RL) exploration noise is essential to enable the model-free learning. The new stochastic optimal control problem is solved with general policy iteration methodology-repeating policy evaluation and policy improvment. Instead of fitting the value function, our policy evaluation approximates its gradient, thus can be seamlessly integrated with policy improvement without manually differentiating a neural network. By using the proposed Deep BSDE-ML method, this is achieved through optimizing the understood loss function in the FBSDE formulation. With some simulating tricks, the whole algorithm can be implemented in both model-based and model-free fashions. Compared with the Markov framework of RL, our method is built on the diffusion process, thus is preferred from a theoretical point of view for continuous-time and continuous-space tasks. Numerical Experiments suggest that the proposed model-free approach performs as good as its model-based counterpart.

show abstract

Multi-objective Optimal Control of Wastewater Treatment Process Based on Neural Network

Yu,

Ding,

2024

Lecture Notes in Electrical Engineering

View full text Add to dashboard Cite

A Neural Network Approach for High-Dimensional Optimal Control Applied to Multi-Agent Path Finding

Cited by 12 publications

References 36 publications

SympOCnet: Solving optimal control problems with applications to high-dimensional multi-agent path planning problems

SympOCnet: Solving optimal control problems with applications to high-dimensional multi-agent path planning problems

Deep BSDE-ML Learning and Its Application to Model-Free Optimal Control

Multi-objective Optimal Control of Wastewater Treatment Process Based on Neural Network

Contact Info

Product

Resources

About