2021
DOI: 10.48550/arxiv.2104.03270
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Neural Network Approach for High-Dimensional Optimal Control Applied to Multi-Agent Path Finding

Derek Onken,
Levon Nurbekyan,
Xingjian Li
et al.

Abstract: We propose a neural network approach for solving high-dimensional optimal control problems arising in real-time applications. Our approach yields controls in a feedback form and can therefore handle uncertainties such as perturbations to the system's state. We accomplish this by fusing the Pontryagin Maximum Principle (PMP) and Hamilton-Jacobi-Bellman (HJB) approaches and parameterizing the value function with a neural network. We train our neural network model using the objective function of the control probl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 12 publications
(17 citation statements)
references
References 36 publications
0
16
0
Order By: Relevance
“…From the results in Section 5.3, we observe that SympOCnet can handle the path planning problem whose state space has dimension 512, and hence the method can potentially mitigate the CoD. In Section 5.4, we apply SympOCnet to a swarm path planning problem in [74], and demonstrate good performance and efficiency in path planning problems, where the agents move in a three-dimensional space.…”
Section: Penalty Methods In the Training Process Of Sympocnetmentioning
confidence: 95%
See 1 more Smart Citation
“…From the results in Section 5.3, we observe that SympOCnet can handle the path planning problem whose state space has dimension 512, and hence the method can potentially mitigate the CoD. In Section 5.4, we apply SympOCnet to a swarm path planning problem in [74], and demonstrate good performance and efficiency in path planning problems, where the agents move in a three-dimensional space.…”
Section: Penalty Methods In the Training Process Of Sympocnetmentioning
confidence: 95%
“…Multiple drones with obstacle avoidance in a three-dimensional space. We consider the three-dimensional swarm path planning example in [74]. To be specific, we consider M = 100 drones with radius 0.18.…”
Section: 3mentioning
confidence: 99%
“…Recently, deep neural networks have been applied to develop numerical methods that have demonstrated remarkable performance in overcoming the curse of dimensionality and solving high-dimensional HJB equation effectively [2,3,8,10,11,13]. [8] proposes to use traditional numerical methods to evaluate the solution of HJB equation at certain points, and then use these pre-computed data to train a neural network; relying on the generalization of neural networks, the numerical solution on the entire domain is then obtained.…”
Section: Introductionmentioning
confidence: 99%
“…However, because the nonlinear Feynman-Kac's lemma is involved in the reformulation procedure, Deep BSDE method can only handle some specific PDEs, while DGM is a more general approach. Some other works, e.g., [10] also parameterize the solution of HJB equation with a neural network, but the objective function to be optimized is directly chosen as the cost functional plus some regularization term. And the regularization term is then selected to be the deviation of the trial solution from the PDE and boundary conditions, which is in fact the objective function in DGM.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation