2020
DOI: 10.1109/access.2020.3017480
|View full text |Cite
|
Sign up to set email alerts
|

Range-Aware Impact Angle Guidance Law With Deep Reinforcement Meta-Learning

Abstract: In this paper, a new guidance law is proposed for impact angle constrained missile with timevarying velocity against a maneuvering target. The proposed guidance law is based on model-based deep reinforcement learning (RL) technique where a deep neural network is trained to be a predictive model used in model predictive path integral (MPPI) control. Tube-MPPI, a robust approach utilizing ancillary controller for disturbance rejection, is introduced in guidance law design in this work to deal with the MPPI degra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 34 publications
(56 reference statements)
0
3
0
Order By: Relevance
“…In [19], a finite-time dual-layer guidance law considering the second-order autopilot dynamics was designed, and an extended state observers are used to estimate the unknown target maneuver and acceleration derivative of the missile. with the development of artificial intelligence technology, neural networks [20][21][22] and deep reinforcement learning technique [23][24][25][26][27][28] have also begun to be applied for the design of guidance laws to estimate the uncertain dynamics of maneuvering targets. Moreover, in order to reduce the computational cost of the neural networks, a non-fragile quantitative prescribed performance control method was developed in [29], a simplified finite-time fuzzy neural controller has been introduced in [30], an adaptive critic design-based fuzzy neural controller was proposed in [31].…”
Section: Introductionmentioning
confidence: 99%
“…In [19], a finite-time dual-layer guidance law considering the second-order autopilot dynamics was designed, and an extended state observers are used to estimate the unknown target maneuver and acceleration derivative of the missile. with the development of artificial intelligence technology, neural networks [20][21][22] and deep reinforcement learning technique [23][24][25][26][27][28] have also begun to be applied for the design of guidance laws to estimate the uncertain dynamics of maneuvering targets. Moreover, in order to reduce the computational cost of the neural networks, a non-fragile quantitative prescribed performance control method was developed in [29], a simplified finite-time fuzzy neural controller has been introduced in [30], an adaptive critic design-based fuzzy neural controller was proposed in [31].…”
Section: Introductionmentioning
confidence: 99%
“…Different from the UAVs, the speed of missile is faster and the direction is more difficult to control [19,20]. An adaptive guidance system using the reinforcement meta-learning with recurrent network was proposed in [21]. A model-based DRL method was presented in [22] to predict the model of the guidance dynamics, and the predicted result was incorporated into a model predictive path integral control framework.…”
Section: Introductionmentioning
confidence: 99%
“…Some of these algorithms have achieved satisfactory results in coping with problems with low complexity and accuracy requirements. In [4], [5] and [13], the state-of-the-art reinforcement learning frameworks have demonstrated their effectiveness in the guidance task. Zhang et.al proposed a gradient-descent-based reinforcement learning method in the actor-critic framework and achieved consensus control for multiagent systems by following a tracking leader [14].…”
Section: Introductionmentioning
confidence: 99%