Maneuvering penetration strategies of ballistic missiles based on deep reinforcement learning

Qiu, Xiaoqi; Gao, Cai; Jing, Wuxing

doi:10.1177/09544100221088361

Cited by 14 publications

(6 citation statements)

References 31 publications


“…21,22 Maneuvring penetration strategies of ballistic missiles are proposed based on DRL. 23 A guidance strategy for spacecraft proximity operations is proposed based on DRL. 24 So, DRL has a powerful capability of solving problems with uncertainties offline.…”

Section: Introduction (mentioning)

confidence: 99%

Calculate the ignition height of the vertical landing phase online for the reusable rocket

Cheng, Jing, Gao (2024)

Proceedings of the Institution of Mechanical Engineers, Part G:


For the vertical landing phase of the reusable rocket, in order to improve the landing accuracy with consideration of multiple uncertainties, a novel strategy to calculate the ignition height online is proposed based on polynomial guidance law (PGL), particle swarm optimization (PSO), and deep reinforcement learning (DRL). Firstly, a deep neural network (DNN) is designed to describe the relationship between the state of the reusable rocket and the ignition height. To accomplish the guidance task of the vertical landing phase, PGL is modified by introducing the estimated aerodynamic acceleration. Through simulation, the output range of the DNN is estimated by the modified PSO. Then, the reward function is shaped and the parameters of the DNN are trained on a training set of simulation scenarios by the DRL algorithm. Finally, to demonstrate the effectiveness of the proposed strategy, the trained DNN is used to calculate the ignition height of 1500 unlearned simulation scenarios online. The numerical simulation results show that the proposed strategy has higher landing accuracy and lower fuel consumption than the offline strategy of fixed ignition height based on the modified PSO.
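The abstract above names particle swarm optimization as the tool for estimating a range of ignition heights. As a rough illustration only, the following is a textbook PSO over a single scalar, not the modified PSO of the cited paper. The function name `pso_minimize`, the quadratic stand-in cost, the bounds, and all coefficients are hypothetical choices made for this sketch.

```python
import random


def pso_minimize(cost, lower, upper, n_particles=20, n_iters=100,
                 w=0.7, c1=1.5, c2=1.5, seed=0):
    """Plain 1-D particle swarm optimization (textbook form, assumed parameters)."""
    rng = random.Random(seed)
    # Random initial positions inside the bounds, zero initial velocities.
    pos = [rng.uniform(lower, upper) for _ in range(n_particles)]
    vel = [0.0] * n_particles
    # Each particle remembers its personal best position and cost.
    pbest = pos[:]
    pbest_cost = [cost(x) for x in pos]
    g = min(range(n_particles), key=lambda i: pbest_cost[i])
    gbest, gbest_cost = pbest[g], pbest_cost[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            r1, r2 = rng.random(), rng.random()
            # Inertia + pull toward personal best + pull toward global best.
            vel[i] = (w * vel[i]
                      + c1 * r1 * (pbest[i] - pos[i])
                      + c2 * r2 * (gbest - pos[i]))
            # Move and clamp to the search bounds.
            pos[i] = min(max(pos[i] + vel[i], lower), upper)
            c = cost(pos[i])
            if c < pbest_cost[i]:
                pbest[i], pbest_cost[i] = pos[i], c
                if c < gbest_cost:
                    gbest, gbest_cost = pos[i], c
    return gbest, gbest_cost


# Hypothetical stand-in for a landing-error cost with its minimum at 1200 m.
def toy_cost(height_m):
    return (height_m - 1200.0) ** 2


best_height, best_cost = pso_minimize(toy_cost, 500.0, 3000.0)
```

In a real study the cost would come from a landing simulation rather than a closed-form function; the quadratic here only makes the sketch self-checking.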


“…The trust region policy optimization (TRPO) algorithm was proposed to generate an interception guidance law (Chen et al, 2023). With an emphasis on the terminal evasion scenario, Qiu et al (2022), based on DRL, developed a maneuver evasion guidance method that took into account both guidance accuracy and evasion capabilities. In a different study (Jiang et al, 2022), the problem was reformulated as a Markov decision process (MDP), and an Actor-Critic (AC) framework-based DRL algorithm was used to solve it to suggest the anti-interception guiding law.…”

Section: Introduction (mentioning)

confidence: 99%

Intelligent maneuver strategy for hypersonic vehicles in three-player pursuit-evasion games via deep reinforcement learning

Yan, Jiang, et al. (2024)

Front. Neurosci.


Aiming at the rapid development of anti-hypersonic collaborative interception technology, this paper designs an intelligent maneuver strategy of hypersonic vehicles (HV) based on deep reinforcement learning (DRL) to evade the collaborative interception by two interceptors. Under the meticulously designed collaborative interception strategy, the uncertainty and difficulty of evasion are significantly increased and the opportunity for maneuvers is further compressed. This paper, accordingly, selects the twin delayed deep deterministic gradient (TD3) strategy acting on the continuous action space and makes targeted improvements combining deep neural networks to grasp the maneuver strategy and achieve successful evasion. Focusing on the time-coordinated interception strategy of two interceptors, the three-player pursuit and evasion (PE) problem is modeled as the Markov decision process, and the double training strategy is proposed to juggle both interceptors. In reward functions of the training process, the energy saving factor is set to achieve the trade-off between miss distance and energy consumption. In addition, the regression neural network is introduced into the deep neural network of TD3 to enhance intelligent maneuver strategies’ generalization. Finally, numerical simulations are conducted to verify that the improved TD3 algorithm can effectively evade the collaborative interception of two interceptors under tough situations, and the improvements of the algorithm in terms of convergence speed, generalization, and energy-saving effect are verified.


“…The intelligent game maneuver adopts a closed-loop maneuver scheme of "interceptor movement-situational awareness-maneuver strategy generation-maneuver control implementation" that realizes timely maneuvering to increase miss distance and increase evasion probability. The key to intelligent game maneuver lies in the selection of intelligent algorithms. Among the intelligent algorithms associated with hypersonic aircraft, deep learning (DL) and reinforcement learning (RL) are the first to bear the brunt [21][22][23][24][25][26][27][28][29][30][31][32]. Due to its strong nonlinear fitting ability, the deep neural network (DNN) in DL has been widely used in the PE problems of hypersonic aircraft [21][22][23].…”

Section: Introduction (mentioning)

confidence: 99%

“…Among these, the most prevalent study [21] resolves the tension between the accuracy and speed of the IPP by building an IPP neural network model after using the ballistic model to create training data. And the algorithms of reinforcement learning, especially deep reinforcement learning (DRL), provide a new approach to the design of HVs' evasion strategies [24][25][26][27][28][29][30][31][32]. As an unsupervised heuristic algorithm without an accurate model, RL and DRL can generate actions based on the interaction with the environment, that is, conduct intelligent maneuvering games based on both attack and defense sides.…”

Section: Introduction (mentioning)

confidence: 99%

Intelligent Maneuver Strategy for a Hypersonic Pursuit-Evasion Game Based on Deep Reinforcement Learning

Guo, Jiang, Huang, et al. (2023)

Aerospace


In order to improve the problem of overly relying on situational information, high computational power requirements, and weak adaptability of traditional maneuver methods used by hypersonic vehicles (HV), an intelligent maneuver strategy combining deep reinforcement learning (DRL) and deep neural network (DNN) is proposed to solve the hypersonic pursuit–evasion (PE) game problem under tough head-on situations. The twin delayed deep deterministic (TD3) gradient strategy algorithm is utilized to explore potential maneuver instructions, the DNN is used to fit to broaden application scenarios, and the intelligent maneuver strategy is generated with the initial situation of both the pursuit and evasion sides as the input and the maneuver game overload of the HV as the output. In addition, the experience pool classification strategy is proposed to improve the training convergence and rate of the TD3 algorithm. A set of reward functions is designed to achieve adaptive adjustment of evasion miss distance and energy consumption under different initial situations. The simulation results verify the feasibility and effectiveness of the above intelligent maneuver strategy in dealing with the PE game problem of HV under difficult situations, and the proposed improvement strategies are validated as well.
