Optimal game theoretic solution of the pursuit‐evasion intercept problem using on‐policy reinforcement learning

Kartal, Yusuf; Subbarao, Kamesh; Dogan, Atilla; Lewis, Frank L.

doi:10.1002/rnc.5719

Cited by 20 publications

(13 citation statements)

References 31 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We can obtain the controls of pursuer and evader which should be adopted in the next interval as (30):…”

Section: Policy Iterationmentioning

confidence: 99%

“…However, the system information about both sides of the game must be obtained completely. Kartal et al [30] used the synchronous tuning algorithm in the pursuit-evasion game of the first-order system to obtain the capture conditions of agents in the game and reached the Nash equilibrium. Zhang et al [31] and Li et al [32] determined the scheme's feasibility in distributed systems.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Solution for Pursuit-Evasion Game of Agents by Adaptive Dynamic Programming

Gong

Liu

et al. 2023

Electronics

View full text Add to dashboard Cite

The paper studies a novel method for real-time solutions of the two-player pursuit-evasion game. The min-max principle is adopted to confirm the Nash equilibrium of the game. As agents in the game can form an Internet of Things (IoT) system, the real-time control law of each agent is obtained by taking a linear-quadratic cost function in adaptive dynamic programming. By introducing the Lyapunov function, we consider the scenario when capture occurs. Since most actual systems are continuous, the policy iteration algorithm is used to make the real-time policy converge to the analytical solution of the Nash equilibrium. Furthermore, we employ the value function approximation method to calculate the neural network parameters without directly solving the Hamilton–Jacobi–Isaacs equation. Simulation results depict the method’s feasibility in different scenarios of the pursuit-evasion game.

show abstract

“…We can obtain the controls of pursuer and evader which should be adopted in the next interval as (30):…”

Section: Policy Iterationmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Solution for Pursuit-Evasion Game of Agents by Adaptive Dynamic Programming

Gong

Liu

et al. 2023

Electronics

View full text Add to dashboard Cite

show abstract

“… Due to the limitation of the driving force, the input of the movement mode of the pursuer and the evader is limited to the constants

and

. The input of the movement mode of the pursuer also satisfies a certain proportion relationship [ 23 , 24 ]. Because of the above assumption, the pursuer is slower than the evader, i.e.,

The location of the line of defense is determined, as are the number of pursuers and evaders.…”

Section: Problem Formulationmentioning

confidence: 99%

Optimal-Damage-Effectiveness Cooperative-Control Strategy for the Pursuit–Evasion Problem with Multiple Guided Missiles

Dai

et al. 2022

Sensors

View full text Add to dashboard Cite

In this paper, an optimal-damage-effectiveness cooperative-control strategy based on a damage-efficiency model and a virtual-force method is proposed to solve the pursuit–evasion problem with multiple guided missiles. Firstly, different from the overly ideal assumption in the traditional pursuit–evasion problem, an optimization problem that maximizes the damage efficiency is established and solved, making the optimal-damage-effectiveness strategy more meaningful for practical applications. Secondly, a modified virtual-force method is proposed to obtain this optimal-damage-effectiveness control strategy, which solves the numerical solution challenges brought by the high-complexity damage function. Thirdly, adaptive gain is designed in this strategy based on guidance-integrated fuze technology to achieve robust maximum damage efficiency in unpredictable interception conditions. Finally, the effectiveness and robustness of the proposed strategy are verified by numerical simulations.

show abstract

“…[7][8][9][10] Likewise, perimeter-defense problems are another variant of PEGs wherein the defender team is tasked to capture the intruders before the latter breach the target perimeter. Hamilton-Jacobi-Bellman-Isaacs equation is one of the conventional tools to address perimeter-defense problems, 11,12 however, this is not suitable for team games or the type of sequential arrival games considered in this work. A perimeter defense problem in a planar conical environment is studied 13 recently where two algorithms were presented.…”

Section: Introductionmentioning

confidence: 99%

Target defense against a sequentially arriving cooperative intruder team

Pourghorban¹,

Maity²

2023

Open Architecture/Open Business Model Net-Centric Systems and Defense Transformation 2023

View full text Add to dashboard Cite

We consider a variant of the target defense problem where a single defender is tasked to guard a target region from a sequence of incoming intruders. Each intruder's objective is to breach the target boundary without being captured and the defender's objective is to capture as many intruders as possible. The intruders appear sequentially on a fixed circle surrounding the target, resulting in a sequence of 1-vs-1 games between the defender and the intruders. Each 1-vs-1 game is terminated when the target is breached or the intruder is captured. The defender has to start the next game as soon as the current game ends. Each intruder knows the entry point of the last intruder and this information is used to find an optimal entry point. Each game is analyzed by dividing it into two phases: partial information and full information phase. We utilize the notions of engagement surface and capture circle to analyze the strategies for the defender as well as the intruders. Furthermore, we analytically compute the capture percentage for both finite and infinite sequences of intruder arrivals. Finally, the theoretical results are verified through numerical examples using Monte-Carlo type random trials of experiments.

show abstract

Optimal game theoretic solution of the pursuit‐evasion intercept problem using on‐policy reinforcement learning

Cited by 20 publications

References 31 publications

Solution for Pursuit-Evasion Game of Agents by Adaptive Dynamic Programming

Solution for Pursuit-Evasion Game of Agents by Adaptive Dynamic Programming

Optimal-Damage-Effectiveness Cooperative-Control Strategy for the Pursuit–Evasion Problem with Multiple Guided Missiles

Target defense against a sequentially arriving cooperative intruder team

Contact Info

Product

Resources

About