2020
DOI: 10.1109/access.2020.3046284

Autonomous Control of Combat Unmanned Aerial Vehicles to Evade Surface-to-Air Missiles Using Deep Reinforcement Learning

Abstract: This paper proposes a new reinforcement learning approach for executing combat unmanned aerial vehicle (CUAV) missions. We consider missions with the following goals: guided missile avoidance, shortest-path flight, and formation flight. For reinforcement learning, the representation of the current agent state is important. We propose a novel method of using the coordinates and angle of a CUAV to effectively represent its state. Furthermore, we develop a reinforcement learning algorithm with enhanced exploration …
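The abstract's central idea is encoding a CUAV's state from its coordinates and angle. Since the abstract is truncated and the paper's exact encoding is not shown here, the following is only a plausible sketch under assumed 2-D conditions: relative offset to the target plus the heading encoded as sine/cosine. The function name and vector layout are hypothetical.

```python
import numpy as np

def cuav_state(pos, target, heading_rad):
    """Hypothetical CUAV state vector (not the paper's exact encoding):
    offset to the target plus heading as sin/cos, which avoids the
    angular discontinuity at 0/2*pi."""
    dx, dy = target[0] - pos[0], target[1] - pos[1]
    return np.array([dx, dy, np.sin(heading_rad), np.cos(heading_rad)],
                    dtype=np.float32)

# Example: CUAV at (10, 5) heading 45 degrees, target at (100, 80)
s = cuav_state((10.0, 5.0), (100.0, 80.0), np.deg2rad(45.0))
```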

Cited by 20 publications (19 citation statements)
References 18 publications
“…• Flock Centering: maintaining flight formation as suggested by Reynolds [95] involves three concepts: 1) flock centering, 2) obstacle avoidance, and 3) velocity matching. This topology was applied in several research papers [82, 96–99]. • Leader-Follower Flocking: the flock leader has its own mission of reaching the destination, while the followers (the other UAVs) flock with the leader, with the mission of maintaining distance and relative position to the leader [100, 101].…”
Section: Flocking
confidence: 99%
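The three rules named in this excerpt come from Reynolds' boids model. Below is a minimal NumPy sketch of one update step; the neighborhood radii, gains, and unit time step are illustrative assumptions, and plain separation stands in for full obstacle avoidance.

```python
import numpy as np

def boids_step(pos, vel, r=5.0, r_sep=1.5, k_coh=0.01, k_sep=0.05, k_ali=0.05):
    """One update of Reynolds' three flocking rules for N agents.
    pos, vel: (N, 2) arrays; radii and gains are illustrative."""
    new_vel = vel.copy()
    for i in range(len(pos)):
        d = np.linalg.norm(pos - pos[i], axis=1)
        nbrs = (d < r) & (d > 0)       # neighbors within radius r
        close = (d < r_sep) & (d > 0)  # neighbors that are too close
        if nbrs.any():
            # 1) flock centering: steer toward the neighbors' centroid
            new_vel[i] += k_coh * (pos[nbrs].mean(axis=0) - pos[i])
            # 3) velocity matching: align with the neighbors' mean velocity
            new_vel[i] += k_ali * (vel[nbrs].mean(axis=0) - vel[i])
        if close.any():
            # 2) separation (a stand-in for obstacle/collision avoidance)
            new_vel[i] += k_sep * (pos[i] - pos[close].mean(axis=0))
    return pos + new_vel, new_vel  # unit time step assumed
```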
“…The final goal is for the agent to move from the starting point to the target point. I represented the coordinates of the agent as a state using an effective coordinate vector [33]. The actions of the agent were set as simple movements: left, right, up, and down.…”
Section: Environment
confidence: 99%
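The environment this excerpt describes is a gridworld: the state is the agent's coordinate vector and the four actions are left, right, up, and down. A minimal sketch under those assumptions follows; the grid size and reward shaping are not specified in the cited text.

```python
import numpy as np

class GridEnv:
    """Gridworld matching the excerpt: coordinate-vector state, four move
    actions. Grid size and reward values are assumptions."""
    ACTIONS = {0: (-1, 0), 1: (1, 0), 2: (0, 1), 3: (0, -1)}  # left, right, up, down

    def __init__(self, size=10, start=(0, 0), goal=(9, 9)):
        self.size, self.start, self.goal = size, start, goal
        self.pos = start

    def reset(self):
        self.pos = self.start
        return np.array(self.pos, dtype=np.float32)  # coordinate vector state

    def step(self, action):
        dx, dy = self.ACTIONS[action]
        x = min(max(self.pos[0] + dx, 0), self.size - 1)  # clamp to the grid
        y = min(max(self.pos[1] + dy, 0), self.size - 1)
        self.pos = (x, y)
        done = self.pos == self.goal
        reward = 1.0 if done else -0.01  # assumed: step penalty, goal bonus
        return np.array(self.pos, dtype=np.float32), reward, done
```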
“…Combining two optimization methods has also been studied [39]. Recently, with the development of deep learning, studies on path planning using RL have mainly been proposed [3], [6], [7], [9], [10], [11], [14], [15], [16], [17], [40], [41], [42]. These studies suppose a specific scenario and set up an environment in which to apply the agent to path planning.…”
Section: Path Planning
confidence: 99%
“…Path planning is a method to find an optimal route from a starting point to a target point. It has been widely used in various fields such as robotics [1], [2], [3], drones [4], [5], [6], [7], [8], [9], military services [10], [11], and self-driving cars [12], [13]. Recently, reinforcement learning (RL) has been the main approach studied for path planning [3], [7], [9], [10], [11], [14], [15], [16], [17].…”
Section: Introduction
confidence: 99%
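To make "RL for path planning" concrete, here is a tabular Q-learning sketch that learns a route on the GridEnv sketched earlier. This is a generic baseline, not the method of any cited paper; the hyperparameters and step cap are illustrative.

```python
import numpy as np

def q_learning(env, episodes=500, alpha=0.1, gamma=0.99, eps=0.2):
    """Tabular Q-learning on GridEnv; hyperparameters are illustrative."""
    Q = np.zeros((env.size, env.size, 4))
    for _ in range(episodes):
        s = tuple(env.reset().astype(int))
        for _ in range(1000):  # step cap so untrained episodes still end
            # epsilon-greedy action selection
            a = np.random.randint(4) if np.random.rand() < eps else int(np.argmax(Q[s]))
            obs, r, done = env.step(a)
            s2 = tuple(obs.astype(int))
            # TD(0) update toward r + gamma * max_a' Q(s', a')
            target = r + (0.0 if done else gamma * Q[s2].max())
            Q[s][a] += alpha * (target - Q[s][a])
            s = s2
            if done:
                break
    return Q

# Train on the assumed gridworld; a greedy argmax rollout then gives the route
Q = q_learning(GridEnv())
```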