2021
DOI: 10.1109/tits.2019.2954952
Memory-Based Deep Reinforcement Learning for Obstacle Avoidance in UAV With Limited Environment Knowledge

Abstract: This paper presents our method for enabling a UAV quadrotor, equipped with a monocular camera, to autonomously avoid collisions with obstacles in unstructured and unknown indoor environments. Compared to obstacle avoidance in ground vehicular robots, UAV navigation brings additional challenges because the UAV's motion is no longer constrained to a well-defined indoor ground or street environment. Horizontal structures in indoor and outdoor environments like decorative items, furnishings, ceiling fans, sign…

Cited by 171 publications (116 citation statements)
References 31 publications
“…In 2013, DeepMind innovatively combined deep learning (DL) with RL to form a new hotspot in the field of artificial intelligence known as DRL [20]. By leveraging the decision-making capabilities of RL and the perception capabilities of DL, DRL has proven efficient at controlling UAVs [21][22][23][24][25][26][27][28][29][30][31]. Zhu [21] proposed a framework for target-driven visual navigation; this framework addressed some of the limitations that prevent DRL algorithms from being applied to realistic settings.…”
Section: Related Work
confidence: 99%
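For readers unfamiliar with the DQN family these works build on, the core mechanic is regressing a neural Q-function toward a bootstrapped temporal-difference target. Below is a minimal PyTorch sketch of one such update; the network sizes, optimizer, hyperparameters, and placeholder data are illustrative assumptions, not details taken from the cited papers.

```python
import torch
import torch.nn as nn

# Minimal DQN-style update (illustrative; sizes and hyperparameters are assumptions).
q_net = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 2))       # 4-dim state, 2 actions
target_net = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 2))  # periodically synced copy
target_net.load_state_dict(q_net.state_dict())
optimizer = torch.optim.Adam(q_net.parameters(), lr=1e-3)
gamma = 0.99

def dqn_update(state, action, reward, next_state, done):
    """One gradient step toward the TD target r + gamma * max_a' Q_target(s', a')."""
    q_value = q_net(state).gather(1, action.unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        next_q = target_net(next_state).max(dim=1).values
        target = reward + gamma * (1.0 - done) * next_q   # done masks the bootstrap term
    loss = nn.functional.smooth_l1_loss(q_value, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

# Example with a batch of 8 random placeholder transitions:
s, a = torch.randn(8, 4), torch.randint(0, 2, (8,))
r, s2, d = torch.randn(8), torch.randn(8, 4), torch.zeros(8)
dqn_update(s, a, r, s2, d)
```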
“…Kersandt [27] applied DQN, Double DQN, and Dueling DQN [33] to the same UAV control mission and compared these methods. Singla [28] designed a deep recurrent Q-network [34] with temporal attention that exhibited significant improvements over DQN and D3QN [32] for UAV motion planning in a cluttered and unseen environment. For the autonomous UAV landing task, Polvara [29] introduced a sequential DQN that is comparable to DQN and human pilots while being quantitatively better in noisy conditions.…”
Section: Related Work
confidence: 99%
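To make the architectural contrast concrete, here is a hedged PyTorch sketch of a deep recurrent Q-network with a simple temporal-attention readout over LSTM outputs, in the spirit of the design attributed to [28] and [34]; the per-frame encoder, layer sizes, and the exact attention form are assumptions for illustration, not the published architecture.

```python
import torch
import torch.nn as nn

class RecurrentQNet(nn.Module):
    """Sketch of a DRQN with temporal attention: a CNN encodes each frame,
    an LSTM runs over the frame sequence, and an attention-weighted summary
    of the hidden states feeds the Q-value head. Sizes are assumptions."""
    def __init__(self, n_actions: int, feat_dim: int = 256, hidden: int = 128):
        super().__init__()
        self.encoder = nn.Sequential(                 # per-frame encoder, 84x84 grayscale input
            nn.Conv2d(1, 16, 8, stride=4), nn.ReLU(),
            nn.Conv2d(16, 32, 4, stride=2), nn.ReLU(),
            nn.Flatten(), nn.Linear(32 * 9 * 9, feat_dim), nn.ReLU(),
        )
        self.lstm = nn.LSTM(feat_dim, hidden, batch_first=True)
        self.attn = nn.Linear(hidden, 1)              # scores each time step
        self.q_head = nn.Linear(hidden, n_actions)

    def forward(self, frames):                        # frames: (B, T, 1, 84, 84)
        b, t = frames.shape[:2]
        feats = self.encoder(frames.flatten(0, 1)).view(b, t, -1)
        h, _ = self.lstm(feats)                       # (B, T, hidden)
        w = torch.softmax(self.attn(h), dim=1)        # temporal attention weights
        ctx = (w * h).sum(dim=1)                      # attention-weighted summary
        return self.q_head(ctx)                       # (B, n_actions)

q_values = RecurrentQNet(n_actions=3)(torch.zeros(2, 5, 1, 84, 84))  # -> shape (2, 3)
```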
“…These DL approaches usually include a module for situational awareness that generates a set of feature maps related to the state of the robotic system and its surroundings; these computed feature maps then feed a second module for the decision-making process. Therefore, the combination of these two modules makes up a complex network that takes raw sensor data as input and generates the motion control commands for the robotic system [48,[51][52][53][54][55].…”
Section: Deep Learning In the Context Of Autonomous Collision Avoidance
confidence: 99%
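A minimal sketch of that two-module pattern, with a perception module producing feature maps from a raw sensor image and a decision module mapping them to motion-command scores, may help; the architecture below is an illustrative assumption, not one taken from the cited works.

```python
import torch
import torch.nn as nn

class PerceptionToControl(nn.Module):
    """Illustrative two-module network: raw sensor image -> feature maps
    (situational awareness) -> motion-command scores (decision-making)."""
    def __init__(self, n_commands: int = 4):
        super().__init__()
        self.perception = nn.Sequential(       # feature-map extractor
            nn.Conv2d(3, 32, 5, stride=2), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2), nn.ReLU(),
        )
        self.decision = nn.Sequential(         # pools the maps and scores the commands
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, 64), nn.ReLU(),
            nn.Linear(64, n_commands),
        )

    def forward(self, image):                  # image: (B, 3, H, W)
        return self.decision(self.perception(image))

logits = PerceptionToControl()(torch.zeros(1, 3, 120, 160))  # -> shape (1, 4)
```

The adaptive pooling between the two modules keeps the decision head independent of the input resolution, which is one simple way to keep the perception and decision stages loosely coupled.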
“…Thus, given a specific state and based on previous experience, the agent can infer which action maximizes a predefined goal. Several approaches use RL methods to learn effective collision avoidance policies, which requires experience of successful trajectories as well as of undesirable events like collisions [51,53,54,[57][58][59]. Using simulated environments allows for collecting a large amount of such data in an easy way.…”
Section: Deep Learning In the Context Of Autonomous Collision Avoidance
confidence: 99%
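As a concrete illustration of the kind of reward signal such policies are trained on, rewarding progress along successful trajectories while penalizing proximity and collisions, here is a small sketch; all terms and coefficients are assumptions for illustration, not values from the cited papers.

```python
def collision_avoidance_reward(progress: float, min_obstacle_dist: float,
                               collided: bool) -> float:
    """Illustrative reward: positive for forward progress, a shaped penalty
    for flying close to obstacles, and a large terminal penalty on collision.
    All coefficients are assumptions for the sketch."""
    if collided:
        return -10.0                      # terminal penalty for the undesirable event
    reward = 1.0 * progress               # encourage progress toward the goal
    if min_obstacle_dist < 1.0:           # proximity shaping inside an assumed 1 m margin
        reward -= 0.5 * (1.0 - min_obstacle_dist)
    return reward

# Example: a step that makes 0.3 m of progress while 0.6 m from the nearest obstacle.
r = collision_avoidance_reward(progress=0.3, min_obstacle_dist=0.6, collided=False)
```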