2020 Latin American Robotics Symposium (LARS), 2020 Brazilian Symposium on Robotics (SBR) and 2020 Workshop on Robotics in Educ 2020
DOI: 10.1109/lars/sbr/wre51543.2020.9307015
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reinforcement Learning for Mapless Navigation of Unmanned Aerial Vehicles

Abstract: This paper presents a novel deep reinforcement learning-based system for 3D mapless navigation for Unmanned Aerial Vehicles (UAVs). Instead of using a image-based sensing approach, we propose a simple learning system that uses only a few sparse range data from a distance sensor to train a learning agent. We based our approaches on two state-of-art double critic Deep-RL models: Twin Delayed Deep Deterministic Policy Gradient (TD3) and Soft Actor-Critic (SAC). We show that our two approaches manage to outperform… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 16 publications
(11 citation statements)
references
References 28 publications
0
10
0
1
Order By: Relevance
“…The study around mapless navigation is extensively explored using terrestrial mobile robots [3]. Autonomous navigation of aerial mobile robots using Deep-RL approaches is less frequent and mainly focused on approaches that avoid the use of visual information [6], [10] or using simplified information, without contrastive learning [7]- [9], [13].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…The study around mapless navigation is extensively explored using terrestrial mobile robots [3]. Autonomous navigation of aerial mobile robots using Deep-RL approaches is less frequent and mainly focused on approaches that avoid the use of visual information [6], [10] or using simplified information, without contrastive learning [7]- [9], [13].…”
Section: Related Workmentioning
confidence: 99%
“…These depth images are stored in a replay memory, and it is sampled memories in the buffer to make the Contrastive and Reinforcement Learning. methods are capable of providing an action response for continuous systems [4], [5], as well as the Deep-RL control of UAVs in simulated environments [6]. It is possible to see the application of these techniques for the mapless navigation of UAVs in models of unknown dynamics in many environments [6]- [10].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“… Havenstrøm et al (2021) applies a curriculum learning technique with the PPO algorithm to control a 6-DOF underactuated autonomous underwater vehicle (AUV), gradually increasing the presence and severity of obstacles and disturbances during the RL training process. Grando et al (2021) develops and compares two approaches based on the Deep Deterministic Policy Gradient (DDPG) and Soft Actor-Critic (SAC) RL algorithms, respectively, to navigate a simulated quadrotor drone to a target position in 3D, including air-water medium transitions. Overall, it is thus safe to say that the existing works in the available scientific literature prove the potential of RL in path following and collision avoidance with both stationary and moving obstacles.…”
Section: Introductionmentioning
confidence: 99%
“…Até o momento, o presente trabalho possui duas contribuic ¸ões científicas. A primeira foi realizada no Simpósio Latino Americano de Robótica 2020 (IEEE LARS 2020) [Grando et al 2020], trabalho que envolveu a parte de navegac ¸ão aérea 2D e que ficou entre os 10 melhores do evento. Outro artigo científico foi publicado na Conferência Internacional de Robótica e Automac ¸ão (IEEE ICRA 2021, Qualis A1) [Grando et al 2021…”
Section: Introduc ¸ãOunclassified