Adaptive Genomic Evolution of Neural Network Topologies (AGENT) for State-to-Action Mapping in Autonomous Agents

Behjat, Amir; Chidambaran, Sharat; Chowdhury, Souma

doi:10.1109/icra.2019.8793613

Cited by 16 publications

(12 citation statements)

References 28 publications

“…Based on our observations in [16], using smaller reaction time leads to a more controllable maneuver; these maneuvers are not necessarily energy optimal. Since the focus of the current paper is not about energy optimality but rather the detection quality which might be more useful in practice, it is possible to fix the reaction time to the smallest practical value.…”

Section: A Optimization Problem Definition (mentioning)

confidence: 93%

“…[22,23] Further description of our neuroevolution method called AGENT can be found in [16]. Neuroevolution uses an evolutionary algorithm to optimize the topology and weights of a neural network, and is typically used for solving problems that can be posed as reinforcement learning [24]; however unlike RL, neuroevolution is significantly more amenable to parallel deployment and escaping local minima, both crucial to expensive offline learning investments.…”

Section: B Neuroevolution Process (mentioning)

confidence: 99%
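
The statement above describes neuroevolution as an evolutionary algorithm that optimizes both the topology and the weights of a neural network. A minimal sketch of that idea follows. It is a toy, not the AGENT method: the XOR task, the single hidden layer, the mutation rates, and every function name (`new_hidden`, `forward`, `fitness`, `mutate`, `evolve`) are assumptions made for illustration only.

```python
import math
import random

# XOR as a stand-in task (an assumption, not from the cited papers).
CASES = [((0.0, 0.0), 0.0), ((0.0, 1.0), 1.0),
         ((1.0, 0.0), 1.0), ((1.0, 1.0), 0.0)]


def new_hidden(rng):
    # One hidden neuron: two input weights, a bias, and an output weight.
    return [rng.uniform(-1.0, 1.0) for _ in range(4)]


def forward(genome, x):
    # A genome is a list of hidden neurons; the output is a weighted sum of tanh units.
    return sum(w_out * math.tanh(w1 * x[0] + w2 * x[1] + b)
               for w1, w2, b, w_out in genome)


def fitness(genome):
    # Negative squared error, so higher is better.
    return -sum((forward(genome, x) - y) ** 2 for x, y in CASES)


def mutate(genome, rng, p_add=0.1, p_remove=0.05, sigma=0.3):
    child = [neuron[:] for neuron in genome]
    r = rng.random()
    if r < p_add:
        child.append(new_hidden(rng))           # topology mutation: add a neuron
    elif r < p_add + p_remove and len(child) > 1:
        child.pop(rng.randrange(len(child)))    # topology mutation: remove a neuron
    for neuron in child:                        # weight mutation: Gaussian noise
        for i in range(len(neuron)):
            neuron[i] += rng.gauss(0.0, sigma)
    return child


def evolve(generations=60, pop_size=30, elite=5, seed=0):
    rng = random.Random(seed)
    population = [[new_hidden(rng)] for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        parents = population[:elite]
        # Elites survive unchanged; the rest are mutated copies of elites.
        offspring = [mutate(rng.choice(parents), rng)
                     for _ in range(pop_size - elite)]
        population = parents + offspring
    best = max(population, key=fitness)
    history.append(fitness(best))
    return best, history


best, history = evolve()
```

Because each candidate is scored independently, the `fitness` calls within a generation could be evaluated in parallel, which is the property the quoted statement highlights. Elitism keeps the best score from ever decreasing between generations.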

Training Detection-Range-Frugal Cooperative Collision Avoidance Models for Quadcopters via Neuroevolution

Behjat, Gabani, Chowdhury. 2019. AIAA Aviation 2019 Forum.

Self Cite

Cooperative autonomous approaches to avoiding collisions among small Unmanned Aerial Vehicles (UAVs) are central to safe integration of UAVs within the civilian airspace. One potential online cooperative approach is the concept of reciprocal actions, where both UAVs take pre-trained mutually coherent actions that do not require active online coordination (thereby avoiding the computational burden and risk associated with it). This paper presents a learning based approach to train such reciprocal maneuvers. Neuroevolution, which uses evolutionary algorithms to simultaneously optimize the topology and weights of neural networks, is used as the learning method, which operates over a set of sample approach scenarios. Unlike most existing work (that minimizes travel distance, energy or risk), the training process here focuses on the objective of minimizing the required detection range; this has important practical implications w.r.t. alleviating the dependency on sophisticated sensing and their reliability under various environments. A specialized design of experiments and line search is used to identify the minimum detection range for each sample scenario. In order to allow an efficient training process, a classifier is used to discard actions (without simulating them) where the controller would fail. The model obtained via neuroevolution is observed to generalize well to (i.e., successful collision avoidance over) unseen approach scenarios.
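
The abstract mentions a line search that identifies the minimum detection range for each sample scenario. The sketch below shows one plain way such a search could look, using bisection. It is not the paper's specialized procedure: the function name `min_detection_range`, the predicate `avoids_collision`, the bracket values, and the 7.3 m threshold of the toy scenario are all hypothetical, and the search assumes that success is monotonic in the detection range.

```python
from typing import Callable


def min_detection_range(avoids_collision: Callable[[float], bool],
                        lo: float, hi: float, tol: float = 0.01) -> float:
    """Return a range within tol of the smallest successful one in [lo, hi].

    Assumes monotonicity: if a detection range works, every larger one works too.
    """
    if not avoids_collision(hi):
        raise ValueError("no successful detection range within the bracket")
    if avoids_collision(lo):
        return lo
    # Invariant: the maneuver fails at lo and succeeds at hi.
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if avoids_collision(mid):
            hi = mid
        else:
            lo = mid
    return hi


def toy_scenario(detection_range: float) -> bool:
    # Hypothetical stand-in for a simulated encounter:
    # the maneuver succeeds from 7.3 m upward.
    return detection_range >= 7.3


result = min_detection_range(toy_scenario, lo=0.0, hi=50.0)
```

In a real training loop the predicate would wrap an expensive simulation, so halving the bracket at every step keeps the number of simulated encounters per scenario small.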

“…[78] The adaptive differential multi-objective optimization algorithm was used to find the optimal solution to avoid obstacles. [79] Solved the problem of premature stagnation in genetic algorithm. [80] Applied genetic algorithm and evolutionary robot to evolve neural network controller.…”

Section: Category (mentioning)

confidence: 99%

“…[78] defined the path planing problem of UAV as a multi-objective optimization problem, and proposed a new multi-gene structure to describe the path, in which the adaptive adjustment, crossover and mutation strategies were adopted, and the adaptive differential multi-objective optimization algorithm was applied to obtain the optimal solution to avoid obstacles and meet the flight restrictions of UAV. [79] used the minimum spanning tree and adaptive tournament selection to quantify and control the genetic diversity, which solved the problem of premature stagnation. For the obstacle avoidance problem in multi-UAV scenario, [80] used genetic algorithm and evolutionary robot to evolve neural network controller, solving the obstacle avoidance problem of multi-UAV.…”

Section: Category (mentioning)

confidence: 99%

Anti-collision Technologies for Unmanned Aerial Vehicles: Recent Advances and Future Trends

Wei, Meng, Lai, et al. 2021. Preprint.

Unmanned aerial vehicles (UAVs) are widely applied in civil applications, such as disaster relief, agriculture and cargo transportation. With the massive number of UAV flight activities, the anti-collision technologies aiming to avoid the collisions between UAVs and other objects have attracted much attention. The anti-collision technologies are of vital importance to guarantee the survivability and safety of UAVs. In this article, a comprehensive survey on UAV anti-collision technologies is presented. We first introduce laws and regulations on UAV safety which prevent collision at the policy level. Then, the process of anti-collision technologies is reviewed from three aspects, i.e., obstacle sensing, collision prediction, and collision avoidance. We provide detailed survey and comparison of the methods of each aspect and analyze their pros and cons. Besides, the future trends on UAV anti-collision technologies are presented from the perspective of fast obstacle sensing and fast wireless networking. Finally, we summarize this article.

“…Various ANN fitting (learning) methods have seen demonstrations on robotics and control applications. Popular learning methods include Reinforcement Learning [16], [17], Supervised Learning [18], Imitation Learning [19], Neuroevolution [20], [21], etc., among which the advanced reinforcement learning [22] and neuroevolution [23] methods are directly applicable to launching and wrapping control of tether-net systems. These aforementioned machine learning methods bring capabilities of adapting to system uncertainties, and selecting optimal actions (policies) according to various debris characteristics.…”

Section: Introduction (mentioning)

confidence: 99%

Learning Robust Policies for Generalized Debris Capture with an Automated Tether-Net System

Zeng, Hecht, KrisshnaKumar, et al. 2022. Preprint.

Self Cite

A tether-net launched from a chaser spacecraft provides a promising method to capture and dispose of large space debris in orbit. This tether-net system is subject to several sources of uncertainty in sensing and actuation that affect the performance of its net launch and closing control. Earlier reliability based optimization approaches to design control actions, however, remain challenging and computationally prohibitive to generalize over varying launch scenarios and target (debris) state relative to chaser. To search for a general and reliable control policy, this paper presents a reinforcement learning framework that integrates a proximal policy optimization (PPO2) approach with net dynamics simulations. The latter allows evaluating the episodes of net-based target capture, and estimating the capture quality index that serves as the reward feedback to PPO2. Here, the learnt policy is designed to model the timing of the net closing action based on the state of the moving net and the target, under any given launch scenario. A stochastic state transition model is considered in order to incorporate synthetic uncertainties in state estimation and launch actuation. Along with notable reward improvement during training, the trained policy demonstrates capture performance (over a wide range of launch/target scenarios) that is close to that obtained with reliability based optimization run over an individual scenario.
