2020
DOI: 10.48550/arxiv.2003.01157
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Reinforcement co-Learning of Deep and Spiking Neural Networks for Energy-Efficient Mapless Navigation with Neuromorphic Hardware

Abstract: Energy-efficient mapless navigation is crucial for mobile robots as they explore unknown environments with limited on-board resources. Although the recent deep reinforcement learning (DRL) approaches have been successfully applied to navigation, their high energy consumption limits their use in many robotic applications. Here, we propose a neuromorphic approach that combines the energy-efficiency of spiking neural networks with the optimality of DRL to learn control policies for mapless navigation. Our hybrid … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 23 publications
0
4
0
Order By: Relevance
“…SDDPG [91] (2020) PyTroch/-Spiking deep deterministic policy gradient (SDDPG), which consists of a spiking actor network and a deep critic network that were trained jointly using gradient descent for energy-efficient mapless navigation.…”
Section: Ann-to-snn Conversionmentioning
confidence: 99%
See 1 more Smart Citation
“…SDDPG [91] (2020) PyTroch/-Spiking deep deterministic policy gradient (SDDPG), which consists of a spiking actor network and a deep critic network that were trained jointly using gradient descent for energy-efficient mapless navigation.…”
Section: Ann-to-snn Conversionmentioning
confidence: 99%
“…In this work, reference [ 90 ] shows that SNN may robustly control an autonomous robot in mapping and exploring an unknown environment, while compensating for its own intrinsic hardware imperfections, such as partial or total loss of visual input. Reference [ 91 ] proposed a variant of deep deterministic policy gradient (DDPG), called spiking deep deterministic policy gradient (SDDPG), which consists of a spiking actor network and a deep critic network that were trained jointly using gradient descent for energy-efficient mapless navigation. This work explored an indirect SNN training approach based on the reward-modulated spike-timing-dependent plasticity (R-STDP) learning rule and supervised learning framework.…”
Section: Snns In Robotic Controlmentioning
confidence: 99%
“…The interaction between Loihi and the simulation environments is realized using the framework proposed in [60] to allow for real-time control of the robot arms (figure 4). For this, we use a Read channel controlled by the low-frequency x86 Loihi cores (SNIP) to read out the spiking activity of the motor neurons and translate it to joint movements.…”
Section: Controller-robot Interactionmentioning
confidence: 99%
“…Event camera based perception was used in other applications as well, such as self-supervised learning of optical flow [28], steering prediction for self driving cars [3]. Spiking neural networks were also used to examine eventbased data [29,30,31,32,33,34].…”
Section: Event Cameras and Machine Learningmentioning
confidence: 99%