2020
DOI: 10.1109/access.2020.3045027
|View full text |Cite
|
Sign up to set email alerts
|

Coverage Path Planning for Decomposition Reconfigurable Grid-Maps Using Deep Reinforcement Learning Based Travelling Salesman Problem

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
38
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 56 publications
(38 citation statements)
references
References 33 publications
0
38
0
Order By: Relevance
“…Specifically, we employed the actor-critic methods [34] to learn approximations to both the policy and value functions of the RL problem. Two neural networks were utilized to represent the actor and critic networks, similarly to the work of [30]. Both networks employed the pointer network architecture [35], consisting of a pair of RNNs (encoders and decoders), each containing long short-term memory (LSTM) layers [36] to parameterize the trained policy and value model.…”
Section: Optimization With Reinforcement Learningmentioning
confidence: 99%
See 2 more Smart Citations
“…Specifically, we employed the actor-critic methods [34] to learn approximations to both the policy and value functions of the RL problem. Two neural networks were utilized to represent the actor and critic networks, similarly to the work of [30]. Both networks employed the pointer network architecture [35], consisting of a pair of RNNs (encoders and decoders), each containing long short-term memory (LSTM) layers [36] to parameterize the trained policy and value model.…”
Section: Optimization With Reinforcement Learningmentioning
confidence: 99%
“…Both networks employed the pointer network architecture [35], consisting of a pair of RNNs (encoders and decoders), each containing long short-term memory (LSTM) layers [36] to parameterize the trained policy and value model. For further details on the neural network architecture, we refer to the works of [30,35].…”
Section: Optimization With Reinforcement Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…The traveling salesman problem (TSP) is one of the bestknown combinatorial optimization problems and is often considered in autonomous vehicle route planning [11,19,31,48,50,65,80]. In a TSP, the sequence of autonomous agent movements should optimize a route between a set of nodes [3,16,32,33,55].…”
Section: Introductionmentioning
confidence: 99%
“…To deal with the CCPP problem in a large complex environment, Kyaw et al. formulated it using TSP and deep reinforcement learning (DRL) [ 20 , 21 ]. The recurrent neural network (RNN) was trained using RL and combined with the cellular decomposition method to iteratively generate the coverage path.…”
Section: Introductionmentioning
confidence: 99%