2020
DOI: 10.3390/app10020575
|View full text |Cite
|
Sign up to set email alerts
|

Motion Planning of Robot Manipulators for a Smoother Path Using a Twin Delayed Deep Deterministic Policy Gradient with Hindsight Experience Replay

Abstract: In order to enhance performance of robot systems in the manufacturing industry, it is essential to develop motion and task planning algorithms. Especially, it is important for the motion plan to be generated automatically in order to deal with various working environments. Although PRM (Probabilistic Roadmap) provides feasible paths when the starting and goal positions of a robot manipulator are given, the path might not be smooth enough, which can lead to inefficient performance of the robot system. This pape… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
28
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
10

Relationship

1
9

Authors

Journals

citations
Cited by 69 publications
(29 citation statements)
references
References 22 publications
0
28
0
Order By: Relevance
“…The performance of the proposed SAC-based path planning is validated not only simulation but also experiment using two real open manipulators. The results show that the proposed method finds a shorter and smoother path for most scenarios due to enhanced exploration performance by SAC, and outperforms over the existing results such as PRM [ 29 ] and TD3 (Twin Delayed Deep Deterministic Policy Gradient)-based path planning [ 30 ].…”
Section: Introductionmentioning
confidence: 89%
“…The performance of the proposed SAC-based path planning is validated not only simulation but also experiment using two real open manipulators. The results show that the proposed method finds a shorter and smoother path for most scenarios due to enhanced exploration performance by SAC, and outperforms over the existing results such as PRM [ 29 ] and TD3 (Twin Delayed Deep Deterministic Policy Gradient)-based path planning [ 30 ].…”
Section: Introductionmentioning
confidence: 89%
“…In future research, to successfully reach the target point, the motion planning algorithm must be improved via reinforcement learning. For the robots to execute reinforcement learning based on earned rewards from several repeated several trials and errors, the manipulator should be able to determine the optimal trajectory by itself [30]. Also the manipulators decide the order to target fruit according to produced optimal trajectory based on reinforcement learning.…”
Section: B Reinforcement Learning-based Path Planningmentioning
confidence: 99%
“…The achievements of the research on multi-agent dynamic task allocation [30][31][32] are mainly based on heuristic intelligent algorithms. Intelligent algorithms mainly use environmental learning or heuristic search, such as A* algorithms [33], evolutionary algorithms [34][35][36], and neural network-based methods, etc.…”
Section: Algorithms For Task Allocationmentioning
confidence: 99%