2020
DOI: 10.3390/s20205911
|View full text |Cite
|
Sign up to set email alerts
|

Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay

Abstract: Since path planning for multi-arm manipulators is a complicated high-dimensional problem, effective and fast path generation is not easy for the arbitrarily given start and goal locations of the end effector. Especially, when it comes to deep reinforcement learning-based path planning, high-dimensionality makes it difficult for existing reinforcement learning-based methods to have efficient exploration which is crucial for successful training. The recently proposed soft actor–critic (SAC) is well known to have… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
37
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 66 publications
(56 citation statements)
references
References 34 publications
0
37
0
Order By: Relevance
“…The performances of the proposed approach were compared with those of two ML-based methods [ 26 , 27 ], the development of which was driven by the requirements of human–robot collaboration, an efficient model predictive control (MPC)-based planning algorithm for 6-DOF manipulators with dynamic obstacle avoidance [ 31 ], and the popular Rapidly-exploring Random Trees (RRT) Connect [ 57 ] algorithm, which is integrated in the Robotics Library [ 58 ] and other open-source motion planning libraries. The first ML-based method, called Soft Actor-Critic with Hindsight Experience Replay (SAC–HER) [ 26 ], builds on reinforcement learning to plan motion paths for a dual-arm robot, with each arm having 3-DOF. The planning occurs jointly for both robots, as if they were part of the same entity.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…The performances of the proposed approach were compared with those of two ML-based methods [ 26 , 27 ], the development of which was driven by the requirements of human–robot collaboration, an efficient model predictive control (MPC)-based planning algorithm for 6-DOF manipulators with dynamic obstacle avoidance [ 31 ], and the popular Rapidly-exploring Random Trees (RRT) Connect [ 57 ] algorithm, which is integrated in the Robotics Library [ 58 ] and other open-source motion planning libraries. The first ML-based method, called Soft Actor-Critic with Hindsight Experience Replay (SAC–HER) [ 26 ], builds on reinforcement learning to plan motion paths for a dual-arm robot, with each arm having 3-DOF. The planning occurs jointly for both robots, as if they were part of the same entity.…”
Section: Discussionmentioning
confidence: 99%
“…To create a baseline for comparison, the experimental results from references [ 26 , 27 , 31 ], which provide information about the performances of the respective planning methods, were drawn upon. For the RRT Connect planner, an experiment in the Robotics Library [ 58 ] simulation environment was conducted.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Many strategies and algorithms of path planning for a manipulator have been proposed in the literature. These works mainly focused on the two aspects: reactive planning and map-based planning [ 19 , 20 ]. For reactive planning, the robot has a perception system that allows it to know the environment in which it performs its task, and its main application is for environments with dynamic obstacles.…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, SAC has been widely used in autonomous decision-making, intelligent planning, and motion control of mobile robots, UAVs, and manipulators. Prianto et al [ 22 ] presented a deep reinforcement learning-based path planning algorithm for the multi-arm manipulator. To solve the problem of high-dimensional path planning, SAC is used to enhance the exploration performance of the robotic arm.…”
Section: Introductionmentioning
confidence: 99%