2019
DOI: 10.1108/ir-10-2018-0209
|View full text |Cite
|
Sign up to set email alerts
|

Position/force control of robot manipulators using reinforcement learning

Abstract: Purpose The position/force control of the robot needs the parameters of the impedance model and generates the desired position from the contact force in the environment. When the environment is unknown, learning algorithms are needed to estimate both the desired force and the parameters of the impedance model. Design/methodology/approach In this paper, the authors use reinforcement learning to learn only the desired force, then they use proportional-integral-derivative admittance control to generate the desi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
25
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
4

Relationship

2
7

Authors

Journals

citations
Cited by 54 publications
(25 citation statements)
references
References 24 publications
(28 reference statements)
0
25
0
Order By: Relevance
“…To improve the controller, prior data information can be used as the initial condition for the value function to avoid scratch learning. The use of a long short‐ term memory like eligibility traces can accelerate the convergence. Optimization procedures can be used to find the optimal number of k ‐nearest neighbors.…”
Section: Simulation and Experimentsmentioning
confidence: 99%
“…To improve the controller, prior data information can be used as the initial condition for the value function to avoid scratch learning. The use of a long short‐ term memory like eligibility traces can accelerate the convergence. Optimization procedures can be used to find the optimal number of k ‐nearest neighbors.…”
Section: Simulation and Experimentsmentioning
confidence: 99%
“…However, these controllers are avoided because they lose controllability at singularity points [20,21], that is, the Jacobian loses rank. Furthermore, the contact between the slider and the workpiece generates an external force [22] which must be taken into account in the controller design [23,24] by means of the virtual work principle.…”
Section: Introductionmentioning
confidence: 99%
“…Traditional manipulators are mostly used in industrial production, working in a structured environment on a given trajectory to complete specific tasks. To ensure excellent tracking performance and safety, manipulators generally adopt the design scheme of rigid mechanical structure and active compliance control such as position/force [1,2] or impedance/admittance control [3,4]. However, rigid manipulators based on active compliance control still present stiffness [5] when suffering sudden impacts or handling contact transients due to the control delay, which can cause damage to itself or surrounding environments.…”
Section: Introductionmentioning
confidence: 99%