2015
DOI: 10.1109/tnnls.2014.2378812
|View full text |Cite
|
Sign up to set email alerts
|

Optimal Critic Learning for Robot Control in Time-Varying Environments

Abstract: Abstract-In this paper, optimal critic learning is developed for robot control in a time-varying environment. The unknown environment is described as a linear system with time-varying parameters, and impedance control is employed for the interaction control. Desired impedance parameters are obtained in the sense of an optimal realization of the composite of trajectory tracking and force regulation. Q-function based critic learning is developed to determine the optimal impedance parameters without the knowledge… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0
1

Year Published

2015
2015
2021
2021

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 38 publications
(16 citation statements)
references
References 39 publications
0
15
0
1
Order By: Relevance
“…In the model, the contact parameters relate the end-effector position x to the interaction force f e at each contact effector, C e and K e are unknown timevarying damping and stiffness matrices of the dynamics, respectively. Introducing an environment model proposed (Wang et al, 2015), we define that k describes the time-step index, the unknown time-varying environment dynamics in discrete time is given as follows:…”
Section: Environment Modelmentioning
confidence: 99%
See 3 more Smart Citations
“…In the model, the contact parameters relate the end-effector position x to the interaction force f e at each contact effector, C e and K e are unknown timevarying damping and stiffness matrices of the dynamics, respectively. Introducing an environment model proposed (Wang et al, 2015), we define that k describes the time-step index, the unknown time-varying environment dynamics in discrete time is given as follows:…”
Section: Environment Modelmentioning
confidence: 99%
“…Consider our previous results in Wang et al (2015) and the cost-to-go function in (17), a Q-function with quadratic form is introduced as follows:…”
Section: Q-function Constructionmentioning
confidence: 99%
See 2 more Smart Citations
“…When the environment is unknown, impedance control does not work well for interaction performance (Wang et al , 2015; Wen and Murphy, 1991; Wilfinger et al , 1994), and the force control is not robust to generate the desired strength with respect to uncertainties in the environment (Singh and Popa, 1995; Perrusquía et al , 2017). There are several methods to estimate the dynamics of the environment (Kang et al , 2009), such as adaptive identification (Ficuciello et al , 2015), recursive least squares (RLS) (Kelly et al , 1989), neural networks (Yamamoto et al , 2008) and the adaptive impedance algorithm (Lu and Meng, 1991).…”
Section: Introductionmentioning
confidence: 99%