Robust control under worst‐case uncertainty for unknown nonlinear systems using modified reinforcement learning

Perrusquía, Adolfo; Yu, Wen

doi:10.1002/rnc.4911

Cited by 33 publications

(32 citation statements)

References 47 publications


“…The cognitive set C gives numerous models and skills that are useful to get the optimal decision making (13). Human behavior is supported by many intelligent techniques such as: reinforcement learning [8], [32], deep learning [17], [26], machine learning techniques [21], function approximators [5], [8], and so on; with the aim of providing an ability to learn by interacting with the actions (x_t, u_t) to achieve the control task.…”

Section: Learning Human-behavior (mentioning)

confidence: 99%

“…The neural cognitive model (21) gives an approximate solution of (11) such that the optimal decision making control (13) is obtained. To achieve this goal, the approximation needs an exploration term which, in this case, is given by a PE exciting condition.…”

Section: Learning Human-behavior (mentioning)

confidence: 99%

“…This exploration is satisfied if q_t fulfills a persistently exciting (PE) condition [31]. So, the PE skill is within the neural approximator (21), that is, q_t = q_t(x_t, u_{2,t}; PE).…”

Section: Learning Human-behavior (mentioning)

confidence: 99%

“…The neural model (21) updates the weights by considering only one sample [19]. The key idea of the experience replay is to store more samples in a memory vector and add them to the gradient descent update rule [12], such that it minimizes simultaneously the actual TD error δ_t and the TD error of previous samples.…”

Section: Learning Human-behavior (mentioning)

confidence: 99%
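
The experience replay idea quoted above can be illustrated with a minimal sketch. It is not the citing paper's neural model (21): it assumes a discrete-time setting, a linear approximator with one-hot features, and a semi-gradient TD(0) update, and the names `replay_update`, `td_error`, `ALPHA`, and `GAMMA` are hypothetical. Each step sums the gradient contribution of the current sample and of every stored sample, with all TD errors recomputed under the current weights.

```python
from collections import deque

ALPHA = 0.1   # learning rate (assumed value)
GAMMA = 0.9   # discount factor (assumed value)

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def td_error(w, sample):
    """TD error of one stored transition under the current weights w."""
    phi, reward, phi_next = sample
    return reward + GAMMA * dot(w, phi_next) - dot(w, phi)

def replay_update(w, sample, memory):
    """One semi-gradient step over the current sample plus all stored samples."""
    batch = [sample] + list(memory)
    grad = [0.0] * len(w)
    for s in batch:
        delta = td_error(w, s)          # recomputed with the current weights
        for i, phi_i in enumerate(s[0]):
            grad[i] += delta * phi_i
    new_w = [wi + ALPHA * gi / len(batch) for wi, gi in zip(w, grad)]
    memory.append(sample)               # store the sample for later replays
    return new_w

# Toy two-state cycle with one-hot features: state 0 -> state 1 -> state 0.
memory = deque(maxlen=20)
w = [0.0, 0.0]
samples = [([1.0, 0.0], 1.0, [0.0, 1.0]),
           ([0.0, 1.0], 0.0, [1.0, 0.0])]
for step in range(2000):
    w = replay_update(w, samples[step % 2], memory)
```

In this toy problem the weights approach the exact values of the two states, V0 = 1/(1 - 0.81) and V1 = 0.9 V0, because every update reduces the TD error of the current and the replayed transitions together.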


Human-Behavior Learning for Infinite-Horizon Optimal Tracking Problems of Robot Manipulators

Perrusquía

2021

2021 60th IEEE Conference on Decision and Control (CDC)

Self Cite


In this paper, a human-behavior learning approach for optimal tracking control of robot manipulators is proposed. The approach is a generalization of the reinforcement learning control problem which merges the capabilities of different intelligent and control techniques in order to solve the tracking task. Three cognitive models are used: the robot dynamics, the reference dynamics, and neural networks. Convergence of the algorithm is achieved when the persistent excitation and experience replay conditions are fulfilled. The algorithm learns the optimal decision-making controller online according to the proposed cognitive models. Simulations were carried out to verify the approach using a 2-DOF planar robot.


“…To satisfy the control objectives, it has been developed different control techniques, such as PID [7]-[9], sliding mode control (SMC) [10]-[12], neural networks [13], intelligent techniques [14], [15] or even linear controllers [2], [16]. Each algorithm is capable to compensate the gravitational term and robustify the control law.…”

Section: Introduction (mentioning)

confidence: 99%

A Novel Tuning Method of PD With Gravity Compensation Controller for Robot Manipulators

2020


Proportional-Derivative (PD) control is one of the most widely used controllers, especially for robot manipulators. When the robot presents gravitational terms, PD control cannot guarantee position convergence, so compensation is required, such as PD with gravity compensation (PD+G). PD+G control requires knowledge of the gravitational term, and several results prove its global asymptotic stability. However, there is no method to tune the PD gains. In this work, a novel method to tune the PD+G controller is proposed. The tuning method is obtained using the global asymptotic stability result of LaSalle's theorem and robot dynamics properties. A comparison with previous works is carried out via simulations and experiments to verify the approach. The results show fast and smooth convergence to the desired reference without overshoots.
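
As a rough illustration of the PD+G structure described in this abstract, the sketch below simulates a single-link pendulum under the control law tau = Kp (q_des - q) - Kd dq + g(q). It does not implement the paper's tuning method: the pendulum model, the gains `KP` and `KD`, the physical parameters, and the Euler integration are all assumptions chosen for illustration, and the gravity term is assumed to be known exactly.

```python
import math

# Hypothetical single-link pendulum parameters (not taken from the paper).
MASS = 1.0      # kg
LENGTH = 0.5    # m
GRAVITY = 9.81  # m/s^2
KP = 20.0       # proportional gain (assumed, not tuned by the paper's method)
KD = 6.0        # derivative gain (assumed)

def gravity_torque(q):
    """Gravitational torque of the pendulum, q measured from the downward vertical."""
    return MASS * GRAVITY * LENGTH * math.sin(q)

def pd_g_control(q, dq, q_des):
    """PD control plus exact gravity compensation."""
    return KP * (q_des - q) - KD * dq + gravity_torque(q)

def simulate(q_des=1.0, dt=0.001, steps=5000):
    """Integrate inertia * ddq + g(q) = tau with a semi-implicit Euler scheme."""
    q, dq = 0.0, 0.0
    inertia = MASS * LENGTH ** 2
    for _ in range(steps):
        tau = pd_g_control(q, dq, q_des)
        ddq = (tau - gravity_torque(q)) / inertia
        dq += ddq * dt
        q += dq * dt
    return q, dq

q_final, dq_final = simulate()
```

With the gravity term cancelled, the closed loop reduces to a linear second-order system in the position error, which is why the joint settles at the reference. Without the `gravity_torque(q)` term in the controller, a steady-state position error would remain.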


Robot Control in Worst‐Case Uncertainty Using Reinforcement Learning

2021

Human‐Robot Interaction Control Using Reinforcement Learning

