2020
DOI: 10.1002/rnc.4911
|View full text |Cite
|
Sign up to set email alerts
|

Robust control under worst‐case uncertainty for unknown nonlinear systems using modified reinforcement learning

Abstract: Reinforcement learning (RL) is an effective method for the design of robust controllers of unknown nonlinear systems. Normal RLs for robust control, such as actor-critic (AC) algorithms, depend on the estimation accuracy. Uncertainty in the worst case requires a large state-action space, this causes overestimation and computational problems. In this article, the RL method is modified with the k-nearest neighbor and the double Q-learning algorithm. The modified RL does not need the neural estimator as AC and ca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
32
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 33 publications
(32 citation statements)
references
References 47 publications
0
32
0
Order By: Relevance
“…The cognitive set C gives numerous models and skills that are useful to get the optimal decision making (13). Humanbehavior is supported by many intelligent techniques such as: reinforcement learning [8], [32], deep learning [17], [26], machine learning techniques [21], function approximators [5], [8], and so on; with the aim of providing an ability to learn by interacting with the actions (x t , u t ) to achieve the control task.…”
Section: Learning Human-behaviormentioning
confidence: 99%
See 3 more Smart Citations
“…The cognitive set C gives numerous models and skills that are useful to get the optimal decision making (13). Humanbehavior is supported by many intelligent techniques such as: reinforcement learning [8], [32], deep learning [17], [26], machine learning techniques [21], function approximators [5], [8], and so on; with the aim of providing an ability to learn by interacting with the actions (x t , u t ) to achieve the control task.…”
Section: Learning Human-behaviormentioning
confidence: 99%
“…The neural cognitive model (21) gives an approximate solution of (11) such that the optimal decision making control ( 13) is obtained. To achieve this goal, the approximation needs an exploration term which, in this case, is given by a PE exciting condition.…”
Section: Learning Human-behaviormentioning
confidence: 99%
See 2 more Smart Citations
“…To satisfy the control objectives, it has been developed different control techniques, such as PID [7]- [9], sliding mode control (SMC) [10]- [12], neural networks [13], intelligent techniques [14], [15] or even linear controllers [2], [16]. Each algorithm is capable to compensate the gravitational term and robustify the control law.…”
Section: Introductionmentioning
confidence: 99%