2017
DOI: 10.1002/oca.2391
|View full text |Cite
|
Sign up to set email alerts
|

Online reinforcement learning for a class of partially unknown continuous‐time nonlinear systems via value iteration

Abstract: Summary In this paper, a modified value iteration–based approximate dynamic programming method is proposed for a class of affine nonlinear continuous‐time systems, whose dynamics are partially unknown. The value iteration algorithm is established in an online fashion, and the convergence proof is given. To attenuate the effect caused by the unascertained characteristics of the system dynamics, the integral reinforcement learning scheme is also used. In the proposed approximate dynamic programming method, it is… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
22
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 19 publications
(22 citation statements)
references
References 72 publications
0
22
0
Order By: Relevance
“…Theorem 1. For the model NN (10), if the activation function is the sigmoid function and the hidden-to-output weight is updated by the adaptive law (13), then the approximate error x and wmo is uniformly ultimately bounded.…”
Section: Model Nn Design and The Convergence Analysismentioning
confidence: 99%
See 3 more Smart Citations
“…Theorem 1. For the model NN (10), if the activation function is the sigmoid function and the hidden-to-output weight is updated by the adaptive law (13), then the approximate error x and wmo is uniformly ultimately bounded.…”
Section: Model Nn Design and The Convergence Analysismentioning
confidence: 99%
“…The optimal control calculated from our algorithm FIGURE 7 The optimal control from the paper [13] FIGURE 8 The cost function produced in our algorithm (upper) and the optimal cost function (lower) [13]. Finally, the initial admissible control law is obtained from NMPC in proposed algorithm and the initial weights of neural networks can be designed reasonably.…”
Section: Figurementioning
confidence: 99%
See 2 more Smart Citations
“…Compared with the existing literature on VI based IRL[64] [65][66], the main contributions of this chapter are enumerated here.…”
mentioning
confidence: 99%