2021
DOI: 10.1002/asjc.2552
|View full text |Cite
|
Sign up to set email alerts
|

A novel optimal control design for unknown nonlinear systems based on adaptive dynamic programming and nonlinear model predictive control

Abstract: This paper presents a novel adaptive optimal control algorithm by combining adaptive dynamic programming with nonlinear model predictive control for unknown continuous-time affine nonlinear systems. The adaptive optimal control design is realized by the model-critic-actor architecture. Model neural network, critic neural network and actor neural network are constructed to approximate the system dynamics, the cost function and the optimal control law respectively. The random initialization of neural networks us… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(10 citation statements)
references
References 37 publications
(78 reference statements)
0
10
0
Order By: Relevance
“…The initial value of neural network remarkably affects the control performance and training convergence speed [20,21]. At the initial stage of agent searching the data points, if the Q-value estimated by the initial neural network is far from the ideal Q-value, the RL algorithm needs more samples and longer training time to converge.…”
Section: Neural Network Initializationmentioning
confidence: 99%
“…The initial value of neural network remarkably affects the control performance and training convergence speed [20,21]. At the initial stage of agent searching the data points, if the Q-value estimated by the initial neural network is far from the ideal Q-value, the RL algorithm needs more samples and longer training time to converge.…”
Section: Neural Network Initializationmentioning
confidence: 99%
“…According to earlier studies [11,12], the optimal control problem can be solved by minimizing the infinite horizon quadratic cost function. However, in this paper, because of security constraints, the traditional quadratic cost function cannot achieve our goal.…”
Section: Time-triggered Safe Control Schemementioning
confidence: 99%
“…Different from the TTC scheme, in the event-triggered safe control scheme, the update of the controller does not use the state of the current instant but uses the sampled state of the event-triggering instants to update. Therefore, a zero-order holder is adopted between event-triggering instants; according to (11), we can get the event-triggered optimal safety control law as follows:…”
Section: Event-triggered Safe Control Schemementioning
confidence: 99%
See 2 more Smart Citations