Neuro-optimal control for a class of unknown nonlinear dynamic systems using SN-DHP technique

Wang, Ding; Liu, Derong

doi:10.1016/j.neucom.2013.04.006

Cited by 53 publications

(17 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To overcome the difficulty, many approximation methods are proposed to obtain optimal tracking control law [2][3][4][5]. Among these approximate approaches, adaptive dynamic programming (ADP) algorithm, proposed by Werbos [6,7], has played an important role in seeking approximate solutions of dynamic programming problems as a way to solve the computational issue forward-in-time [8][9][10][11][12][13][14]. There are several synonyms used for ADP including "adaptive critic designs" [15], "adaptive dynamic programming" [16,17], "approximate dynamic programming" [18,19], "neural dynamic programming" [20], "neuro-dynamic programming" [21], and "reinforcement learning" [22].…”

Section: Introductionmentioning

confidence: 99%

Neural-network-based adaptive optimal tracking control scheme for discrete-time nonlinear systems with approximation errors

Wei

Liu

2015

Neurocomputing

Self Cite

View full text Add to dashboard Cite

Section: Introductionmentioning

confidence: 99%

Neural-network-based adaptive optimal tracking control scheme for discrete-time nonlinear systems with approximation errors

Wei

Liu

2015

Neurocomputing

Self Cite

View full text Add to dashboard Cite

“…A class of RL-based adaptive optimal controllers, called approximate/adaptive dynamic programming (ADP), was first developed by Werbos [5,6]. Extensions of the RLbased controllers to DT systems have been considered by many researchers [7][8][9][10][11][12][13][14][15][16][17][18][19][20]. In [7], the authors attempted to solve the DT nonlinear optimal control problem offline using ADP approaches and neural networks by assuming that there are no NN reconstruction errors.…”

Section: Introductionmentioning

confidence: 99%

“…The work of [9] analyzed the convergence of unknown DT nonlinear systems using offlinetrained neural networks, but this method introduced the Lebesgue integral [7], which required data of a subset of the plant, in the tuning law and thus spent too much time on off-line training. In [20], the authors developed one way to control the unknown DT nonlinear systems using globalized dual heuristic programming, and others employed the single network dual heuristic dynamic programming (SN-DHP) technique in the ADP algorithm in [19]. Both of them introduced the gradient-based adaptation tuning law instead of the way in [9].…”

Section: Introductionmentioning

confidence: 99%

“…Both of them introduced the gradient-based adaptation tuning law instead of the way in [9]. However, without using recorded system data, iterations were needed in the tuning law [19,20] and the critic NN and actor NN could not be updated with respect to time at each sampling interval. Moreover, although [9,20,21] constructed a NN to identify the unknown system dynamics, they assumed that the NN identification error approached to zero, and thus the effects of the estimation error on the convergence of the actor-critic algorithms were not considered.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Online optimal control of unknown discrete-time nonlinear systems by using time-based adaptive dynamic programming

2015

View full text Add to dashboard Cite

“…Moreover, for addressing the nonlinear optimal control problem [5][6][7][8][9][10][11][12][13][14], it can be converted to solve a Hamilton-Jacobi-Bellman (HJB) equation instead of the ARE. However, it is difficult or even impossible to solve the HJB equation.…”

Section: Introductionmentioning

confidence: 99%

Approximate guaranteed cost fault-tolerant control of unknown nonlinear systems with time-varying actuator faults

Xie

Yang

2015

Nonlinear Dyn

View full text Add to dashboard Cite

In this paper, the guaranteed cost faulttolerant control problem for unknown multi-input continuous nonlinear systems with loss of actuator effectiveness faults is investigated using the adaptive dynamic programming algorithm. Initially, by modifying the cost function to account for actuator faults, the problem is transformed into an optimal control problem of the nominal system. Subsequently, by using an existing policy iteration (PI) algorithm to solve the corresponding optimal control problem, a guaranteed cost controller is constructed approximately. Furthermore, a rigorous proof is given to show the convergence of the aforementioned PI algorithm while taking the neural network approximation errors into consideration. Finally, simulation examples are provided to show the effectiveness of the proposed approach.

show abstract

Neuro-optimal control for a class of unknown nonlinear dynamic systems using SN-DHP technique

Cited by 53 publications

References 30 publications

Neural-network-based adaptive optimal tracking control scheme for discrete-time nonlinear systems with approximation errors

Neural-network-based adaptive optimal tracking control scheme for discrete-time nonlinear systems with approximation errors

Online optimal control of unknown discrete-time nonlinear systems by using time-based adaptive dynamic programming

Approximate guaranteed cost fault-tolerant control of unknown nonlinear systems with time-varying actuator faults

Contact Info

Product

Resources

About