Online adaptive optimal control for continuous-time nonlinear systems with completely unknown dynamics

Lv, Yongfeng; Na, Jing; Yang, Qinmin; Wu, Xing; Guo, Yu

doi:10.1080/00207179.2015.1060362

Cited by 106 publications

(51 citation statements)

References 40 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For specific continuous-time linear systems, the OTC may be designed by using Riccita equation method [1,2]. However, only a few results have been suggested for nonlinear systems because it is not trivial to solve the associated Hamilton-Jacobi-Bellman (HJB) equation [3]. Nevertheless, the direct application of dynamic programming (DP) [5] to solve OTC problem also encountered difficulties for high order systems.…”

Section: Instructionmentioning

confidence: 99%

“…In this paper, we study the optimal tracking control of nonlinear CT systems with completely unknown dynamics by further improving our previously proposed 'identifier-critic' strategy [3,4,9]. However, different to [4,9], the overall optimal control can be obtained simultaneously by using the the system augmentation method [12].…”

mentioning

confidence: 99%

“…However, different to [4,9], the overall optimal control can be obtained simultaneously by using the the system augmentation method [12]. First, an adaptive identifier as [3,4,9] is used to estimate the unknown system dynamics. Then, an augmented system composed of the tracking error dynamics and the desired trajectory is constructed, and a new cost function for the augmented system is suggested.…”

mentioning

confidence: 99%

See 2 more Smart Citations

Adaptive optimal tracking control of unknown nonlinear systems using system augmentation

Yang

et al. 2016

2016 International Joint Conference on Neural Networks (IJCNN)

Self Cite

View full text Add to dashboard Cite

Abstract-In this paper, an alternative solution for adaptive optimal tracking control of nonlinear completely unknown systems is proposed. Firstly, an adaptive identifier is used to estimate the unknown system dynamics. Then, a recently developed system augmentation approach is adopted to design the optimal control, where the reference signal is incorporated into the augmented system. Thus, both the feedforward control and feedback control can be obtained simultaneously. Then, a critic neural network (NN) is used to estimate the augmented performance index, and calculate the optimal control action. Thus, the widely used actor NN is not needed. Finally, a new adaptive law recently proposed by the authors is used to online update the NN weight. The closed-loop stability and the convergence of the optimal control are all proved. The feasibility of the suggested approach is demonstrated by a simulation example. I. INSTRUCTION1 The objective of solving optimal tracking control (OTC) is to design a controller in such a way that the system state or output tracks a given reference in an optimal manner by minimizing a predefined performance index. The direct extension of optimal control schemes used for regulation to solve the OTC problem is not straightforward [1]. For specific continuous-time linear systems, the OTC may be designed by using Riccita equation method [1,2]. However, only a few results have been suggested for nonlinear systems because it is not trivial to solve the associated Hamilton-Jacobi-Bellman (HJB) equation [3]. Nevertheless, the direct application of dynamic programming (DP) [5] to solve OTC problem also encountered difficulties for high order systems.Adaptive dynamic programming (ADP) proposed by Werbos [6] has been developed as a feasible method to address the optimal control problems forward-in-time for discrete-time (DT) systems. However, extensions of the ADP methods for continuous-time (CT) systems [7] entail challenges in proving the closed-loop system stability. Moreover, most available ADP results assume that the system dynamics are partially or fully known. To relax these requirements of system dynamics, Zhang et al. [8] used a neural network (NN) identifier to reconstruct unknown drift dynamics, and proposed an adaptive optimal control. We have also suggested a new 'identifier-

show abstract

Section: Instructionmentioning

confidence: 99%

mentioning

confidence: 99%

mentioning

confidence: 99%

See 1 more Smart Citation

Adaptive optimal tracking control of unknown nonlinear systems using system augmentation

Yang

et al. 2016

2016 International Joint Conference on Neural Networks (IJCNN)

Self Cite

View full text Add to dashboard Cite

show abstract

“…It has been proved in [16][17][18] that adaptive estimation method considering the parameter information can greatly improve the convergence speed in contrast to the conventional estimation method driven by the observer error. Inspired from these facts, a novel robust estimation method of is presented in the following analysis.…”

Section: Remarkmentioning

confidence: 99%

Online Adaptive Optimal Control of Vehicle Active Suspension Systems Using Single‐Network Approximate Dynamic Programming

Ning

et al. 2017

Mathematical Problems in Engineering

View full text Add to dashboard Cite

In view of the performance requirements (e.g., ride comfort, road holding, and suspension space limitation) for vehicle suspension systems, this paper proposes an adaptive optimal control method for quarter-car active suspension system by using the approximate dynamic programming approach (ADP). Online optimal control law is obtained by using a single adaptive critic NN to approximate the solution of the Hamilton-Jacobi-Bellman (HJB) equation. Stability of the closed-loop system is proved by Lyapunov theory. Compared with the classic linear quadratic regulator (LQR) approach, the proposed ADP-based adaptive optimal control method demonstrates improved performance in the presence of parametric uncertainties (e.g., sprung mass) and unknown road displacement. Numerical simulation results of a sedan suspension system are presented to verify the effectiveness of the proposed control strategy.

show abstract

“…In [69], an ADP technique for online control and learning of a generalized multiple-input-multiple-output (MIMO) system was investigated. In [70], an adaptive NN based ADP control scheme was presented for a class of nonlinear systems with unknown dynamics. The optimal control law was calculated by using a dual neural network scheme with a critic NN and an identifier NN.…”

Section: Nn Based Adaptive Dynamicmentioning

confidence: 99%

A Brief Review of Neural Networks Based Learning and Control and Their Applications for Robots

Jiang

Yang

et al. 2017

Complexity

Self Cite

View full text Add to dashboard Cite

As an imitation of the biological nervous systems, neural networks (NNs), which have been characterized as powerful learning tools, are employed in a wide range of applications, such as control of complex nonlinear systems, optimization, system identification, and patterns recognition. This article aims to bring a brief review of the state-of-the-art NNs for the complex nonlinear systems by summarizing recent progress of NNs in both theory and practical applications. Specifically, this survey also reviews a number of NN based robot control algorithms, including NN based manipulator control, NN based human-robot interaction, and NN based cognitive control.

show abstract

Online adaptive optimal control for continuous-time nonlinear systems with completely unknown dynamics

Cited by 106 publications

References 40 publications

Adaptive optimal tracking control of unknown nonlinear systems using system augmentation

Adaptive optimal tracking control of unknown nonlinear systems using system augmentation

Online Adaptive Optimal Control of Vehicle Active Suspension Systems Using Single‐Network Approximate Dynamic Programming

A Brief Review of Neural Networks Based Learning and Control and Their Applications for Robots

Contact Info

Product

Resources

About