2017
DOI: 10.1109/tcyb.2016.2586082

Discrete-Time Optimal Control via Local Policy Iteration Adaptive Dynamic Programming

Abstract: In this paper, a discrete-time optimal control scheme is developed via a novel local policy iteration adaptive dynamic programming algorithm. In the discrete-time local policy iteration algorithm, the iterative value function and iterative control law can be updated in a subset of the state space, which relaxes the computational burden compared with the traditional policy iteration algorithm. Convergence properties of the local policy iteration algorithm are presented to show that the iterative value function…
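
The abstract's central mechanism (restricting the value-function and control-law updates to a subset of the state space) can be illustrated with a short sketch. The following Python snippet is a minimal, hypothetical illustration, not the paper's algorithm: it assumes a scalar linear system x_{k+1} = a·x_k + b·u_k with quadratic stage cost q·x² + r·u², a discretized state grid, and a local subset Ω = {x : |x| ≤ 0.5}; all of these choices are illustrative assumptions.

```python
import numpy as np

# Hypothetical problem data (not from the paper): scalar linear dynamics
# x_{k+1} = a*x_k + b*u_k with stage cost q*x^2 + r*u^2.
a, b, q, r = 0.9, 1.0, 1.0, 1.0
grid = np.linspace(-1.0, 1.0, 201)      # discretized state space
local_set = np.abs(grid) <= 0.5         # subset Omega where updates occur

V = q * grid**2                         # initial value-function guess
u = np.zeros_like(grid)                 # initial admissible control law

def interp_V(x):
    """Linearly interpolate V at the next states (clipped to the grid)."""
    return np.interp(np.clip(x, grid[0], grid[-1]), grid, V)

for _ in range(30):                     # outer policy-iteration steps
    # Approximate policy evaluation, restricted to Omega: states outside
    # the local subset keep their previous values, which is what relaxes
    # the per-iteration computational burden.
    for _ in range(20):
        V = np.where(local_set,
                     q * grid**2 + r * u**2 + interp_V(a * grid + b * u),
                     V)
    # Policy improvement over a coarse control grid, again only on Omega.
    cand = np.linspace(-1.0, 1.0, 101)
    costs = (q * grid[:, None]**2 + r * cand[None, :]**2
             + interp_V(a * grid[:, None] + b * cand[None, :]))
    u = np.where(local_set, cand[np.argmin(costs, axis=1)], u)

print("approximate optimal cost at x = 0.25:", interp_V(0.25))
```

In a full implementation the local subset would typically track a region of interest (e.g., a neighborhood of the current operating point), and the evaluation step would run until the local Bellman residual is small; both are simplified here.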

Cited by 93 publications (30 citation statements)
References 64 publications

“…The following design algorithm provides an appropriate selection of the coefficients a_{r−1}, …, a_0 in (7) in order to guarantee the satisfaction of the input constraints (4).…”
Section: Robust Feedback Linearization Control of Constrained Affine… (mentioning)
confidence: 99%
“…To pursue this goal, a lot of research work has been carried out in the context of optimal control. After the seminal work on time-optimal control of continuous-time linear systems by Pontryagin [1], which leads to a bang-bang control scheme, many contributions toward the development of time-optimal control for other classes of systems, such as linear discrete-time systems [2,3] as well as both discrete- and continuous-time nonlinear systems [4-9], have appeared in the literature. One of the main problems when implementing time-optimal control is that there is no guarantee that it results in a stable system.…”
Section: Introduction (mentioning)
confidence: 99%
“…It is hard to solve nonanalytical equations like (7) and (8). Thus, an event-triggered HDP algorithm with a novel triggering condition is studied in the next section to solve the problem.…”
Section: Problem Formulation (mentioning)
confidence: 99%
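
As a rough illustration of the event-triggered idea mentioned in the excerpt above (not the cited paper's specific triggering condition), a control update can be gated on the deviation between the current state and the last sampled state; the norm and threshold below are illustrative assumptions.

```python
import numpy as np

# Hypothetical event-triggering rule: re-evaluate the control law only
# when the state has drifted far enough from the last sampled state.
def event_triggered_step(x, x_sample, u_hold, policy, threshold=0.1):
    """Return (u, new_sample): control to apply and the updated sample."""
    if np.linalg.norm(x - x_sample) > threshold:   # event fires
        return policy(x), x                        # recompute and resample
    return u_hold, x_sample                        # hold previous control
```
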
“…Adaptive dynamic programming (ADP) refers to a family of practical actor-critic methods for finding optimal solutions in real time [1], and it is a self-learning method [2-8]. In 1977, adaptive critic design was first proposed by Werbos [9], which takes advantage of neural networks (NNs). Then, several names emerged, e.g., approximate dynamic programming and asymptotic dynamic programming.…”
Section: Introduction (mentioning)
confidence: 99%
“…The early studies in the field of RL and ADP included the works of Werbos [10] and Sutton [11]. After that, various RL and ADP methods were reported, such as integral RL [12,13], online RL [14-16], off-policy RL [17-19], local value/policy iterative ADP [20-22], Hamiltonian-driven ADP [23], robust ADP [24,25], and goal representation ADP [26,27]. Over the past several years, RL and ADP have been widely used to solve robust nonlinear control problems.…”
Section: Introduction (mentioning)
confidence: 99%