Abstract: We present a recurrent neural-network (RNN) controller designed to solve the tracking problem for control systems. We demonstrate that a major difficulty in training any RNN is the problem of exploding gradients, and we propose a solution to this in the case of tracking problems, by introducing a stabilization matrix and by using carefully constrained context…
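To make the exploding-gradient issue concrete, the toy sketch below trains a small RNN controller on a first-order plant and simply clips the gradient norm, a generic remedy rather than the stabilization-matrix scheme the abstract refers to; the plant, network sizes, and learning rates are all invented for illustration.

```python
# Hypothetical illustration (not the paper's method): an RNN controller whose
# output weights are tuned by gradient descent, with gradient-norm clipping as
# a generic guard against exploding gradients.
import numpy as np

rng = np.random.default_rng(0)
n_h = 8                                    # hidden units (illustrative)
Wh = rng.normal(0, 0.1, (n_h, n_h))        # recurrent weights (kept fixed here)
Wx = rng.normal(0, 0.1, n_h)               # weights on the tracking error
Wu0 = rng.normal(0, 0.1, n_h)              # output (control) weights to be tuned

def tracking_cost(wu, T=40, ref=1.0):
    """Roll the RNN controller on a toy stable plant; return mean squared tracking error."""
    x, h, cost = 0.0, np.zeros(n_h), 0.0
    for _ in range(T):
        h = np.tanh(Wh @ h + Wx * (ref - x))   # controller state driven by the error
        u = float(wu @ h)                       # control action
        x = 0.9 * x + 0.1 * u                   # toy first-order plant
        cost += (ref - x) ** 2
    return cost / T

def clipped_grad(f, w, eps=1e-5, max_norm=1.0):
    """Finite-difference gradient of f at w, rescaled when its norm explodes."""
    g = np.array([(f(w + eps * e) - f(w - eps * e)) / (2 * eps) for e in np.eye(w.size)])
    n = np.linalg.norm(g)
    return g * (max_norm / n) if n > max_norm else g

w = Wu0.copy()
for _ in range(100):                       # simple gradient descent on the output weights
    w -= 0.2 * clipped_grad(tracking_cost, w)
print("final tracking cost:", tracking_cost(w))
```

Clipping only bounds the update size; the abstract's stabilization matrix and constrained context address the same failure mode inside the recurrence itself.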
“…where u_d is given by (4) and u_e^* is given by (9). Remark 2: The feedback part of the control input (9) is designed to stabilize the tracking error dynamics.…”
Section: Problem Formulation and Its Standard Solution (mentioning, confidence: 99%)
“…One can refer to [9], [45], and [46] for an exact gradient descent algorithm with improved convergence guarantees.…”
Section: Learning Rules for Actor and Critic NNs (mentioning, confidence: 99%)
“…Several techniques have been proposed to approximate the HJB solution. Included are reinforcement learning (RL) [1]- [8] and backpropagation through time [9]. RL techniques have been successfully applied to find the solution to the HJB equation online in real time for unknown or partially unknown continuous-time (CT) systems [10]- [12] and discrete-time (DT) systems [13]- [17].…”
This paper presents a partially model-free adaptive optimal control solution to the deterministic nonlinear discrete-time (DT) tracking control problem in the presence of input constraints. The tracking error dynamics and reference trajectory dynamics are first combined to form an augmented system. Then, a new discounted performance function based on the augmented system is presented for the optimal nonlinear tracking problem. In contrast to the standard solution, which finds the feedforward and feedback terms of the control input separately, the minimization of the proposed discounted performance function gives both feedback and feedforward parts of the control input simultaneously. This enables us to encode the input constraints into the optimization problem using a nonquadratic performance function. The DT tracking Bellman equation and tracking Hamilton-Jacobi-Bellman (HJB) equation are derived. An actor-critic-based reinforcement learning algorithm is used to learn the solution to the tracking HJB equation online without requiring knowledge of the system drift dynamics. That is, two neural networks (NNs), namely an actor NN and a critic NN, are tuned online and simultaneously to generate the optimal bounded control policy. A simulation example is given to show the effectiveness of the proposed method.
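The sketch below illustrates the general actor-critic flavour described in this abstract on a toy linear system: the augmented state stacks the tracking error and the reference, the critic is tuned on a discounted tracking Bellman residual, and the actor output is saturated as a crude stand-in for the input constraints. The quadratic features, REINFORCE-style actor update, gains, and plant are illustrative assumptions, not the paper's derivation.

```python
# Hedged sketch of online, simultaneous actor-critic tuning for a discounted
# tracking problem. Everything numerical here is invented for illustration.
import numpy as np

rng = np.random.default_rng(1)
A = np.array([[0.95, 0.10], [0.00, 0.90]])   # toy drift dynamics (used only by the simulator)
B = np.array([0.0, 0.1])                     # input matrix
gamma = 0.8                                  # discount factor of the tracking performance index
F = 0.99                                     # reference generator: r_{k+1} = F * r_k

def phi(z):
    """Quadratic critic features of the augmented state z = [e1, e2, r]."""
    return np.array([z[0]*z[0], z[1]*z[1], z[2]*z[2], z[0]*z[1], z[0]*z[2], z[1]*z[2]])

Wc = np.zeros(6)          # critic weights:  V(z) ~ Wc . phi(z)
Wa = np.zeros(3)          # actor weights:   u(z) ~ sat(Wa . z)
x, r = np.zeros(2), 1.0

for k in range(3000):                                    # online, simultaneous tuning
    e = x - np.array([r, 0.0])                           # tracking error (track the first state)
    z = np.concatenate([e, [r]])                         # augmented state
    noise = 0.05 * rng.normal()                          # exploration
    u = float(np.clip(Wa @ z + noise, -1.0, 1.0))        # saturated actor output (crude input constraint)
    stage = e @ e + 0.1 * u * u                          # stage cost (the paper uses a nonquadratic one)
    x = A @ x + B * u
    r = F * r
    z2 = np.concatenate([x - np.array([r, 0.0]), [r]])
    td = stage + gamma * (Wc @ phi(z2)) - Wc @ phi(z)    # discounted tracking Bellman residual
    Wc += 0.02 * td * phi(z)                             # critic: semi-gradient step on the residual
    Wa -= 0.01 * td * noise * z                          # actor: REINFORCE-style correction (illustrative)

print("critic weights:", np.round(Wc, 3))
print("actor gains:   ", np.round(Wa, 3))
```

Note that only measured transitions enter the updates, which is what makes this style of learning partially model-free; the drift matrices above exist only so the toy simulator can generate those transitions.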
“…The control strategy of a GFC is crucial to maintain the power quality and the security of the GFC system (Fairbank, Li, Fu, Alonso, & Wunsch, 2014). Both subsystems of a GFC may exhibit certain degrees of uncertainty and volatility.…”
Section: Introduction (mentioning, confidence: 99%)
“…In Fairbank et al (2014), Fu, Li, and Jaithwa (2015), Li et al (2014), recurrent neural networks (RNNs), as an intelligence control method, have been used to control GFC systems in which the RNN is trained by the back propagation through time (BPTT) algorithm. However, the training process is complex.…”
Three-phase grid-feeding converters (GFCs) are key components for integrating distributed generation and renewable power sources into the power utility. Conventionally, proportional-integral and proportional-resonant-based control strategies are applied to control the output power or current of a GFC, but these strategies have poor transient performance and are not robust against uncertainties and volatilities in the system. This paper proposes an H₂/H∞-based control strategy that mitigates these limitations. Uncertainty and disturbance are included in the GFC state-space model, so that it reflects practical system conditions more accurately. The paper uses a convex optimisation method to design the H₂/H∞-based optimal controller and, instead of a guess-and-check procedure, employs particle swarm optimisation to search for the H₂/H∞ optimal controller. Several case studies, implemented in both simulation and experiment, verify the superiority of the proposed control strategy over traditional PI control methods, especially under dynamic and variable system conditions.
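A rough illustration of the search step only: the snippet below runs a plain particle swarm over two controller gains, scoring each candidate with a stand-in cost that blends a nominal tracking term with a disturbance-peak term. It does not compute H₂ or H∞ norms, and the toy plant, weights, and bounds are assumptions made purely for the example.

```python
# Hedged sketch: particle swarm optimisation over controller gains with a
# surrogate "mixed" cost. All plants, weights and bounds are illustrative.
import numpy as np

rng = np.random.default_rng(2)

def mixed_cost(k):
    """Simulate a toy first-order loop; blend a nominal-performance term and a disturbance-peak term."""
    kp, ki = k
    def run(dist):
        x, z, peak, cost = 0.0, 0.0, 0.0, 0.0
        for _ in range(200):
            e = 1.0 - x                      # unit reference
            u = kp * e + ki * z              # PI-like control law
            z += 0.01 * e
            x += 0.01 * (-x + u + dist)      # toy plant with additive disturbance
            cost += 0.01 * e * e
            peak = max(peak, abs(e))
        return cost, peak
    j2, _ = run(0.0)                         # nominal-performance term ("H2-like" stand-in)
    _, jinf = run(0.5)                       # worst-error proxy under a step disturbance ("Hinf-like" stand-in)
    return j2 + 2.0 * jinf

# plain PSO over the two gains
n, dim = 20, 2
pos = rng.uniform(0.0, 5.0, (n, dim))
vel = np.zeros((n, dim))
pbest, pbest_f = pos.copy(), np.array([mixed_cost(p) for p in pos])
gbest = pbest[np.argmin(pbest_f)]
for _ in range(50):
    r1, r2 = rng.random((n, dim)), rng.random((n, dim))
    vel = 0.7 * vel + 1.5 * r1 * (pbest - pos) + 1.5 * r2 * (gbest - pos)
    pos = np.clip(pos + vel, 0.0, 5.0)
    f = np.array([mixed_cost(p) for p in pos])
    better = f < pbest_f
    pbest[better], pbest_f[better] = pos[better], f[better]
    gbest = pbest[np.argmin(pbest_f)]
print("best gains:", gbest, "cost:", pbest_f.min())
```

The appeal of the swarm search, as opposed to guess-and-check tuning, is that the cost function only needs to be evaluable, not differentiable or convex in the gains.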
Over the last 40 years, the theory and technology of model predictive control (MPC) have developed rapidly. However, nonlinear MPC still faces difficulties such as high online computational complexity and the inability to model the system accurately. To address these problems, recent research has turned to learning-based control: learned models can capture unknown or highly uncertain nonlinearities, and the emergence of efficient algorithms has made online computation far more tractable. Stability is at the heart of control design, and although learning-based nonlinear model predictive control (LB-NMPC) has produced systematic research results over the past 10 years, the stability of LB-NMPC remains an open question that has not been fully addressed in the literature. This review summarizes the latest research progress on LB-NMPC. More specifically, it examines how learning techniques are used to handle uncertainty and online optimization for the considered systems, and briefly discusses research hotspots such as control stability and constraint satisfaction of LB-NMPC. Finally, applications of LB-NMPC in integrated circuits, path-tracking control, and other fields are reviewed, providing a reference for the research and application of LB-NMPC.
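As a minimal picture of the receding-horizon loop this review surveys, the sketch below wraps a nominal model plus a learned residual correction inside a short-horizon, random-shooting optimiser; the plant, the running-average "learning", and the shooting optimiser are deliberate simplifications, not any specific method from the literature.

```python
# Hedged, minimal LB-NMPC-style loop: a learned residual corrects a nominal
# model used for short-horizon prediction at every step. Purely illustrative.
import numpy as np

rng = np.random.default_rng(3)

def plant(x, u):                         # true dynamics, unknown to the controller
    return 0.9 * x + 0.2 * u + 0.05 * np.sin(x)

def nominal(x, u):                       # controller's imperfect prior model
    return 0.9 * x + 0.2 * u

residual = 0.0                           # learned correction: running mean of the one-step model error

def mpc(x, horizon=10, samples=200):
    """Random-shooting MPC: return the first input of the best sampled input sequence."""
    best_u, best_cost = 0.0, np.inf
    for _ in range(samples):
        us = rng.uniform(-1.0, 1.0, horizon)
        xp, cost = x, 0.0
        for u in us:
            xp = nominal(xp, u) + residual            # learned model used for prediction
            cost += (xp - 1.0) ** 2 + 0.01 * u ** 2   # track the setpoint 1.0, penalise effort
        if cost < best_cost:
            best_u, best_cost = us[0], cost
    return best_u

x = 0.0
for k in range(100):
    u = mpc(x)
    x_next = plant(x, u)
    residual += 0.1 * ((x_next - nominal(x, u)) - residual)   # online model learning
    x = x_next
print("final state (target 1.0):", x)
```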