Maoguang Zhang scite author profile

This paper studies the online adaptive optimal controller design for a class of nonlinear systems through a novel policy iteration (PI) algorithm. By using the technique of neural network linear differential inclusion (LDI) to linearize the nonlinear terms in each iteration, the optimal law for controller design can be solved through the relevant algebraic Riccati equation (ARE) without using the system internal parameters. Based on PI approach, the adaptive optimal control algorithm is developed with the online linearization and the two-step iteration, i.e., policy evaluation and policy improvement. The convergence of the proposed PI algorithm is also proved. Finally, two numerical examples are given to illustrate the effectiveness and applicability of the proposed method.

show abstract

Online policy iterative-based H∞ optimization algorithm for a class of nonlinear systems

Fang

Zhang

et al. 2019

Information Sciences

View full text Add to dashboard Cite

Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information

Zhang

Fang

et al. 2019

Neural Comput & Applic

View full text Add to dashboard Cite

providing relevant details, so we can investigate your claim. Download date:03. Nov. 2020 Abstract-In this paper, an online adaptive optimal control problem of a class of continuous-time Markov jump linear systems (MJLSs) is investigated by using a parallel reinforcement learning (RL) algorithm with completely unknown dynamics. Before collecting and learning the subsystems information of states and inputs, the exploration noise is firstly added to describe the actual control input. Then, a novel parallel RL algorithm is used to parallelly compute the corresponding N coupled algebraic Riccati equations (AREs) by online learning. By this algorithm, we will not need to know the dynamic information of the MJLSs. The convergence of the proposed algorithm is also proved. Finally, the effectiveness and applicability of this novel algorithm is illustrated by two simulation examples. Index Terms-Markov jump linear systems (MJLSs); adaptive optimal control; online; reinforcement learning (RL); coupled algebraic Riccati equations (AREs).

show abstract

Solving the Zero-Sum Control Problem for Tidal Turbine System: An Online Reinforcement Learning Approach

Fang

Zhang

et al. 2023

IEEE Trans. Cybern.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Maoguang Zhang

Adaptive Optimal Control for a Class of Nonlinear Systems: The Online Policy Iteration Approach

Online policy iterative-based H∞ optimization algorithm for a class of nonlinear systems

Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information

Solving the Zero-Sum Control Problem for Tidal Turbine System: An Online Reinforcement Learning Approach

Contact Info

Product

Resources

About