2020
DOI: 10.1002/asjc.2276

Infinite time linear quadratic Stackelberg game problem for unknown stochastic discrete‐time systems via adaptive dynamic programming approach

Abstract: In this paper, we propose an adaptive dynamic programming (ADP) approach to solve the infinite horizon linear quadratic (LQ) Stackelberg game problem for unknown stochastic discrete‐time systems with multiple decision makers. First, the stochastic LQ Stackelberg game problem is converted into a deterministic problem by system transformation. Next, a value iteration ADP approach is put forward and its convergence is established. Then, in order to implement the iterative method, back propagation neural network …
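To make the value-iteration idea in the abstract concrete, here is a minimal sketch of value iteration for a standard single-player discrete-time LQ regulator. It is an illustration only, not the paper's algorithm: it assumes the system matrices A and B are known and deterministic, whereas the paper handles unknown stochastic dynamics, a leader-follower (Stackelberg) structure, and a neural-network implementation; the function name lq_value_iteration and the numerical values are hypothetical.

```python
import numpy as np

# Minimal value-iteration sketch for a discrete-time LQ regulator
#   x_{k+1} = A x_k + B u_k,  cost = sum_k x_k'Q x_k + u_k'R u_k.
# Illustrative only: A and B are assumed known here, while the paper treats
# unknown stochastic dynamics with a Stackelberg (leader-follower) structure.

def lq_value_iteration(A, B, Q, R, tol=1e-9, max_iter=10_000):
    n = A.shape[0]
    P = np.zeros((n, n))                           # start from the zero value matrix
    K = np.zeros((B.shape[1], n))
    for _ in range(max_iter):
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)   # current feedback gain
        P_next = Q + A.T @ P @ (A - B @ K)                  # Bellman (Riccati) update
        if np.max(np.abs(P_next - P)) < tol:
            return P_next, K
        P = P_next
    return P, K

if __name__ == "__main__":
    A = np.array([[1.0, 0.1], [0.0, 1.0]])
    B = np.array([[0.0], [0.1]])
    Q = np.eye(2)
    R = np.array([[1.0]])
    P, K = lq_value_iteration(A, B, Q, R)
    print("value matrix P:\n", P)
    print("feedback gain K:", K)
```

In the paper's setting the same Bellman recursion is carried out without explicit knowledge of the system matrices, using measured data and back propagation neural networks to approximate the value function, and with separate leader and follower costs.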

Cited by 14 publications (15 citation statements).
References: 33 publications.
“…Situation 1: if t ∈ [τ_j, τ_{j+1}), we can obtain the derivative of (40) when the events are not triggered as follows, substituting (18) and (19)…”
Section: Stability Analysis (mentioning, confidence: 99%)
“…This structure is similar to reinforcement learning (RL) and adaptive dynamic programming (ADP), so ACL, ADP, and RL are taken as synonyms. Because of the advantages of ACL technology, it has been widely used in continuous-time control [16,17], discrete-time control [18,19], robust control [20,21], fault-tolerant control [22], and many other fields [23]. Therefore, scholars have adopted ACL-based methods to design optimal security controllers and have promoted the rapid development of this field in recent years.…”
Section: Introduction (mentioning, confidence: 99%)
“…Remark: Note that when the control system in problem () is disturbed only by random variables or only by uncertain variables, the uncertain random system reduces to a stochastic system or an uncertain system, respectively, and the LQ optimal control problems for these separate systems [4,12,13,28] have been tackled in recent years. Compared with previous studies [4,12,13,28], we use dynamic programming to solve the optimal control problem. We study optimal control for uncertain random systems.…”
Section: Linear Quadratic Model (mentioning, confidence: 99%)
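The dynamic-programming route mentioned in this excerpt can be sketched, under strong simplifying assumptions, as a finite-horizon backward Riccati recursion for a purely deterministic LQ problem. This is not the cited work's uncertain-random method, which needs expectation and uncertainty operators omitted here; the function name lq_backward_dp and the numbers are hypothetical.

```python
import numpy as np

# Finite-horizon dynamic programming (backward Riccati recursion) for a
# deterministic LQ problem x_{k+1} = A x_k + B u_k over N stages.
# Simplified sketch: the uncertain random systems of the cited work require
# additional expectation/uncertainty machinery not modeled here.

def lq_backward_dp(A, B, Q, R, QN, N):
    P = QN                               # terminal value matrix
    gains = []
    for _ in range(N):                   # sweep backward in time
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K)
        gains.append(K)
    gains.reverse()                      # gains[k] is the feedback gain at stage k
    return gains, P

A = np.array([[1.0, 0.2], [0.0, 1.0]])
B = np.array([[0.0], [0.2]])
gains, P0 = lq_backward_dp(A, B, np.eye(2), np.array([[1.0]]), np.eye(2), N=20)
print("first-stage gain:", gains[0])
```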
“…In recent years, most research on optimal control problems has dealt with either stochastic noises or uncertain noises [27,28]. However, in practice the system is disturbed not only by stochastic noises but also by uncertain noises [26].…”
Section: Introduction (mentioning, confidence: 99%)
“…The aim of every player in aggregative games is to find a Nash equilibrium to minimize its payoff function, which depends not only on its own decision variable but also on the aggregate of all players' decision variables. To achieve this, some distributed Nash equilibrium seeking algorithms have been proposed [5]-[16]. The paper [16] proposed distributed consensus-based strategies to seek the Nash equilibrium of box-constrained aggregative games.…”
Section: Introduction (mentioning, confidence: 99%)
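As a rough illustration of the aggregative-game setting described in the excerpt above, the sketch below runs synchronous projected gradient play on a toy quadratic aggregative game with box constraints. It is not the distributed consensus-based scheme of [16]: every player here reads the true aggregate directly, and the cost coefficients, step size, and iteration count are hypothetical.

```python
import numpy as np

# Toy synchronous projected gradient play for a quadratic aggregative game.
# Player i chooses x_i in a box [lb, ub] and pays
#   J_i(x_i, sigma) = 0.5*a[i]*x_i**2 + b[i]*x_i*sigma - c[i]*x_i,
# where sigma = sum_j x_j is the aggregate of all players' decisions.
# Hypothetical illustration only; not the consensus-based algorithm of [16].

rng = np.random.default_rng(0)
N = 5
a = rng.uniform(2.0, 4.0, N)       # hypothetical cost curvatures
b = rng.uniform(0.05, 0.3, N)      # hypothetical coupling weights
c = rng.uniform(0.5, 1.5, N)       # hypothetical linear cost terms
lb, ub = -1.0, 1.0                 # box constraints
step = 0.05                        # fixed step size (assumed small enough)

x = np.zeros(N)
for _ in range(2000):
    sigma = x.sum()                           # aggregate decision
    grad = a * x + b * (sigma + x) - c        # dJ_i/dx_i for each player
    x = np.clip(x - step * grad, lb, ub)      # projected gradient step

print("approximate Nash equilibrium:", x)
print("aggregate at equilibrium:", x.sum())
```

At a Nash equilibrium of this toy game no player can lower its cost by unilaterally moving inside the box, which is the fixed point the projected gradient iteration approximates; distributed schemes such as [16] replace the exact aggregate with local consensus estimates.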