In this paper, a novel adaptive dynamic programming (ADP)-based event-triggered safe control method is proposed to solve the zero-sum game problem of nonlinear safety-critical systems with safety constraints and input saturation. First, the barrier function-based system transformation, the zerosum game problem with safety constraints and input saturation is transformed into an equivalent input saturation zero-sum game problem, so as to guarantee that the system does not violate the safety constraints. Furthermore, the non-quadratic utility function is introduced into the performance function to solve input saturation. Then, a critic neural network (NN) is constructed to approximate the optimal safety value function. Subsequently, a novel event-triggered scheme is developed to determine the update instant of the control law and the disturbance law. Therefore, the proposed ADP-based event-triggered safe control method can ensure that the states of nonlinear safety-critical systems satisfy the safety constraints, while greatly reducing the amount of calculation and saving communication resources. In addition, during the learning process, the concurrent learning is used to relax the persistence of excitation (PE) condition. According to the Lyapuov theory, it is proved that the weight estimation error of the critic neural network and the states are uniformly ultimately bounded (UUB), and the Zeno behavior is excluded. Finally, a simulation example verifies the effectiveness of the proposed method.
In this paper, we investigate the constrained optimal control problem of nonlinear multi-input safety-critical systems with uncertain disturbances and time-varying safety constraints. By utilizing a barrier function transformation, together with a new disturbance-related term and a smooth safety boundary function, a nominal system-dependent multi-input barrier transformation architecture is developed to deal with the time-varying safety constraints and uncertain disturbances. Based on the obtained transformation system, the coupled Hamilton–Jacobi–Bellman (HJB) function is established to obtain the constrained Nash equilibrium solution. In addition, due to the fact that it is difficult to solve the HJB function directly, the single critic neural network (NN) is constructed to approximate the optimal performance index function of different control inputs, respectively. It is proved theoretically that, under the influence of uncertain disturbances and time-varying safety constraints, the system states and neural network parameters can be uniformly ultimately bounded (UUB) by the proposed neural network approximation method. Finally, the effectiveness of the proposed method is verified by two nonlinear simulation examples.
In this paper, the H ∞ tracking control problem of partially unknown linear systems with output constraints and disturbance is studied by reinforcement learning (RL) method. Firstly, an augmented system is established based on the reference trajectory dynamics and target system dynamics, and a special cost function is established to realize asymptotic tracking. In addition, the barrier function (BF) is used to transform the augmented system, and the output constraints is realized simultaneously by minimizing the quadratic cost function of the transformed system. Using only the obtained data and part of the system dynamics, the optimal control strategy and the worst disturbance strategy are obtained by integral reinforcement learning (IRL). Rigorous stability analysis shows that the proposed method can make the trajectories of the system states converge, and the output of the control strategy can make the tracking error asymptotically stable. Finally, a simulation example is conducted to verify the effectiveness of the proposed algorithm.INDEX TERMS Barrier function, H ∞ tracking control, Integral reinforcement learning, Output constraints
In this paper, a robust trajectory tracking control method with state constraints and uncertain disturbances on the ground of adaptive dynamic programming (ADP) is proposed for nonlinear systems. Firstly, the augmented system consists of the tracking error and the reference trajectory, and the tracking control problems with uncertain disturbances is described as the problem of robust control adjustment. In addition, considering the nominal system of the augmented system, the guaranteed cost tracking control problem is transformed into the optimal control problem by using the discount coefficient in the nominal system. A new safe Hamilton–Jacobi–Bellman (HJB) equation is proposed by combining the cost function with the control barrier function (CBF), so that the behavior of violating the safety regulations for the system states will be punished. In order to solve the new safe HJB equation, a critic neural network (NN) is used to approximate the solution of the safe HJB equation. According to the Lyapunov stability theory, in the case of state constraints and uncertain disturbances, the system states and the parameters of the critic neural network are guaranteed to be uniformly ultimately bounded (UUB). At the end of this paper, the feasibility of the proposed method is verified by a simulation example.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.