Twin-Delayed Deep Deterministic Policy Gradient for Low-Frequency Oscillation Damping Control

Cui, Qiushi; Kim, Gyoungjae; Weng, Yang

doi:10.3390/en14206695

Cited by 5 publications

(2 citation statements)

References 39 publications

“…Meanwhile, DRL is robust to the uncertainties of the system, i.e., control signals latency, environment noise, and load variations. A Deep Deterministic Policy Gradient (DDPG) based method and the twin-delayed DDPG method are proposed to overcome various communication delays during damping control [11], [12]. To solve the high dimensionality problem of power systems, Mukherjee et al [13] introduce two model reduction approaches for scalable DRL wide-area damping control.…”

Section: Introduction (mentioning)

confidence: 99%

Optimal Inter-area Oscillation Damping Control: A Transfer Deep Reinforcement Learning Approach with Switching Control Strategy

Liang, Huo, Chen et al. 2023

Preprint


Wide-area damping control for inter-area oscillation (IAO) is critical to modern power systems. The recent breakthroughs in deep learning and the broad deployment of phasor measurement units (PMU) promote the development of data-driven IAO damping controllers. In this paper, the damping control of IAOs is modeled as a Markov Decision Process (MDP) and solved by the proposed Deep Deterministic Policy Gradient (DDPG) based deep reinforcement learning (DRL) approach. The proposed approach optimizes the eigenvalue distribution of the system, which determines the IAO modes in nature. The eigenvalues are evaluated by the data-driven method called dynamic mode decomposition. For a given power system, only a subset of generators selected by participation factors needs to be controlled, alleviating the control and computing burdens. A Switching Control Strategy (SCS) is introduced to improve the transient response of IAOs. Numerical simulations of the IEEE-39 New England power grid model validate the effectiveness and advanced performance of the proposed approach as well as its robustness against communication delays. In addition, we demonstrate the transfer ability of the DRL model trained on the linearized power grid model to provide effective IAO damping control in the non-linear power grid model environment.


“…Motivated by the previous investigations, this paper employs the DRL principle to design a parameter tuning model for a two-loop autopilot using the TD3 algorithm [33], which has the following striking advantages over existing autopilot design schemes:…”

Section: Introduction (mentioning)

confidence: 99%

Two-Loop Acceleration Autopilot Design and Analysis Based on TD3 Strategy

Fan, Dou et al. 2023

International Journal of Aerospace Engineering


A two-loop acceleration autopilot is designed using the twin-delayed deep deterministic policy gradient (TD3) strategy to avoid the tedious design process of conventional tactical missile acceleration autopilots and the difficulty of meeting the performance requirements of the full flight envelope. First, a deep reinforcement learning model for the two-loop autopilot is developed. The flight state information serves as the state, the to-be-designed autopilot control parameters serve as the action, and a reward mechanism based on the stability margin index is designed. The TD3 strategy is subsequently used to offline learn the control parameters for the entire flight envelope. An autopilot control parameter fitting model that can be directly applied to the guidance loop is obtained. Finally, the obtained fitting model is combined with the impact angle constraint in the guidance system and verified online. The simulation results demonstrate that the autopilot based on the TD3 strategy can self-adjust the control parameters online based on the real-time flight state, ensuring system stability and achieving accurate acceleration command tracking.


Event-triggered reconfigurable reinforcement learning motion-planning approach for mobile robot in unknown dynamic environments

Sun, Zhang et al. 2023

Engineering Applications of Artificial Intelligence

