2021
DOI: 10.3390/en14206695
|View full text |Cite
|
Sign up to set email alerts
|

Twin-Delayed Deep Deterministic Policy Gradient for Low-Frequency Oscillation Damping Control

Abstract: Due to the large scale of power systems, latency uncertainty in communications can cause severe problems in wide-area measurement systems. To resolve this issue, a significant amount of past work focuses on using emerging technology, including machine learning methods such as Q-learning, for addressing latency issues in modern controls. Although the method can deal with the stochastic characteristics of communication latency, the Q-values can be overestimated in Q-learning methods, leading to high bias. To add… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 39 publications
0
2
0
Order By: Relevance
“…Meanwhile, DRL is robust to the uncertainties of the system, i.e., control signals latency, environment noise, and load variations. A Deep Deterministic Policy Gradient (DDPG) based method and the twin-delayed DDPG method are proposed to overcome various communication delays during damping control [11], [12]. To solve the high dimensionality problem of power systems, Mukherjee et al [13] introduce two model reduction approaches for scalable DRL wide-area damping control.…”
Section: Introductionmentioning
confidence: 99%
“…Meanwhile, DRL is robust to the uncertainties of the system, i.e., control signals latency, environment noise, and load variations. A Deep Deterministic Policy Gradient (DDPG) based method and the twin-delayed DDPG method are proposed to overcome various communication delays during damping control [11], [12]. To solve the high dimensionality problem of power systems, Mukherjee et al [13] introduce two model reduction approaches for scalable DRL wide-area damping control.…”
Section: Introductionmentioning
confidence: 99%
“…Motivated by the previous investigations, this paper employs the DRL principle to design a parameter tuning model for a two-loop autopilot using the TD3 algorithm [33], which has the following striking advantages over existing autopilot design schemes:…”
Section: Introductionmentioning
confidence: 99%