2020
DOI: 10.1109/access.2020.3041007
|View full text |Cite
|
Sign up to set email alerts
|

Real-Time Optimal Power Flow Using Twin Delayed Deep Deterministic Policy Gradient Algorithm

Abstract: The general concept of AC Optimal Power Flow (ACOPF) refers to the economic dispatch planning under electric network constraints. Moreover, each instance with the entire network must be solved in real-time (i.e., every five minutes) to ensure cost-effective power system operation while satisfying power balance equation. As the operation of power systems penetrated with intermittent renewable energy becomes more complicated, this paper proposes Deep Neural Network (DNN) and Levenberg-Marquardt backpropagation-b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 32 publications
(8 citation statements)
references
References 22 publications
0
7
0
Order By: Relevance
“…Here, we focus on integrating our proposed training method with one of the state-of-the-art DRL algorithms, Twin Delayed Deep Deterministic policy gradient algorithm (TD3) [14]. We chose TD3 as it is a recent proposed algorithm which offers good performance in many tasks [27,45,56,55,22]. However, our proposed approach can be easily merged into other DRL algorithms as well and due to limited resources, we consider these alternatives outside of the scope of this paper.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…Here, we focus on integrating our proposed training method with one of the state-of-the-art DRL algorithms, Twin Delayed Deep Deterministic policy gradient algorithm (TD3) [14]. We chose TD3 as it is a recent proposed algorithm which offers good performance in many tasks [27,45,56,55,22]. However, our proposed approach can be easily merged into other DRL algorithms as well and due to limited resources, we consider these alternatives outside of the scope of this paper.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…Cost & Load Balance Mixed/NA [197], [198], [199] Cost & Comfort [200] Other/Mixed Residential A2C [201] HVAC, Fans, WH Cost Commercial A3C [202] P2P Trading Mixed/NA TD3 [203] HVAC, Fans, WH [204] Cost & Comfort [205] Other/Mixed Residential…”
Section: Referencementioning
confidence: 99%
“…Changes in operation status of the power system's elements, such as nodal voltage, are constructed by a sequence of real or complex numbers as a discrete time-domain signal. Rather than DQN, DDPG is appropriate for real-time changes in a discrete-time domain because DQN updates the neural network using a total reward in one episode unit, while DDPG updates the reward for each step [36]. Since the current and voltage data in discrete time-domain are changed with continuous form, unlike discrete movement such as top-bottom-left-right, the RL components can operate over continuous action spaces by using DDPG.…”
Section: Ddpg Algorithm For Controller Designmentioning
confidence: 99%
“…The observation of RL system plays a vital role since it is a core of the agent, which enables the agent to receive the results from the action and the environments changes [36]. In this paper, the observations represent that 𝑠 = [π‘‰π‘Žπ‘ π‘Ÿπ‘’π‘“ , π‘‰π‘Žπ‘, π‘‰π‘Žπ‘ 𝑑𝑖𝑓 ,…”
Section: The Proposed Approach To Control D-statcommentioning
confidence: 99%