2022
DOI: 10.1109/access.2022.3224460
|View full text |Cite
|
Sign up to set email alerts
|

Model-Based Approach on Multi-Agent Deep Reinforcement Learning With Multiple Clusters for Peer-To-Peer Energy Trading

Abstract: Peer-to-peer (P2P) energy trading system has the ability to completely revolutionize the current household energy system by sharing energy among residents. As the number of customers employing distributed energy resources (DERs) such as solar rooftops increase, innovation in the double auction market (DA) system is becoming more significant. In this paper, a novel model-based asynchronous advantage actor-centralized-critic with communication (MB-A3C3) approach is carried out. The model is conducted on a large … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 68 publications
0
1
0
Order By: Relevance
“…where, Pr n t represents the profit of the n the prosumer. E n selling,t and γ selling t are sold energy to the grid in kWh and energy selling price with TOU pricing scheme [85], respectively. The primary objectives of the electric grid are to supply energy to the prosumers during insufficient energy generation in DERs and insufficient energy transactions in P2P energy trading.…”
Section: ) Electric Grid (Utility)mentioning
confidence: 99%
“…where, Pr n t represents the profit of the n the prosumer. E n selling,t and γ selling t are sold energy to the grid in kWh and energy selling price with TOU pricing scheme [85], respectively. The primary objectives of the electric grid are to supply energy to the prosumers during insufficient energy generation in DERs and insufficient energy transactions in P2P energy trading.…”
Section: ) Electric Grid (Utility)mentioning
confidence: 99%
“…State in the context of DL in this study is the condition of the fish pond which consists of temperature conditions, pH conditions, and turbidity conditions. For example, the state temperature is 25 • C, the turbidity is 24 NTU, and the pH is 7, which are denoted as S: [7,24,25]. Then, with these conditions, the action is in the form of a declaration of the condition of the fish pool in a "normal" state, which is denoted as A: normal, which then triggers the system to give a reward (R) to become the next condition R t+1 and S t+1 .…”
Section: Deep Reinforcement Learning (Drl)mentioning
confidence: 99%
“…The structure algorithm uses a deep Q network (DQN), and the final results show that the agent can solve problems by controlling the load frequency and determining the appropriate power and frequency requirements. In contrast, Sanayha and Vateekul [25] developed a model-based agent on DRL for peer-to-peer energy trading cases. Agent performance is based on an environment that has been modeled with multivariate-long short-term memory (multivariate-LSTM) with time-series data.…”
Section: Introductionmentioning
confidence: 99%
“…Despite the recent increase in solar energy use in both the commercial and residential sectors, the intermittent behavior of solar energy's radiation intensity continues to pose a significant challenge to power grid operations. Most renewable energy sources suffer from unpredictability or an intermittent nature [10][11][12]. Due to the interplay between radiation and matter, solar irradiance is the radiative energy from the Sun that eventually reaches the photovoltaic (PV) cells [13,14].…”
Section: Introductionmentioning
confidence: 99%