2018
DOI: 10.1016/j.energy.2018.04.042
|View full text |Cite
|
Sign up to set email alerts
|

Smart generation control based on multi-agent reinforcement learning with the idea of the time tunnel

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
36
0
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 90 publications
(41 citation statements)
references
References 17 publications
0
36
0
1
Order By: Relevance
“…In recent years, scholars have attempted to combine TL with RL to form TRL; to combine TL with heuristic SI algorithms such as ABC to form TBO; to combine TL with DRL to form a DTRL algorithm; to combine DL with typical RL algorithms such as Q‐learning to form a new DQL algorithm; to combine DL with another typical RL algorithm, the ADP, to form a new D‐ADP algorithm; and to combine TL with extreme learning to form a new extreme TL algorithm. Among these, the theoretical framework for a new D‐ADP algorithm proposed in Yin et al contains three networks, prediction network, evaluation network, and execution network, which are improved by adopting the DQNs.…”
Section: Hybrid Learningmentioning
confidence: 99%
See 3 more Smart Citations
“…In recent years, scholars have attempted to combine TL with RL to form TRL; to combine TL with heuristic SI algorithms such as ABC to form TBO; to combine TL with DRL to form a DTRL algorithm; to combine DL with typical RL algorithms such as Q‐learning to form a new DQL algorithm; to combine DL with another typical RL algorithm, the ADP, to form a new D‐ADP algorithm; and to combine TL with extreme learning to form a new extreme TL algorithm. Among these, the theoretical framework for a new D‐ADP algorithm proposed in Yin et al contains three networks, prediction network, evaluation network, and execution network, which are improved by adopting the DQNs.…”
Section: Hybrid Learningmentioning
confidence: 99%
“…Among these, the theoretical framework for a new D‐ADP algorithm proposed in Yin et al contains three networks, prediction network, evaluation network, and execution network, which are improved by adopting the DQNs. These above‐mentioned HL algorithms have been preliminarily used to good effect in the SG and EI fields …”
Section: Hybrid Learningmentioning
confidence: 99%
See 2 more Smart Citations
“…It can be found that when a fault occurred, both of its upstream LPs and downstream LPs will be affected to some degree conditionally. The degree relies on the operations state of the involving protective devices and AS [24,25]. In other words, considering a particular component, the effect of its downstream failures on its upstream LPs will more or less depend on its own operation state.…”
Section: General Model For Distribution Systemmentioning
confidence: 99%