2020
DOI: 10.3390/w12102688
|View full text |Cite
|
Sign up to set email alerts
|

Aggregation–Decomposition-Based Multi-Agent Reinforcement Learning for Multi-Reservoir Operations Optimization

Abstract: Stochastic dynamic programming (SDP) is a widely-used method for reservoir operations optimization under uncertainty but suffers from the dual curses of dimensionality and modeling. Reinforcement learning (RL), a simulation-based stochastic optimization approach, can nullify the curse of modeling that arises from the need for calculating a very large transition probability matrix. RL mitigates the curse of the dimensionality problem, but cannot solve it completely as it remains computationally intensive in com… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 50 publications
0
3
0
Order By: Relevance
“…Stojkovic and Simonovic [16], Hooshyar et al [17], and Aivazidou and Tsolakis [18] address various issues in managing complex water problems. Stojkovic and Simonovic study the impact of climate change on the management of a complex multipurpose water system and present a set of steps of the climate change impact analysis process.…”
Section: The Special Issue Organization Of Contributionsmentioning
confidence: 99%
“…Stojkovic and Simonovic [16], Hooshyar et al [17], and Aivazidou and Tsolakis [18] address various issues in managing complex water problems. Stojkovic and Simonovic study the impact of climate change on the management of a complex multipurpose water system and present a set of steps of the climate change impact analysis process.…”
Section: The Special Issue Organization Of Contributionsmentioning
confidence: 99%
“…Urban drainage systems (UDSs) that convey stormwater away from cities can also be managed with nonstructural measures, such as optimizing operation policies with intelligent control approaches [6][7][8][9][10][11]. Computationally effective stochastic optimization models have been successfully implemented in optimizing reservoir system operations, e.g., [12,13], which are yet to be applied in UDSs. In this area of work, pre-emptive prediction of floods using enhanced rainfall forecasting algorithms can afford us additional possibilities to optimally employ flood control devices.…”
Section: Introductionmentioning
confidence: 99%
“…However, SDP has been referred to as having three curses (Giuliani et al 2016, Giuliani et al 2021: (I) multiple objectives-an inability to explicitly account for multiobjective tradeoffs, so a weighted sum method is often used (Soleimani et al 2016, Ortiz-Partida et al 2019, Celeste et al 2021; (ii) dimensionality-discretisation of the system state significantly increases computational cost when a large system is considered (Sahu, Mclaughlin 2018, Dobson et al 2019, Hooshyar et al 2020; and (iii) modelling-all variables need to be described in the simulation model, thus specific model and problem formulations are needed (Mortazavi et al 2012, Soleimani et al 2016, Sahu, Mclaughlin 2018, Dobson et al 2019, Ortiz-Partida et al 2019, Hooshyar et al 2020, Celeste et al 2021, which restrict its real-world applications.…”
Section: Introductionmentioning
confidence: 99%