Residential Demand Response of Thermostatically Controlled Loads Using Batch Reinforcement Learning

Ruelens, Frederik; Claessens, Bert; Vandael, Stijn; Schutter, Bart De; Babuška, Robert; Belmans, Ronnie

doi:10.1109/tsg.2016.2517211

Cited by 272 publications

(144 citation statements)

References 31 publications

Supporting

Mentioning

142

Contrasting

Unclassified

Order By: Relevance

“…The usual roles of domain knowledge are: · Making the computations necessary for solving the problem more time-or space-efficient, · Guiding the solution process, ( (2009)). Work presented in Ruelens et al (2016) extended the fitted Q iteration algorithm in order to take advantage of domain-specific knowledge (in particular case a forecast of the exogenous data is provided to design demand response control). Q(λ) with eligibility traces is used to take advantage of domain-specific knowledge in Yu et al (2011).…”

Section: Past and Recent Considerations Of Rl For Electric Power Systmentioning

confidence: 99%

“…Power system components considered include: dynamic brake Ernst et al (2004); Glavic (2005), thyristor controlled series capacitor Ernst et al (2004Ernst et al ( , 2009, quadrature booster Li and Wu (1999), synchronous generators (all AGC related references), individual or aggregated loads Vandael et al (2015); Ruelens et al (2016), etc. If used as a multi-agent system, then additional state variables must be introduced to ensure convergence of these essentially distributed computation schemes, and an adapted variant of standard RL methods is often used (for example correlated equilibrium Q(λ) Yu et al (2012a)).…”

Section: Past and Recent Considerations Of Rl For Electric Power Systmentioning

confidence: 99%

See 1 more Smart Citation

Reinforcement Learning for Electric Power System Decision and Control: Past Considerations and Perspectives

2017

View full text Add to dashboard Cite

Abstract:In this paper, we review past (including very recent) research considerations in using reinforcement learning (RL) to solve electric power system decision and control problems. The RL considerations are reviewed in terms of specific electric power system problems, type of control and RL method used. We also provide observations about past considerations based on a comprehensive review of available publications. The review reveals the RL is considered as viable solutions to many decision and control problems across different time scales and electric power system states. Furthermore, we analyse the perspectives of RL approaches in light of the emergence of new-generation, communications, and instrumentation technologies currently in use, or available for future use, in power systems. The perspectives are also analysed in terms of recent breakthroughs in RL algorithms (Safe RL, Deep RL and path integral control for RL) and other, not previously considered, problems for RL considerations (most notably restorative, emergency controls together with so-called system integrity protection schemes, fusion with existing robust controls, and combining preventive and emergency control).

show abstract

Section: Past and Recent Considerations Of Rl For Electric Power Systmentioning

confidence: 99%

Section: Past and Recent Considerations Of Rl For Electric Power Systmentioning

confidence: 99%

Reinforcement Learning for Electric Power System Decision and Control: Past Considerations and Perspectives

2017

View full text Add to dashboard Cite

show abstract

“…This results in a slow convergence rate of the Q-learning algorithm to an optimal policy [21]; more observations are needed to construct a control policy. In batch RL techniques (off-line RL) [22,23], a controller estimates a control policy based on a batch of its past experiences.…”

Section: Introductionmentioning

confidence: 99%

“…The ability of batch RL to reuse their past experiences makes them converge faster than online RL methods like Q-learning and SARSA. Batch RL has been used for demand response in [21,[24][25][26]. Vandael et al [27] used a batch RL technique to find a day-ahead consumption plan of a cluster of electric vehicles.…”

Section: Introductionmentioning

confidence: 99%

Battery Energy Management in a Microgrid Using Batch Reinforcement Learning

et al. 2017

Self Cite

View full text Add to dashboard Cite

Abstract:Motivated by recent developments in batch Reinforcement Learning (RL), this paper contributes to the application of batch RL in energy management in microgrids. We tackle the challenge of finding a closed-loop control policy to optimally schedule the operation of a storage device, in order to maximize self-consumption of local photovoltaic production in a microgrid. In this work, the fitted Q-iteration algorithm, a standard batch RL technique, is used by an RL agent to construct a control policy. The proposed method is data-driven and uses a state-action value function to find an optimal scheduling plan for a battery. The battery's charge and discharge efficiencies, and the nonlinearity in the microgrid due to the inverter's efficiency are taken into account. The proposed approach has been tested by simulation in a residential setting using data from Belgian residential consumers. The developed framework is benchmarked with a model-based technique, and the simulation results show a performance gap of 19%. The simulation results provide insight for developing optimal policies in more realistically-scaled and interconnected microgrids and for including uncertainties in generation and consumption for which white-box models become inaccurate and/or infeasible.

show abstract

“…The behaviour of these representative devices allows to capture the behaviour of the entire set of the aggregated ones (e.g., [1], [2], [3], [4]). The second approach is model-free since it infers the behaviour of the distributed devices from the interaction between them and a central unit (i.e., the aggregator) (e.g., [5], [6]). Usually, these approaches adopt data-driven learning techniques.…”

Section: Introductionmentioning

confidence: 99%

Aggregation of power capabilities of heterogeneous resources for real-time control of power grids

Bernstein

Boudec

Paolone

et al. 2016

2016 Power Systems Computation Conference (PSCC)

View full text Add to dashboard Cite

Abstract-Aggregation of electric resources is a fundamental function for the operation of power grids at different time scales. In the context of a recently proposed framework for the real-time control of microgrids with explicit power setpoints, we define and formally specify an aggregation method that explicitly accounts for delays and message asynchronism. The method allows to abstract the details of resources using high-level concepts that are device and grid-independent. We demonstrate the application of the method to a Cigre benchmark with heterogenous and lowinertia resources.

show abstract

Residential Demand Response of Thermostatically Controlled Loads Using Batch Reinforcement Learning

Cited by 272 publications

References 31 publications

Reinforcement Learning for Electric Power System Decision and Control: Past Considerations and Perspectives

Reinforcement Learning for Electric Power System Decision and Control: Past Considerations and Perspectives

Battery Energy Management in a Microgrid Using Batch Reinforcement Learning

Aggregation of power capabilities of heterogeneous resources for real-time control of power grids

Contact Info

Product

Resources

About