2019
DOI: 10.1016/j.apenergy.2019.01.140

Multi-agent reinforcement learning for modeling and control of thermostatically controlled loads

Abstract: Increasing energy efficiency of thermostatically controlled loads has the potential to substantially reduce domestic energy demand. However, optimizing the efficiency of thermostatically controlled loads requires either an existing model or detailed data from sensors to learn it online. Often, neither is practical because of real-world constraints. In this paper, we demonstrate that this problem can benefit greatly from multi-agent learning and collaboration. Starting with no thermostatically controlled load s…

Cited by 69 publications (28 citation statements)
References 34 publications
“…Chen et al. (2018) [66] benchmarked their controller against a "rule-based heuristic" control strategy. Kazmi et al. (2019) [67] used a rule-based dead-band controller as the benchmark. Ahn and Park (2019) [68] claimed their controller saved 15.7% energy compared with a fixed, pre-determined schedule for the OA damper position and temperature setpoint.…”
Section: Performance Evaluation
confidence: 99%
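To make the benchmark concrete, below is a minimal sketch of the kind of rule-based dead-band (hysteresis) controller cited above as a baseline in [67]. The function name, setpoint, dead-band width, and the toy dynamics in the usage loop are illustrative assumptions, not details from any of the cited papers.

```python
# A minimal sketch of a rule-based dead-band (hysteresis) controller.
# All names and parameter values are illustrative assumptions.

def deadband_control(temperature: float, heating_on: bool,
                     setpoint: float = 55.0, deadband: float = 5.0) -> bool:
    """Return the next on/off state for the heating element.

    The element switches on when the temperature falls below the lower
    bound of the dead band and off when it rises above the upper bound;
    inside the band, the previous state is kept (hysteresis).
    """
    lower = setpoint - deadband / 2.0
    upper = setpoint + deadband / 2.0
    if temperature < lower:
        return True       # too cold: switch heating on
    if temperature > upper:
        return False      # warm enough: switch heating off
    return heating_on     # inside the band: keep the current state


# Usage example: drive a crude first-order thermal model for a few steps.
temp, on = 50.0, False
for step in range(5):
    on = deadband_control(temp, on)
    temp += 2.0 if on else -1.0   # toy heating/cooling dynamics
    print(f"step {step}: temp={temp:.1f}, heating={'on' if on else 'off'}")
```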
“…The Boltzmann probability distribution method [22] is used in this paper to describe the transition probability of the state in the evolutionary game. The Boltzmann method selects actions probabilistically; the probability of selecting action $a_i$ in state $s$ is

$$p(a_i) = \frac{e^{Q(s, a_i)/\lambda}}{\sum_{a \in A} e^{Q(s, a)/\lambda}} \tag{16}$$

where $\lambda$ is an exponential function of the iteration period $k$ in the evolutionary game. When $\lambda$ increases, the agent's decision randomness also increases; when $\lambda$ decreases, the decision randomness decreases.…”
Section: Regulation and Control Strategy On Decision Layer Based On Q
confidence: 99%
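As a concrete illustration of Eq. (16), here is a minimal sketch of Boltzmann (softmax) action selection. The Q-values, temperature values, and function name are illustrative assumptions; subtracting the maximum Q-value before exponentiating is a standard numerical-stability trick that leaves the distribution unchanged.

```python
# A minimal sketch of Boltzmann (softmax) action selection as in Eq. (16):
# each action is drawn with probability proportional to exp(Q(s, a) / lambda).
import math
import random

def boltzmann_select(q_values: list[float], lam: float) -> int:
    """Sample an action index with probability e^{Q/lam} / sum_a e^{Q_a/lam}."""
    q_max = max(q_values)
    weights = [math.exp((q - q_max) / lam) for q in q_values]
    total = sum(weights)
    probs = [w / total for w in weights]
    return random.choices(range(len(q_values)), weights=probs, k=1)[0]

# Usage: a high lambda gives near-uniform (exploratory) choices, a low
# lambda gives near-greedy choices, matching the quoted description.
q = [1.0, 2.0, 0.5]
for lam in (10.0, 0.1):
    picks = [boltzmann_select(q, lam) for _ in range(1000)]
    freq = [picks.count(a) / len(picks) for a in range(len(q))]
    print(f"lambda={lam}: selection frequencies {freq}")
```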
“…Moreover, in this paper a MARL method is used for sequential decision making in multi-agent environments, which traditional SARL struggles to handle. MARL has been adopted in several fields, such as the vehicle routing problem [24] and thermostatically controlled load modeling [25]. The most widespread form is equilibrium-based MARL, whose framework accords with Markov games: the learning process is evaluated over all agents' joint behaviors, and the equilibrium concept from game theory is introduced to denote the optimal joint action [26][27][28][29][30].…”
Section: Related Work
confidence: 99%
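To make the equilibrium-based MARL framework above concrete, here is a minimal sketch of joint-action learning in a two-agent, single-state (matrix) Markov game: a shared Q-table is indexed by the joint action, and the greedy joint action plays the role of the game-theoretic equilibrium. The payoff function, learning rate, and exploration rate are illustrative assumptions, not details from the cited papers.

```python
# A minimal sketch of joint-action Q-learning in a two-agent, single-state
# (matrix) Markov game. The Q-table is indexed by the *joint* action, and the
# agents coordinate on the greedy joint action, a simplified stand-in for the
# equilibrium concept used by equilibrium-based MARL.
import random

N_ACTIONS = 2                       # actions per agent
ALPHA, EPSILON = 0.1, 0.2           # learning rate, exploration rate

# Common-payoff "coordination game": both agents get 1 only if they match.
def reward(a0: int, a1: int) -> float:
    return 1.0 if a0 == a1 else 0.0

# Q over joint actions: Q[(a0, a1)].
Q = {(a0, a1): 0.0 for a0 in range(N_ACTIONS) for a1 in range(N_ACTIONS)}

def greedy_joint_action() -> tuple[int, int]:
    return max(Q, key=Q.get)

for episode in range(2000):
    if random.random() < EPSILON:   # joint exploration
        joint = (random.randrange(N_ACTIONS), random.randrange(N_ACTIONS))
    else:
        joint = greedy_joint_action()
    r = reward(*joint)
    # One-state game, so there is no bootstrapped next-state term.
    Q[joint] += ALPHA * (r - Q[joint])

print("learned joint-action values:", Q)
print("coordinated (equilibrium-like) joint action:", greedy_joint_action())
```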