“…They use a preliminary learning phase, which is conducted off-line on a simulation model to guide the controller before it is implemented into the actual environment. A linear reinforcement learning controller (LRLC) has been introduced in [24], with the aim of achieving energy savings, high comfort and indoor air quality. After a simulated period of four years, LRLC only manages to perform close the ON/OFF and fuzzy-logic controllers.…”