“…Recently, a growing body of research has sought to learn optimal treatment strategies for critically ill patients, and for septic patients in particular, using Reinforcement Learning (RL) methods (Komorowski et al, 2018; Chen et al, 2019; Raghu et al, 2017; Li et al, 2019; Peng et al, 2018; Festor et al, 2021; Nanayakkara et al, 2022b). Given the enormous mortality, morbidity and economic burden of sepsis (Liu et al, 2014; Rhee et al, 2017; Paoli et al, 2018), the ambiguity regarding optimal treatment strategies, and the lack of accepted treatment guidelines (Marik, 2015; Jarczak et al, 2021), such attempts are certainly justified.…”