“…Since there is no interaction between the agent and the physical model of the ADN during training, this method can achieve physical model-free control. However, it requires a large amount of training data, and distribution mismatch may degrade the algorithm's performance even when sufficiently large and diverse data are given [33]. Ref.…”