“…systems with unknown models). Applying RL to wind farm operations has become an active research area, and recent studies have demonstrated its feasibility [23], [24], [25], [26], [27], [28], [29], [30], [31]. For example, [23] introduced a model-free approach to wind farm power optimization based on the deep Q-network algorithm [32].…”