“…SAC is a model-free state-of-the-art RL algorithm (Haarnoja et al, 2018a,b). DQN (Mnih et al, 2013) is a well-known discrete action-space model-free method used in several previous works on RL for antenna down-tilt control (Vannella et al, 2021;Bouton et al, 2021;Aumayr et al, 2021). For DQN, the action space is changed to update the tilt by increments of {−1°, 0°, 1°}.…”