<p>The paper proposes a Reinforcement Learning based agent that controls three KPIs of the mobile network to reach a maximized sum throughput of the newtork, such that the number of uncovered users is kept minimum and the energy consumed due to the MIMO technology is kept minimum as well. </p>
<p>The environment is a simulated mobile network using NS3. </p>