2020
DOI: 10.48550/arxiv.2005.10577
Preprint

Off-policy Learning for Remote Electrical Tilt Optimization

Abstract: We address the problem of Remote Electrical Tilt (RET) optimization using off-policy Contextual Multi-Armed-Bandit (CMAB) techniques. The goal in RET optimization is to control the orientation of the vertical tilt angle of the antenna to optimize Key Performance Indicators (KPIs) representing the Quality of Service (QoS) perceived by the users in cellular networks. Learning an improved tilt update policy is hard. On the one hand, coming up with a new policy in an online manner in a real network requires explor…
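The abstract describes learning a tilt-update policy offline, from data logged under an existing policy, rather than by risky online exploration. A standard building block for this kind of off-policy CMAB learning is the inverse propensity scoring (IPS) estimator, which reweights logged rewards by the ratio of target-policy to logging-policy action probabilities. The sketch below is illustrative only, not the authors' method: the three-action tilt space, the synthetic logged data, and the softmax target policy are all assumptions.

    import numpy as np

    rng = np.random.default_rng(0)

    # Hypothetical action space: down-tilt, keep, up-tilt (illustrative).
    N_ACTIONS = 3

    def ips_value(contexts, actions, rewards, logging_probs, target_probs_fn):
        """IPS estimate of a target policy's expected reward from logged data.

        Each logged reward is reweighted by pi_target(a|x) / pi_logging(a|x),
        making the sample mean an unbiased estimate of the target's value.
        """
        w = np.array([target_probs_fn(x)[a] for x, a in zip(contexts, actions)])
        return float(np.mean(w / logging_probs * rewards))

    # Synthetic log from a uniform-random logging policy (assumption).
    n, d = 1000, 4
    contexts = rng.normal(size=(n, d))            # per-cell KPI features (made up)
    actions = rng.integers(0, N_ACTIONS, size=n)  # tilt updates actually taken
    logging_probs = np.full(n, 1.0 / N_ACTIONS)   # propensity of each logged action
    rewards = rng.normal(size=n) + (actions == 1) # toy KPI gain favoring "keep"

    def target_probs(x):
        """A hypothetical softmax target policy over the three tilt updates."""
        logits = np.array([x[0], x[1], -abs(x[2])])
        e = np.exp(logits - logits.max())
        return e / e.sum()

    print("IPS value estimate:",
          ips_value(contexts, actions, rewards, logging_probs, target_probs))

A new tilt policy would then be selected to maximize such an estimate (in practice with a variance-reduced variant, e.g. self-normalized IPS or doubly robust estimation), which allows candidate policies to be screened offline before being deployed on a live network.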

Cited by 2 publications (4 citation statements, all published in 2021). References 14 publications (19 reference statements).
“…When using reinforcement learning, the problem of coordination still arises. Treating each antenna as an independent learning agent has been used in the past to address the problem of optimizing mobile networks [6], [8], [9], [11], [12], but this fails to capture phenomena like interference. Learning algorithms leveraging coordination can use a centralized controller [7], [13], which does not scale to a large number of agents.…”
Section: Related Work (mentioning, confidence: 99%)
“…Existing approaches for network optimization rely on hand-engineered strategies that are suboptimal and hard to scale [2]–[4]. Methods relying on mathematical models [5] or reinforcement learning (RL) are also used for network optimization [6]–[11]; they are more robust and principled.…”
Section: Introduction (mentioning, confidence: 99%)
“…However, it is known that the large-scale exploration performed by RL algorithms can sometimes take the system to unsafe states [7]. In the problem of RET optimization, RL has proven to be an effective framework for KPI optimization due to its self-learning capabilities and adaptivity to potential environment changes [16]. To address the safety problem (i.e., to guarantee that the desired KPIs remain within specified bounds), the authors of [16] proposed a statistical approach to empirically evaluate RET optimization under different baseline policies and in different worst-case scenarios.…”
Section: Introduction (mentioning, confidence: 99%)
“…However, they assume that the abstraction of the system dynamics into an MDP is given, which is challenging in the network applications that this demonstration refers to. As mentioned previously, the authors of [16] address the safe RET optimization problem, but their approach relies on statistical guarantees and cannot handle the general LTL specifications that we treat in this manuscript.…”
Section: Introduction (mentioning, confidence: 99%)