2022
DOI: 10.48550/arxiv.2202.08194
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep Contextual Bandits for Orchestrating Multi-User MISO Systems with Multiple RISs

Abstract: The emergent technology of Reconfigurable Intelligent Surfaces (RISs) has the potential to transform wireless environments into controllable systems, through programmable propagation of information-bearing signals. Techniques stemming from the field of Deep Reinforcement Learning (DRL) have recently gained popularity in maximizing the sum-rate performance in multi-user communication systems empowered by RISs. Such approaches are commonly based on Markov Decision Processes (MDPs). In this paper, we instead inve… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(11 citation statements)
references
References 17 publications
0
11
0
Order By: Relevance
“…In this subsection, we analyze two different strategies to solve similar optimization problems to understand the reasons that sustain our proposals. We focus on [36] and [37] which formulate the joint optimization of the IRS and precoder matrices as an RL and CB problem, respectively. Table I shows some differences and similarities between these two approaches.…”
Section: B Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…In this subsection, we analyze two different strategies to solve similar optimization problems to understand the reasons that sustain our proposals. We focus on [36] and [37] which formulate the joint optimization of the IRS and precoder matrices as an RL and CB problem, respectively. Table I shows some differences and similarities between these two approaches.…”
Section: B Related Workmentioning
confidence: 99%
“…On the other hand, [37] considers a CB-based approach to the joint optimization of the precoders and the IRS matrices in the downlink of a multi-IRS MU-MISO system. As in [36], the instantaneous sum-rate value is used as the reward but the state vectors are only composed of the channel coefficients for the current channel realization which is in accordance with the CB formulation.…”
Section: B Related Workmentioning
confidence: 99%
See 3 more Smart Citations