2022
DOI: 10.48550/arxiv.2205.13170
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

Abstract: We study distributed contextual linear bandits with stochastic contexts, where N agents act cooperatively to solve a linear bandit-optimization problem with d-dimensional features. For this problem, we propose a distributed batch elimination version of the LinUCB algorithm, DisBE-LUCB, where the agents share information among each other through a central server. We prove that over T rounds (N T actions in total) the communication cost of DisBE-LUCB is only Õ(dN ) and its regret is at most Õ( √ dN T ), which is… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 17 publications
(29 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?