2021
DOI: 10.1609/aaai.v35i8.16820
|View full text |Cite
|
Sign up to set email alerts
|

Decentralized Multi-Agent Linear Bandits with Safety Constraints

Abstract: We study decentralized stochastic linear bandits, where a network of N agents acts cooperatively to efficiently solve a linear bandit-optimization problem over a d-dimensional space. For this problem, we propose DLUCB: a fully decentralized algorithm that minimizes the cumulative regret over the entire network. At each round of the algorithm each agent chooses its actions following an upper confidence bound (UCB) strategy and agents share information with their immediate neighbors through a carefully designed… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 18 publications
(33 reference statements)
0
1
0
Order By: Relevance
“…Multi-agent bandit problem has gained great attention in the past few years [30,3,12]. The proposed strategies for multi-agent problems are dividable into two main categories.…”
Section: Related Workmentioning
confidence: 99%
“…Multi-agent bandit problem has gained great attention in the past few years [30,3,12]. The proposed strategies for multi-agent problems are dividable into two main categories.…”
Section: Related Workmentioning
confidence: 99%