2021
DOI: 10.48550/arxiv.2109.05700
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Exploiting Heterogeneity in Robust Federated Best-Arm Identification

Abstract: We study a federated variant of the best-arm identification problem in stochastic multiarmed bandits: a set of clients, each of whom can sample only a subset of the arms, collaborate via a server to identify the best arm (i.e., the arm with the highest mean reward) with prescribed confidence. For this problem, we propose Fed-SEL, a simple communication-efficient algorithm that builds on successive elimination techniques and involves local sampling steps at the clients. To study the performance of Fed-SEL, we i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
10
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(11 citation statements)
references
References 60 publications
1
10
0
Order By: Relevance
“…A closely related problem to ours is considered in [69], where the authors study the best arm identification in a federated setting and propose the "Federated Successive Elimination" (Fed-SEL) algorithm. [69] also considers the case where some of the participants in the federation are Byzantine clients, and it provides the robust version of Fed-SEL. [70] studies a multi-agent MAB problem where the agents collaborate to minimize their cumulative regrets, with a subset of agents being malicious.…”
Section: E Byzantine Resilient Fmabmentioning
confidence: 99%
See 1 more Smart Citation
“…A closely related problem to ours is considered in [69], where the authors study the best arm identification in a federated setting and propose the "Federated Successive Elimination" (Fed-SEL) algorithm. [69] also considers the case where some of the participants in the federation are Byzantine clients, and it provides the robust version of Fed-SEL. [70] studies a multi-agent MAB problem where the agents collaborate to minimize their cumulative regrets, with a subset of agents being malicious.…”
Section: E Byzantine Resilient Fmabmentioning
confidence: 99%
“…[70] studies a multi-agent MAB problem where the agents collaborate to minimize their cumulative regrets, with a subset of agents being malicious. However, both [69] and [70] assume that the individual arm outcomes are i.i.d. across clients, that is, they do not consider the possible model heterogeneity arising from the client sampling discussed in Section II-D.…”
Section: E Byzantine Resilient Fmabmentioning
confidence: 99%
“…In [6], the honest agents face a non-stochastic (i.e., adversarial) bandit [4] and communicate at every time step, in contrast to the stochastic bandit and limited communication of our work. The authors of [47] consider the objective of best arm identification [2] instead of cumulative regret. Most of their paper involves a different communication model where the agents/clients collaborate via a central server; Section 6 studies a "peer-to-peer" model which is closer to ours but requires additional assumptions on the number of malicious neighbors.…”
Section: Other Related Workmentioning
confidence: 99%
“…The main focus in these papers is the design of coordination protocols among the agents that balance communication-efficiency with performance. A few very recent works [56][57][58][59] also look at the effect of attacks, but for the simpler unstructured multi-armed bandit problem [60]. Accounting for adversarial agents in the structured linear bandit setting we consider here requires significantly different ideas that we develop in this paper.…”
Section: Related Workmentioning
confidence: 99%