2019 IEEE International Symposium on Information Theory (ISIT) 2019
DOI: 10.1109/isit.2019.8849327
|View full text |Cite
|
Sign up to set email alerts
|

Overlapping Multi-Bandit Best Arm Identification

Abstract: In the multi-armed bandit literature, the multi-bandit best-arm identification problem consists of determining each best arm in a number of disjoint groups of arms, with as few total arm pulls as possible. In this paper, we introduce a variant of the multi-bandit problem with overlapping groups, and present two algorithms for this problem based on successive elimination and lower/upper confidence bounds (LUCB). We bound the number of total arm pulls required for high-probability best-arm identification in ever… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 11 publications
(9 citation statements)
references
References 12 publications
(20 reference statements)
0
9
0
Order By: Relevance
“…A notable grouped best-arm identification problem was studied in (Gabillon et al 2011;Bubeck, Wang, and Viswanathan 2013), where the arms are partitioned into disjoint groups, and the goal is to find the best arm in each group. A generalization of this problem to the case of overlapping groups was provided in (Scarlett, Bogunovic, and Cevher 2019). Another notable setting in which multiple arms are returned is that of subset selection, where one seeks to find a subset of k arms attaining the highest mean rewards (Kalyanakrishnan et al 2012;Kaufmann and Kalyanakrishnan 2013;Kaufmann, Cappé, and Garivier 2016).…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…A notable grouped best-arm identification problem was studied in (Gabillon et al 2011;Bubeck, Wang, and Viswanathan 2013), where the arms are partitioned into disjoint groups, and the goal is to find the best arm in each group. A generalization of this problem to the case of overlapping groups was provided in (Scarlett, Bogunovic, and Cevher 2019). Another notable setting in which multiple arms are returned is that of subset selection, where one seeks to find a subset of k arms attaining the highest mean rewards (Kalyanakrishnan et al 2012;Kaufmann and Kalyanakrishnan 2013;Kaufmann, Cappé, and Garivier 2016).…”
Section: Related Workmentioning
confidence: 99%
“…In addition, the results of (Jedor, Perchet, and Louedec 2019) are based on the arm means satisfying certain partial ordering assumptions between the groups (e.g., all arms in a better group beat all arms in a worse group), whereas we consider general instances without such restrictions. See also (Bouneffouf et al 2019;Ban and He 2021;Singh et al 2020) and the references therein for other MAB settings with a clustering structure.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…To achieve this, we set µ 1 j " µ j `p1 `αqpµ jworstpG ˚q ´µj q for arbitrarily small α ą 0. For any arms in G with mean exactly µ jworstpG ˚q , we can perform an arbitrarily small perturbation similar to Appendix A of (Scarlett, Bogunovic, and Cevher 2019). As a result, G 1 is no longer the best group in the new instance.…”
Section: C3 Proof Of Thmmentioning
confidence: 99%
“…A special case of this problem was studied in [21], where η is the mean parameter and the distributions are sub-Gaussian, i.e., E[e sX ] ≤ e σ 2 s 2 2 , ∀s ∈ R with σ ≤ 1/2.…”
Section: B Examplesmentioning
confidence: 99%