1989
DOI: 10.1214/aos/1176347144
|View full text |Cite
|
Sign up to set email alerts
|

Irreversible Adaptive Allocation Rules

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
10
0

Year Published

2006
2006
2019
2019

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 9 publications
0
10
0
Order By: Relevance
“…On the other hand, Lai [17] and Chang and Lai [6] have proposed simple index-type adaptive allocation rules that are asymptotically optimal in both the Bayes and frequentist senses either as N → ∞ (under uniform discounting) or as β → 1 (under geometric discounting). Brezzi and Lai [5] have recently refined and modified these adaptive allocation rules in the presence of switching costs, while Hu and Wei [15] have constructed asymptotically optimal adaptive allocation rules subject to the irreversibility constraint. Various applications of the theory of multi-armed bandits can be found in sequential clinical trials, market pricing, labor markets and search problems; see e.g.…”
Section: Introductionmentioning
confidence: 99%
“…On the other hand, Lai [17] and Chang and Lai [6] have proposed simple index-type adaptive allocation rules that are asymptotically optimal in both the Bayes and frequentist senses either as N → ∞ (under uniform discounting) or as β → 1 (under geometric discounting). Brezzi and Lai [5] have recently refined and modified these adaptive allocation rules in the presence of switching costs, while Hu and Wei [15] have constructed asymptotically optimal adaptive allocation rules subject to the irreversibility constraint. Various applications of the theory of multi-armed bandits can be found in sequential clinical trials, market pricing, labor markets and search problems; see e.g.…”
Section: Introductionmentioning
confidence: 99%
“…The proof can be found in [1]. We will discuss the relation of the lower bound with those in [6,7] and [3]. Theorem 1.…”
Section: The Regret Lower Boundmentioning
confidence: 99%
“…Here we further explore optimality properties of the proposed strategies. First, we show that the efficiency benchmark, which is given by the regret lower bound, reduces to those in Lai and Robbins (1985), Wei (1989), andHu (2000). This implies that the proposed strategy is also optimal under the settings of aforementioned papers.…”
mentioning
confidence: 96%
“…The improvements are due to adaptations of UCB to take into account the unavoidable experimentation costs unique to a particular problem, in this case the higher costs when the number of arms is large. The construction of optimal bandit algorithms for irreversible rules in Hu and Wei (1989) is also based on this principle.…”
Section: Introductionmentioning
confidence: 99%