2006
DOI: 10.1017/s000186780000121x
|View full text |Cite
|
Sign up to set email alerts
|

Some indexable families of restless bandit problems

Abstract: In 1988 Whittle introduced an important but intractable class of restless bandit problems which generalise the multiarmed bandit problems of Gittins by allowing state evolution for passive projects. Whittle's account deployed a Lagrangian relaxation of the optimisation problem to develop an index heuristic. Despite a developing body of evidence (both theoretical and empirical) which underscores the strong performance of Whittle's index policy, a continuing challenge to implementation is the need to establish t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
9
0

Year Published

2007
2007
2017
2017

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 16 publications
1
9
0
Order By: Relevance
“…Further classes of indexable problems are the dual speed problem of Glazebrook, Nino-Mora, and Ansell [24], the maintenance models of Glazebrook, Ruiz-Hernandez, and Kirkbride [25], and the spinning plates and squad models of Glazebrook, Kirkbride, and Ruiz-Hernandez [23]. Our paper is in line with these works in that it trades indexability for specific structural conditions.…”
Section: Indexabilitysupporting
confidence: 63%
“…Further classes of indexable problems are the dual speed problem of Glazebrook, Nino-Mora, and Ansell [24], the maintenance models of Glazebrook, Ruiz-Hernandez, and Kirkbride [25], and the spinning plates and squad models of Glazebrook, Kirkbride, and Ruiz-Hernandez [23]. Our paper is in line with these works in that it trades indexability for specific structural conditions.…”
Section: Indexabilitysupporting
confidence: 63%
“…Weber and Weiss (1990) prove that Whittle's index is asymptotically optimal when the ratio of the active number of bandits to the total number is fixed. Most papers (e.g., Ansell et al 2003;Glazebrook et al 2005Glazebrook et al , 2006 focus on proving indexability before proceeding to compute Whittle's index and numerically solving the problem. In our model, we prove that a much simpler myopic policy is optimal in special settings of the problem and use the policy to devise a heuristic for more general settings.…”
Section: Literature Reviewmentioning
confidence: 99%
“…At an abstract level, our model is most closely related to Glazebrook et al (2005), which studies the optimal allocation of repairmen to machines that deteriorate under usage. However, Glazebrook et al (2006) assume that the states of machines are known at every decision epoch. In our model, since a patient's health state is not observable without a diagnosis during the visit and every patient is not seen each period, we assume that the decision maker only knows the probability distribution over health states.…”
Section: Literature Reviewmentioning
confidence: 99%
“…In reality, set-up times would impose a penalty upon Red for such a policy. Glazebrook, Kirkbride and Ruiz (2005) propose modifications to indices which take account of switching penalties and/or times. Such modifications can be applied to all of the models discussed in this paper, although strict optimality is no longer achieved.…”
Section: Model 1 -Red Learns About the Nature Of Blue Targetsmentioning
confidence: 99%