2006
DOI: 10.1239/aap/1158684996
|View full text |Cite
|
Sign up to set email alerts
|

Some indexable families of restless bandit problems

Abstract: In 1988 Whittle introduced an important but intractable class of restless bandit problems which generalise the multiarmed bandit problems of Gittins by allowing state evolution for passive projects. Whittle's account deployed a Lagrangian relaxation of the optimisation problem to develop an index heuristic. Despite a developing body of evidence (both theoretical and empirical) which underscores the strong performance of Whittle's index policy, a continuing challenge to implementation is the need to establish t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
67
0

Year Published

2008
2008
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 65 publications
(68 citation statements)
references
References 17 publications
1
67
0
Order By: Relevance
“…In the general context of RMBP, there is a rich literature on indexability. See [23] for the linear programming representation of conditions for indexability and [9] for examples of specific indexable restless bandit processes. Constant-factor approximation algorithms for RMBP have also been explored in the literature.…”
Section: Related Workmentioning
confidence: 99%
“…In the general context of RMBP, there is a rich literature on indexability. See [23] for the linear programming representation of conditions for indexability and [9] for examples of specific indexable restless bandit processes. Constant-factor approximation algorithms for RMBP have also been explored in the literature.…”
Section: Related Workmentioning
confidence: 99%
“…We consider Whittle's index and other index-type policies. However, the implementation of Whittle's index is nontrivial because i) Whittle's index is only applicable to a restricted class of problems that satisfy the so-called indexability property, which is difficult to check in general (Whittle 1988, Glazebrook et al 2006; ii) the computation of Whittle's index involves repeated solution of single-bandit dynamic programs, which is difficult for our case since each single bandit has a multi-dimensional state space.…”
Section: Research Questions and Key Resultsmentioning
confidence: 99%
“…Because θ 1 (x, 1) = 1, Condition 2 is satisfied, and this version of Problem 2 is indexable. Glazebrook et al [11] formulated the same problem, slightly differently from us, as a restless bandit problem where one does not wait for the broken arm to be fixed before the reward stream is again available: if one plays a broken arm then he/she obtains its immediate reward minus the switching cost, and the arm is guaranteed to be available in the next period. However, the forms of their and our Whittle indices are the same.…”
Section: Proposition 5 Under Condition 2 Activating the Arm In Statmentioning
confidence: 99%
“…We have developed the Whittle indices for Problems 1 and 2 in the previous sections. Here we discuss how to compute the indices in (9) and (11). For this purpose, we develop the restartin problem representation of the indices.…”
Section: The Restart-in Problemmentioning
confidence: 99%
See 1 more Smart Citation