2004
DOI: 10.1007/s10645-004-2477-z
|View full text |Cite
|
Sign up to set email alerts
|

A survey on the bandit problem with switching costs

Abstract: SummaryThe paper surveys the literature on the bandit problem, focusing on its recent development in the presence of switching costs. Switching costs between arms makes not only the Gittins index policy suboptimal, but also renders the search for the optimal policy computationally infeasible. This survey will first discuss the decomposability properties of the arms that make the Gittins index policy optimal, and show how these properties break down upon the introduction of costs on switching arms. Having estab… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
45
0

Year Published

2008
2008
2019
2019

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 70 publications
(45 citation statements)
references
References 90 publications
0
45
0
Order By: Relevance
“…It is optimal to switch to another cash-flow stream as soon as the benefit of delaying the switch, as measured by an appropriate "adjoint variable," drops to zero. Switching costs, which in our problem tend to delay actions, have also been studied for the stochastic multi-armed bandit problem; see Jun (2004) for a survey.…”
Section: Literaturementioning
confidence: 99%
“…It is optimal to switch to another cash-flow stream as soon as the benefit of delaying the switch, as measured by an appropriate "adjoint variable," drops to zero. Switching costs, which in our problem tend to delay actions, have also been studied for the stochastic multi-armed bandit problem; see Jun (2004) for a survey.…”
Section: Literaturementioning
confidence: 99%
“…Additive switching cost are common in the literature and algorithms exist (e.g., Banks and Sundaram 1994;Dushochet and Hongler 2003;Jun 2004), but additive switching costs impose path dependence and make it infeasible to determine when to morph in real time. On the other hand, a multiplicative switching cost can be factored out in a Bellman equation.…”
Section: Assumption 1 Switching Costsmentioning
confidence: 99%
“…MAB with switching costs was first considered in [17]. An excellent survey on MAB with switching costs can be found in [18].…”
Section: Related Workmentioning
confidence: 99%