2019
DOI: 10.48550/arxiv.1907.05689
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Gittins' theorem under uncertainty

Abstract: We study dynamic allocation problems for discrete time multi-armed bandits under uncertainty, based on the the theory of nonlinear expectations. We show that, under strong independence of the bandits and with some relaxation in the definition of optimality, a Gittins allocation index gives optimal choices. This involves studying the interaction of our uncertainty with controls which determine the filtration. We also run a simple numerical example which illustrates the interaction between the willingness to exp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 53 publications
(72 reference statements)
0
1
0
Order By: Relevance
“…Remark 10. Unlike in [5], where a nonlinear expectation was used to encode uncertainty aversion in the problem, here it will serve to encourage random decisions and so smooth our value function.…”
Section: Smooth Entropymentioning
confidence: 99%
“…Remark 10. Unlike in [5], where a nonlinear expectation was used to encode uncertainty aversion in the problem, here it will serve to encourage random decisions and so smooth our value function.…”
Section: Smooth Entropymentioning
confidence: 99%