Gittins' theorem under uncertainty

Cohen, Samuel N.; Treetanthiploet, Tanut

doi:10.48550/arxiv.1907.05689

Search citation statements

Order By: Relevance

Paper Sections

Select...

Smooth Entropy1

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2020

Publication Types

Select...

Other1

Relationship

Self Cite1

Independent0

Authors

Journals

Cited by 1 publication

(1 citation statement)

References 53 publications

(72 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Remark 10. Unlike in [5], where a nonlinear expectation was used to encode uncertainty aversion in the problem, here it will serve to encourage random decisions and so smooth our value function.…”

Section: Smooth Entropymentioning

confidence: 99%

Asymptotic Randomised Control with applications to bandits

Cohen¹,

Treetanthiploet²

2020

Preprint

Self Cite

View full text Add to dashboard Cite

We consider a general multi-armed bandit problem with correlated (and simple contextual and restless) elements, as a relaxed control problem. By introducing an entropy premium, we obtain a smooth asymptotic approximation to the value function. This yields a novel semi-index approximation of the optimal decision process, obtained numerically by solving a fixed point problem, which can be interpreted as explicitly balancing an exploration-exploitation trade-off. Performance of the resulting Asymptotic Randomised Control (ARC) algorithm compares favourably with other approaches to correlated multi-armed bandits.

show abstract

Section: Smooth Entropymentioning

confidence: 99%