2022
DOI: 10.1287/opre.2021.2207
|View full text |Cite
|
Sign up to set email alerts
|

Optimistic Gittins Indices

Abstract: We propose a tightening sequence of optimistic approximations to the Gittins index in “Optimistic Gittins Indices.” We show that the use of these approximations in concert with the use of an increasing discount factor appears to offer a compelling alternative to state-of-the-art index schemes proposed for the Bayesian multiarmed bandit problem. We prove that the use of these optimistic indices constitutes a regret optimal algorithm. Perhaps more interestingly, the use of even the loosest of these approximation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
1

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 32 publications
(31 reference statements)
0
0
1
Order By: Relevance
“…Further works include extensions to the discounted infinite horizon case, to the frequentist setting, as well as the theoretical analysis of the DeCo policy. Also, while we have not reproduced their results, the DeCo policy seems to have performance close to the state of the art [19,9].…”
Section: Discussioncontrasting
confidence: 55%
“…Further works include extensions to the discounted infinite horizon case, to the frequentist setting, as well as the theoretical analysis of the DeCo policy. Also, while we have not reproduced their results, the DeCo policy seems to have performance close to the state of the art [19,9].…”
Section: Discussioncontrasting
confidence: 55%