Abstract:We study dynamic allocation problems for discrete time multi-armed bandits under uncertainty, based on the the theory of nonlinear expectations. We show that, under independence assumption on the bandits and with some relaxation in the definition of optimality, a Gittins allocation index gives optimal choices. This involves studying the interaction of our uncertainty with controls which determine the filtration. We also run a simple numerical example which illustrates the interaction between the willingness to… Show more
“…Results (6,7) are qualitatively what one would expect. Order L 2 arises as the variance of the gamblers' perceived probabilities, and order 2 as the effect of the gambler's bias when the bookmaker is not biased.…”
Section: The Mathematical Association Of Americasupporting
confidence: 66%
“…which is reasonable because a bookmaker obviously prefers gamblers to have a wide range of perceived probabilities to encourage actual betting, as will be seen in (6,7).…”
Section: The Mathematical Association Of Americamentioning
confidence: 99%
“…As perhaps the closest topic to our examples, the Gittins index [29] is analogous to the Kelly criterion in that it gives an optimal strategy within a certain "multiarmed bandits" context, assuming known distributions. A recent paper [7] describes analysis of the "uncertain distributions" case within a technically sophisticated "nonlinear expectations" framework. Final remarks.…”
Section: Remarks On the Academic Literaturementioning
confidence: 99%
“…The only situation where the casino does make a choice is in setting the odds on sports results 6. This is equivalent to paying 1 − x 1 for a contract for the opponent to win 7. In US sports betting, the width x 2 − x 1 of the spread interval is called the hold or the vig or the juice or the take or the house cut[18] 8.…”
The subject of decisions under uncertainty about future events, if lacking sufficient theory or data to make confident probability assessments, poses a challenge for any quantitative analysis. This article suggests one way to first look at this subject. We give elementary examples within a framework for studying decisions under uncertainty where probabilities are only roughly known. The framework, in gambling terms, is that the size of a bet is proportional to the gambler's perceived advantage based on their perceived probability, and their accuracy in estimating true probabilities is measured by mean-squared-error. Within this framework one can study the cost of estimation errors, and seek to formalize the "obvious" notion that in competitive interactions between agents whose actions depend on their perceived probabilities, those who are more accurate at estimating probabilities will generally be more successful than those who are less accurate.This article is an extended version of the Brouwer Medal talk at the 2021 Nederlands Mathematisch Congres.
“…Results (6,7) are qualitatively what one would expect. Order L 2 arises as the variance of the gamblers' perceived probabilities, and order 2 as the effect of the gambler's bias when the bookmaker is not biased.…”
Section: The Mathematical Association Of Americasupporting
confidence: 66%
“…which is reasonable because a bookmaker obviously prefers gamblers to have a wide range of perceived probabilities to encourage actual betting, as will be seen in (6,7).…”
Section: The Mathematical Association Of Americamentioning
confidence: 99%
“…As perhaps the closest topic to our examples, the Gittins index [29] is analogous to the Kelly criterion in that it gives an optimal strategy within a certain "multiarmed bandits" context, assuming known distributions. A recent paper [7] describes analysis of the "uncertain distributions" case within a technically sophisticated "nonlinear expectations" framework. Final remarks.…”
Section: Remarks On the Academic Literaturementioning
confidence: 99%
“…The only situation where the casino does make a choice is in setting the odds on sports results 6. This is equivalent to paying 1 − x 1 for a contract for the opponent to win 7. In US sports betting, the width x 2 − x 1 of the spread interval is called the hold or the vig or the juice or the take or the house cut[18] 8.…”
The subject of decisions under uncertainty about future events, if lacking sufficient theory or data to make confident probability assessments, poses a challenge for any quantitative analysis. This article suggests one way to first look at this subject. We give elementary examples within a framework for studying decisions under uncertainty where probabilities are only roughly known. The framework, in gambling terms, is that the size of a bet is proportional to the gambler's perceived advantage based on their perceived probability, and their accuracy in estimating true probabilities is measured by mean-squared-error. Within this framework one can study the cost of estimation errors, and seek to formalize the "obvious" notion that in competitive interactions between agents whose actions depend on their perceived probabilities, those who are more accurate at estimating probabilities will generally be more successful than those who are less accurate.This article is an extended version of the Brouwer Medal talk at the 2021 Nederlands Mathematisch Congres.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.