We consider the stochastic multi-armed bandit problem with a prior distribution on the reward distributions. We are interested in studying prior-free and prior-dependent regret bounds, very much in the same spirit as the usual distribution-free and distribution-dependent bounds for the non-Bayesian stochastic bandit. Building on the techniques of Audibert and Bubeck [2009] and Russo and Van Roy [2013], we first show that Thompson Sampling attains an optimal prior-free bound in the sense that for any prior distribution its Bayesian regret is bounded from above by $14\sqrt{nK}$. This result is unimprovable in the sense that there exists a prior distribution such that any algorithm has a Bayesian regret bounded from below by $\frac{1}{20}\sqrt{nK}$. We also study priors for the setting of Bubeck et al. [2013] (where the optimal mean is known, as well as a lower bound on the smallest gap) and we show that in this case the regret of Thompson Sampling is in fact uniformly bounded over time, thus showing that Thompson Sampling can take full advantage of the nice properties of these priors.
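For concreteness, the following is a minimal sketch of Thompson Sampling in the textbook Beta-Bernoulli conjugate case: each arm keeps a Beta posterior over its mean reward, one posterior sample is drawn per arm each round, and the arm with the largest sample is played. The arm means, horizon, and uniform Beta(1, 1) priors below are illustrative choices, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def thompson_sampling(true_means, horizon, prior_alpha=1.0, prior_beta=1.0):
    """Thompson Sampling for Bernoulli bandits with independent Beta priors."""
    K = len(true_means)
    alpha = np.full(K, prior_alpha)  # prior pseudo-counts of successes
    beta = np.full(K, prior_beta)    # prior pseudo-counts of failures
    best = max(true_means)
    regret = 0.0
    for _ in range(horizon):
        theta = rng.beta(alpha, beta)          # one posterior sample per arm
        arm = int(np.argmax(theta))            # play the arm with the best sample
        reward = rng.binomial(1, true_means[arm])
        alpha[arm] += reward                   # conjugate Beta-Bernoulli update
        beta[arm] += 1 - reward
        regret += best - true_means[arm]       # accumulate pseudo-regret
    return regret

# Example: 3 arms, 10,000 rounds, uniform priors.
print(thompson_sampling([0.3, 0.5, 0.7], horizon=10_000))
```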
Abstract. The empirically successful Thompson Sampling algorithm for stochastic bandits has drawn much interest in understanding its theoretical properties. One important benefit of the algorithm is that it allows domain knowledge to be conveniently encoded as a prior distribution to balance exploration and exploitation more effectively. While it is generally believed that the algorithm's regret is low (high) when the prior is good (bad), little is known about the exact dependence. This paper is a first step towards answering this important question: focusing on a special yet representative case, we fully characterize the algorithm's worst-case dependence of regret on the choice of prior. As a corollary, these results also provide useful insights into the general sensitivity of the algorithm to the choice of priors, when no structural assumptions are made. In particular, with $p$ being the prior probability mass of the true reward-generating model, we prove $O(\sqrt{T/p})$ and $O(\sqrt{(1-p)T})$ regret upper bounds for the poor- and good-prior cases, respectively, as well as matching lower bounds. Our proofs rely on a fundamental property of Thompson Sampling and make heavy use of martingale theory, both of which appear novel in the Thompson-Sampling literature and may be useful for studying other behavior of the algorithm.
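As a loose illustration of the role of the prior mass $p$ (a toy sketch, not the construction used in the paper's proofs), here is Thompson Sampling over a finite class of candidate bandit models, where the prior places mass $p$ on the true model. The two candidate models, horizon, and prior weights below are invented for the example; qualitatively, the smaller $p$ is, the more regret is incurred before the posterior concentrates on the truth.

```python
import numpy as np

rng = np.random.default_rng(1)

def model_based_ts(models, true_idx, prior, horizon):
    """Thompson Sampling over a finite class of Bernoulli-bandit models.

    Each round: sample a model from the posterior, play its best arm,
    then update the posterior by Bayes' rule from the observed reward.
    """
    posterior = np.asarray(prior, dtype=float).copy()
    truth = models[true_idx]
    best = truth.max()
    regret = 0.0
    for _ in range(horizon):
        m = rng.choice(len(models), p=posterior)  # posterior sample of a model
        arm = int(np.argmax(models[m]))           # act greedily under that model
        reward = rng.binomial(1, truth[arm])
        # Bernoulli likelihood of the observed reward under each candidate model
        lik = models[:, arm] if reward == 1 else 1.0 - models[:, arm]
        posterior *= lik
        posterior /= posterior.sum()
        regret += best - truth[arm]
    return regret

# Two candidate models; the prior puts mass p on the true one (index 0).
models = np.array([[0.7, 0.4], [0.4, 0.7]])
for p in (0.9, 0.1):
    print(f"p={p}: regret ~ {model_based_ts(models, 0, [p, 1 - p], 5000):.1f}")
```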