Anand Kalvit scite author profile

Anand Kalvit

5 Publications

4 Citation Statements Received

49 Citation Statements Given

Affiliations

Columbia University, Indian Institute of Technology Bombay

Publications

A Closer Look at the Worst-case Behavior of Multi-armed Bandit Algorithms

Kalvit

Zeevi

2021

Preprint

One of the key drivers of complexity in the classical (stochastic) multi-armed bandit (MAB) problem is the difference between mean rewards in the top two arms, also known as the instance gap. The celebrated Upper Confidence Bound (UCB) policy is among the simplest optimism-based MAB algorithms that naturally adapts to this gap: for a horizon of play n, it achieves optimal O(log n) regret in instances with "large" gaps, and a near-optimal O(√(n log n)) minimax regret when the gap can be arbitrarily "small." This paper provides new results on the arm-sampling behavior of UCB, leading to several important insights. Among these, it is shown that arm-sampling rates under UCB are asymptotically deterministic, regardless of the problem complexity. This discovery facilitates new sharp asymptotics and a novel alternative proof for the O(√(n log n)) minimax regret of UCB. Furthermore, the paper also provides the first complete process-level characterization of the MAB problem under UCB in the conventional diffusion scaling. Among other things, the "small" gap worst-case lens adopted in this paper also reveals profound distinctions between the behavior of UCB and Thompson Sampling, such as an incomplete learning phenomenon characteristic of the latter.
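As a rough illustration of the gap-dependent arm-sampling behavior the abstract describes, the following is a minimal sketch of a UCB-style policy on a two-armed Bernoulli bandit. It assumes the textbook UCB1 index (empirical mean plus sqrt(2 ln t / N_i)), which may differ from the exact variant analyzed in the paper; the function name `ucb1`, the Bernoulli reward model, and the chosen means and horizon are all illustrative assumptions rather than anything taken from the paper.

```python
import math
import random


def ucb1(means, horizon, seed=0):
    """Run textbook UCB1 on Bernoulli arms (an assumed variant).

    Returns (pull counts per arm, pseudo-regret against the best arm).
    """
    rng = random.Random(seed)
    k = len(means)
    counts = [0] * k   # number of pulls of each arm
    sums = [0.0] * k   # cumulative reward of each arm
    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1  # pull every arm once to initialise the estimates
        else:
            # Optimistic index: empirical mean plus exploration bonus.
            arm = max(
                range(k),
                key=lambda i: sums[i] / counts[i]
                + math.sqrt(2.0 * math.log(t) / counts[i]),
            )
        reward = 1.0 if rng.random() < means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
    best = max(means)
    regret = sum((best - m) * c for m, c in zip(means, counts))
    return counts, regret


if __name__ == "__main__":
    # Hypothetical instances: one "large" gap and one zero gap.
    for gap in (0.2, 0.0):
        counts, regret = ucb1([0.5, 0.5 - gap], horizon=10_000)
        print(f"gap={gap}: pulls={counts}, pseudo-regret={regret:.1f}")
```

Comparing the pull counts across the two instances gives an informal feel for how the split between arms depends on the gap; it is not a substitute for the asymptotic results stated in the abstract.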

Capacity expansion of neutral ISPs via content provider participation: The bargaining edge

Kalvit

Pinjani

Kasbekar

et al.

2019

Performance Evaluation

Stochastic approximation algorithms for rumor source inference on graphs

Kalvit

Borkar

Karamchandani

2019

Performance Evaluation

Dynamic Learning in Large Matching Markets

Kalvit

Zeevi

2022

SIGMETRICS Perform. Eval. Rev.

We study a sequential matching problem faced by large centralized platforms where "jobs" must be matched to "workers" subject to uncertainty about worker skill proficiencies. Jobs arrive at discrete times (possibly in batches of stochastic size and composition) with "job-types" observable upon arrival. To capture the "choice overload" phenomenon, we posit an unlimited supply of workers where each worker is characterized by a vector of attributes (aka "worker-types") sampled from an underlying population-level distribution. The distribution as well as mean payoffs for possible worker-job type-pairs are unobservables and the platform's goal is to sequentially match incoming jobs to workers in a way that maximizes its cumulative payoffs over the planning horizon. We establish lower bounds on the regret of any matching algorithm in this setting and propose a novel rate-optimal learning algorithm that adapts to the aforementioned primitives online. Our learning guarantees highlight a distinctive characteristic of the problem: achievable performance only has a second-order dependence on worker-type distributions; we believe this finding may be of interest more broadly.

MCMC Approaches to Rumor Source Inference using Pairwise Information

Kalvit

Borkar

Karamchandani

2017
