Periodic Bandits and Wireless Network Selection

Oh, Shunhao; Appavoo, Anuja Meetoo; Gilbert, Seth

doi:10.48550/arxiv.1904.12355

Cited by 1 publication

(1 citation statement)

References 21 publications

(43 reference statements)

“…Auer et al [2019], Besbes et al [2014], Cheung et al [2022], Luo et al [2018], Russac et al [2019], Trovo et al [2020], Wu et al [2018]) although they do not deal with periodically behaved dynamical system properly (see discussions in [Cai et al, 2021] as well). For discrete action settings, periodic bandit [Oh et al, 2019] was proposed, which aims at optimizing for the total regret. Also, if the period is known, Gaussian process bandit for periodic reward functions was proposed (Cai et al [2021]) under Gaussian noise assumption.…”

Section: Related Work (mentioning)

confidence: 99%

Dynamic Structure Estimation from Bandit Feedback

Ohnishi, Ishikawa, Kuroki et al. 2022

Preprint

This work presents a novel method for estimating the structure of an underlying dynamical system. We tackle the problem of estimating dynamic structure from bandit feedback contaminated by sub-Gaussian noise. In particular, we focus on periodically behaving discrete dynamical systems in Euclidean space, and carefully identify a certain obtainable subset of the full information about the periodic structure. We then derive a sample complexity bound for periodic structure estimation. Technically, asymptotic results for exponential sums are adopted to effectively average out the noise effects while preventing the information to be estimated from vanishing. For linear systems, the use of the Weyl sum further allows us to extract eigenstructures. Our theoretical claims are experimentally validated on simulations of toy examples, including cellular automata.
