Proceedings of the 8th International Conference on Performance Evaluation Methodologies and Tools 2015
DOI: 10.4108/icst.valuetools.2014.258207
|View full text |Cite
|
Sign up to set email alerts
|

Exploration vs Exploitation with Partially Observable Gaussian Autoregressive Arms

Abstract: We consider a restless bandit problem with Gaussian autoregressive arms, where the state of an arm is only observed when it is played and the state-dependent reward is collected. Since arms are only partially observable, a good decision policy needs to account for the fact that information about the state of an arm becomes more and more obsolete while the arm is not being played. Thus, the decision maker faces a tradeoff between exploiting those arms that are believed to be currently the most rewarding (i.e. t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 18 publications
0
1
0
Order By: Relevance
“…Our goal is not to merely present a descriptive collection of the related work and gaps in these two lines of researnd for our future work on RL based methods for optimizing locality and globality in probabilistic forecasting. Albeit there is extensive literature work on proposing global methods over local ones or the other way around (de Bezenac et al, 2020;Rangapuram et al, 2018;Kuhn et al, 2015;Rasul et al, 2021;Salinas et al, 2020;Tang et al, 2020;Yao et al, 2017;Zhou et al, 2020;Zhu & Laptev, 2017), the research on hybrid models (global-local, local-global) is still incipient and not always suitable to be applied for collections of multivariate TS, which we mainly take as our focus scope (Hwang et al, 2016).…”
Section: Introductionmentioning
confidence: 99%
“…Our goal is not to merely present a descriptive collection of the related work and gaps in these two lines of researnd for our future work on RL based methods for optimizing locality and globality in probabilistic forecasting. Albeit there is extensive literature work on proposing global methods over local ones or the other way around (de Bezenac et al, 2020;Rangapuram et al, 2018;Kuhn et al, 2015;Rasul et al, 2021;Salinas et al, 2020;Tang et al, 2020;Yao et al, 2017;Zhou et al, 2020;Zhu & Laptev, 2017), the research on hybrid models (global-local, local-global) is still incipient and not always suitable to be applied for collections of multivariate TS, which we mainly take as our focus scope (Hwang et al, 2016).…”
Section: Introductionmentioning
confidence: 99%