2020
DOI: 10.1109/tnse.2019.2935256
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Armed Bandits on Partially Revealed Unit Interval Graphs

Abstract: A stochastic multi-armed bandit problem with side information on the similarity and dissimilarity across different arms is considered. The action space of the problem can be represented by a unit interval graph (UIG) where each node represents an arm and the presence (absence) of an edge between two nodes indicates similarity (dissimilarity) between their mean rewards. Two settings of complete and partial side information based on whether the UIG is fully revealed are studied and a general two-step learning st… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 28 publications
(46 reference statements)
0
1
0
Order By: Relevance
“…In the current work, many mobile group intelligence perception systems assume that the user's perceived quality is known, and, on this basis, specific optimization goals are used to recruit users [13]. However, in real life, the perceived quality of mobile users is often unknown.…”
Section: Introductionmentioning
confidence: 99%
“…In the current work, many mobile group intelligence perception systems assume that the user's perceived quality is known, and, on this basis, specific optimization goals are used to recruit users [13]. However, in real life, the perceived quality of mobile users is often unknown.…”
Section: Introductionmentioning
confidence: 99%