2019 IEEE International Symposium on Information Theory (ISIT) 2019
DOI: 10.1109/isit.2019.8849807
|View full text |Cite
|
Sign up to set email alerts
|

Learning to Detect an Odd Markov Arm

Abstract: A multi-armed bandit with finitely many arms is studied when each arm is a homogeneous Markov process on an underlying finite state space. The transition law of one of the arms, referred to as the odd arm, is different from the common transition law of all other arms. A learner, who has no knowledge of the above transition laws, has to devise a sequential test to identify the index of the odd arm as quickly as possible, subject to an upper bound on the probability of error. For this problem, we derive an asymp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4

Relationship

4
0

Authors

Journals

citations
Cited by 4 publications
(24 citation statements)
references
References 21 publications
(39 reference statements)
0
24
0
Order By: Relevance
“…Both the assumptions were crucial in establishing that the ML estimates of the TPMs converge to their true values. In [7]- [9], an analogue of the continuous selection property was established for the maximisers instead of δ-optimal solutions.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…Both the assumptions were crucial in establishing that the ML estimates of the TPMs converge to their true values. In [7]- [9], an analogue of the continuous selection property was established for the maximisers instead of δ-optimal solutions.…”
Section: Discussionmentioning
confidence: 99%
“…It is worth noting here that in the prior works [7]- [9], the outer supremum in the expression for the constant appearing in the lower bound is over all unconditional probability distributions on the arms which are simpler objects to deal with than conditional probability distributions. Further, in these works, this outer supremum is attained by a unique (unconditional) probability distribution on the arms.…”
Section: E Contributions and Key Challenges To Overcomementioning
confidence: 99%
See 3 more Smart Citations