2021
DOI: 10.1007/s10994-021-05956-1
|View full text |Cite
|
Sign up to set email alerts
|

Multi-objective multi-armed bandit with lexicographically ordered and satisficing objectives

Abstract: We consider multi-objective multi-armed bandit with (i) lexicographically ordered and (ii) satisficing objectives. In the first problem, the goal is to select arms that are lexicographic optimal as much as possible without knowing the arm reward distributions beforehand. We capture this goal by defining a multi-dimensional form of regret that measures the loss due to not selecting lexicographic optimal arms, and then, propose an algorithm that achieves Õ(T 2∕3 ) gap-free regret and prove a regret lower bound o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

1
12
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(13 citation statements)
references
References 18 publications
(37 reference statements)
1
12
0
Order By: Relevance
“…To deal with these real-world applications, a natural idea is to utilize lexicographic ordering, as it ranks the objectives according to their importance (Ehrgott 2005;Wray, Zilberstein, and Mouaddib 2015;Hüyük and Tekin 2021;Hosseini et al 2021;Skalse et al 2022). Let X represent an arm space, and the expected payoffs for a, b ∈ X are [µ 1 (a), µ 2 (a), .…”
Section: Introductionmentioning
confidence: 99%
See 4 more Smart Citations
“…To deal with these real-world applications, a natural idea is to utilize lexicographic ordering, as it ranks the objectives according to their importance (Ehrgott 2005;Wray, Zilberstein, and Mouaddib 2015;Hüyük and Tekin 2021;Hosseini et al 2021;Skalse et al 2022). Let X represent an arm space, and the expected payoffs for a, b ∈ X are [µ 1 (a), µ 2 (a), .…”
Section: Introductionmentioning
confidence: 99%
“…, m}, such that µ i (a) = µ i (b) for 1 ≤ i ≤ i * − 1 and µ i * (a) > µ i * (b). The lexicographically optimal arm is the one that is not lexicographically dominated by any other arms (Hüyük and Tekin 2021).…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations