2010
DOI: 10.1007/s10472-010-9213-y
|View full text |Cite
|
Sign up to set email alerts
|

Analyzing bandit-based adaptive operator selection mechanisms

Abstract: Several techniques have been proposed to tackle the Adaptive Operator Selection (AOS) issue in Evolutionary Algorithms. Some recent proposals are based on the Multi-Armed Bandit (MAB) paradigm: each operator is viewed as one arm of a MAB problem, and the rewards are mainly based on the fitness improvement brought by the corresponding operator to the individual it is applied to. However, the AOS problem is dynamic, whereas standard MAB algorithms are known to optimally solve the exploitation versus exploration … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
123
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 133 publications
(123 citation statements)
references
References 38 publications
0
123
0
Order By: Relevance
“…There are three popular adaptive methods: probability matching, adaptive pursuit and multi-armed bandit. Fialho has authored (in collaboration with assorted others) a large body of work on adaptive operation selection, see, for example, [5,6]. The strategy we implement is multi-armed bandit with AUC credit assignment.…”
Section: Related Work and Discussionmentioning
confidence: 99%
“…There are three popular adaptive methods: probability matching, adaptive pursuit and multi-armed bandit. Fialho has authored (in collaboration with assorted others) a large body of work on adaptive operation selection, see, for example, [5,6]. The strategy we implement is multi-armed bandit with AUC credit assignment.…”
Section: Related Work and Discussionmentioning
confidence: 99%
“…These can customize an initial algorithm setup for a given problem off-line (before the run), or on-line (during the run) 63 . Techniques such as automated parameter tuning 64,65,66,67 and adaptive parameter control continue to make advances in this area 68,69,70,71 .…”
Section: Automated Design and Tuning Of Easmentioning
confidence: 99%
“…The former takes recent assist information on operator into account. Some recent assist information is employed as rewards which decide the credit assignment of operators [24]. Rank factor is used to increase the use frequency of better operators.…”
Section: Credit Assignmentmentioning
confidence: 99%
“…The parameter C plays a crucial role in deciding which factor plays a more important role. SlMAB [24] uses a sliding window with a mechanism of FIFO to store some latest information about operators. The latest information about operators truly reflects the operator performance.…”
Section: Multiarmed Banditmentioning
confidence: 99%
See 1 more Smart Citation