2018
DOI: 10.1016/j.ins.2018.01.005
|View full text |Cite
|
Sign up to set email alerts
|

Automatic design of hyper-heuristic based on reinforcement learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
27
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 92 publications
(35 citation statements)
references
References 23 publications
0
27
0
Order By: Relevance
“…In [70], RL was used to choose one of the lower heuristics from 30 actions, with three non-overlapping ranges as the number of states. In its process, the given computational time is divided into a number of equal epochs (|E|), which was determined by applying the algorithm on the instances taken from CHeSC 2011.…”
Section: Rl In Easmentioning
confidence: 99%
“…In [70], RL was used to choose one of the lower heuristics from 30 actions, with three non-overlapping ranges as the number of states. In its process, the given computational time is divided into a number of equal epochs (|E|), which was determined by applying the algorithm on the instances taken from CHeSC 2011.…”
Section: Rl In Easmentioning
confidence: 99%
“…Two approaches in [95,121] used such simple RL scheme to reward and punish the weights of low-level heuristics based on their previous performance and selected low-level heuristic using roulette wheel method according to their weights in each iteration. Rather than using such simple RL scheme, a more complex RL method, Q-learning, was introduced to learn to select heuristics in [36], which strictly follows the criteria of RL. In [47], the author proposed a RL-based hyper-heuristic method that fulfilled the criteria of RL.…”
Section: Rl-based Approachesmentioning
confidence: 99%
“…In [29], Ahmed and Ender proposed an iterated multi-stage selection hyperheuristic and investigated on one-dimensional bin-packing, personal scheduling, permutation flow-shop, travelling salesman problem, and vehicle routing problem. In [12], a reinforcement learning technique is incorporated in hyper-heuristic and the performance is evaluated on six different problem domains. Several state-of-the-art hyper-heuristics [6,12] are also developed…”
Section: Overview Of Hyper-heuristicmentioning
confidence: 99%