2021
DOI: 10.1609/aaai.v35i14.17476
|View full text |Cite
|
Sign up to set email alerts
|

Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman Problem

Abstract: We address the Traveling Salesman Problem (TSP), a famous NP-hard combinatorial optimization problem. And we propose a variable strategy reinforced approach, denoted as VSR-LKH, which combines three reinforcement learning methods (Q-learning, Sarsa and Monte Carlo) with the well-known TSP algorithm, called Lin-Kernighan-Helsgaun (LKH). VSR-LKH replaces the inflexible traversal operation in LKH, and lets the program learn to make choice at each search step by reinforcement learning. Experimental results on 111 … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 42 publications
(17 citation statements)
references
References 25 publications
0
10
0
Order By: Relevance
“…ML has been applied to solve a number of COPs, including traveling salesman problems (Xin et al, 2021;Zheng et al, 2021), vehicle routing (Kool et al, 2018), boolean satisfiability (Selsam et al, 2018;Amizadeh et al, 2018) and general graph optimization problems (Khalil et al, 2017;Li et al, 2018).…”
Section: Appendix a Additional Related Workmentioning
confidence: 99%
“…ML has been applied to solve a number of COPs, including traveling salesman problems (Xin et al, 2021;Zheng et al, 2021), vehicle routing (Kool et al, 2018), boolean satisfiability (Selsam et al, 2018;Amizadeh et al, 2018) and general graph optimization problems (Khalil et al, 2017;Li et al, 2018).…”
Section: Appendix a Additional Related Workmentioning
confidence: 99%
“…Costa et al (2020) [ 19 ] proposed a 2-opt heuristic algorithm combined with deep RL for TSP, which essentially enhances the learning process of 2-opt. Combining the advantages of Q-learning (QL), Sarsa, and the Monte Carlo algorithm, Zheng et al (2020) [ 20 ] proposed a variable strategy reinforced (VSR) approach, optimized the k-opt process of LKH based on this, and designed VSR-LKH for TSP. Optimizing the parameters of the biased random-key genetic algorithm (BRKGA) by QL, Chaves et al (2021) [ 21 ] proposed a BRKGA-QL algorithm for TSP.…”
Section: Introductionmentioning
confidence: 99%
“…The meta-heuristics include discrete bat algorithm [13], adaptive ACO based on unique strategies [14], improved artificial bee colony algorithm [15], and discrete spider monkey optimization [16]. In addition, the advantages of deep learning based on combining it with TSP to solve the TSP problem [17,18].…”
Section: Introductionmentioning
confidence: 99%