2021
DOI: 10.1609/socs.v12i1.18556
|View full text |Cite
|
Sign up to set email alerts
|

SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems

Abstract: We study combinatorial problems with real world applications such as machine scheduling, routing, and assignment. We propose a method that combines Reinforcement Learning (RL) and planning. This method can equally be applied to both the offline, as well as online, variants of the combinatorial problem, in which the problem components (e.g., jobs in scheduling problems) are not known in advance, but rather arrive during the decision-making process. Our solution is quite generic, scalable, and leverages distribu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 15 publications
0
7
0
Order By: Relevance
“…In Wu et al. [159], the AM [9] was adapted together with some RNN units to solve the time‐dependent TSPTW variant. In Oren et al.…”
Section: Vehicle Routing Problemsmentioning
confidence: 99%
See 3 more Smart Citations
“…In Wu et al. [159], the AM [9] was adapted together with some RNN units to solve the time‐dependent TSPTW variant. In Oren et al.…”
Section: Vehicle Routing Problemsmentioning
confidence: 99%
“…In Oren et al. [160], the deep Q‐learning was explored together with an improved Monte Carlo Tree Search (MCTS) to solve online CVRP. However, the problem scales of the aforementioned works remain rather small.…”
Section: Vehicle Routing Problemsmentioning
confidence: 99%
See 2 more Smart Citations
“…Algorithms such as Taboo search [Taillard, 1994], simulated Annealing [Van Laarhoven et al, 1992], genetic algorithms and particle swarm optimization [Pezzella et al, 2008] have proven to solve the problem, but lack in either computation time or generalization capabilities. Advancements in DRL approaches in recent years have enabled considerable progress for the domain of COP applications [Cappart et al, 2021, Oren et al, 2021. Some of the major COPs have been successfully solved using DRL such as the Travelling Salesman Problem (TSP) [Zhang et al, 2021, d O Costa et al, 2020, Zhang et al, 2020b, the Knap Sack Problem [Afshar et al, 2020, Cappart et al, 2021 and the Steiner Tree Problem [Du et al, 2021].…”
Section: Related Workmentioning
confidence: 99%