2023
DOI: 10.1109/tits.2022.3207011
|View full text |Cite
|
Sign up to set email alerts
|

Learning to Solve Multiple-TSP With Time Window and Rejections via Deep Reinforcement Learning

Abstract: We propose a manager-worker framework 1 based on deep reinforcement learning to tackle a hard yet nontrivial variant of Travelling Salesman Problem (TSP), i.e., multiplevehicle TSP with time window and rejections (mTSPTWR), where customers who cannot be served before the deadline are subject to rejections. Particularly, in the proposed framework, a manager agent learns to divide mTSPTWR into sub-routing tasks by assigning customers to each vehicle via a Graph Isomorphism Network (GIN) based policy network. A w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
30
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 30 publications
(47 citation statements)
references
References 22 publications
0
30
0
Order By: Relevance
“…In Zhang et al. [168], a more practical TSPTW variant called multiple‐vehicle TSP with time window and rejections was studied, where customers can be rejected due to TW constraint.…”
Section: Vehicle Routing Problemsmentioning
confidence: 99%
See 1 more Smart Citation
“…In Zhang et al. [168], a more practical TSPTW variant called multiple‐vehicle TSP with time window and rejections was studied, where customers can be rejected due to TW constraint.…”
Section: Vehicle Routing Problemsmentioning
confidence: 99%
“…By proposing a novel Multi-Agent Attention Model, Zhang et al [150] tackled the VRP with Soft Time Windows (VRPSTW) and outperformed OR-Tools with shorter computation time. In Zhang et al [168], a more practical TSPTW variant called multiple-vehicle TSP with time window and rejections was studied, where customers can be rejected due to TW constraint.…”
Section: Classical Routing Problemsmentioning
confidence: 99%
“…The algorithm was tested for generalization ability by training the network with random events in the existing environment to quickly adapt to new problems. However, the agent was not tested with a completely new dataset, an issue which was overcome by Zhang et al [2020a]. The authors developed a graph neural network which enabled them to solve size-agnostic problems.…”
Section: Related Workmentioning
confidence: 99%
“…The action space was designed for n jobs along with an additional job called No-Op (No Operation). The agent was tested with different benchmark instances where the agent performed around 18% better than Zhang et al [2020a] and 10% better than Han and Yang [2020]. Even though they provide a near-optimum solution, the approach falls short of the generalization objective.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation