Learning the travelling salesperson problem requires rethinking generalization

Joshi, Chaitanya K.; Cappart, Quentin; Rousseau, Louis-Martin; Laurent, Thomas

doi:10.1007/s10601-022-09327-y

Cited by 30 publications

(26 citation statements)

References 41 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…After the first DNN model was trained (using example solutions) to construct TSP tours [64], many improvements have been proposed, e.g. different training strategies such as reinforcement learning (RL) [6,14,33,37] and model architectures, which enabled the same idea to be used for other routing problems [15,18,36,45,50,54,67]. Most constructive neural methods are auto-regressive, evaluating the model many times to predict one node at the time, but other works have considered predicting a heatmap of promising edges at once [19,32,52], which allows a tour to be constructed (using sampling or beam search) without further evaluating the model.…”

Section: Related Workmentioning

confidence: 99%

Deep Policy Dynamic Programming for Vehicle Routing Problems

Kool

Hoof

Gromicho

et al. 2022

Lecture Notes in Computer Science

108

134

View full text Add to dashboard Cite

Routing problems are a class of combinatorial problems with many practical applications. Recently, end-to-end deep learning methods have been proposed to learn approximate solution heuristics for such problems. In contrast, classical dynamic programming (DP) algorithms guarantee optimal solutions, but scale badly with the problem size. We propose Deep Policy Dynamic Programming (DPDP), which aims to combine the strengths of learned neural heuristics with those of DP algorithms. DPDP prioritizes and restricts the DP state space using a policy derived from a deep neural network, which is trained to predict edges from example solutions. We evaluate our framework on the travelling salesman problem (TSP), the vehicle routing problem (VRP) and TSP with time windows (TSPTW) and show that the neural policy improves the performance of (restricted) DP algorithms, making them competitive to strong alternatives such as LKH, while also outperforming most other 'neural approaches' for solving TSPs, VRPs and TSPTWs with 100 nodes.

show abstract

Section: Related Workmentioning

confidence: 99%

Deep Policy Dynamic Programming for Vehicle Routing Problems

Kool

Hoof

Gromicho

et al. 2022

Lecture Notes in Computer Science

108

134

View full text Add to dashboard Cite

show abstract

“…Both methods utilize advanced deep learning models such as Graph Neural Networks (GNNs) [36] and Graph Convolution Networks [37] to extract features of a graph and deploy Memory Augmented Neural Networks [38] and Recurrent Neural Networks (RNNs) [39] to pass sequential information. Both methods require training separate sets of parameters for different graph sizes to produce near-optimal solutions for TSP [40,41]. Table I summarizes studies focused on end-to-end supervised learning.…”

Section: A End-to-end Supervised Learning For Vrpsmentioning

confidence: 99%

“…Independently, [48] proposed a novel graph representation called Structure2Vec that can encode both the graph and the partial solution at any time step. [44] proposes fully attention-based encoder introducing transformers [49] to solve VRPs, while [41] uses GNNs, a deep learning model dedicated to learn graph information. In return, other studies adopt the proposed encoders including Pointer Networks [50], multi-head attention [51,52,53,54], recurrent neural networks (RNNs) [55], Structure2vec [56,57] and others.…”

Section: B End-to-end Deep Reinforcement Learning For Vrpsmentioning

confidence: 99%

“…[96] proposes to learn heatmaps for TSP on small sizes and apply them to arbitrarily large instances. Also, [41] investigates the generalization capabilities of supervised and reinforcement learning methods in end-to-end settings on TSP. The paper has shown that reinforcement learning is better than supervised learning because it does not learn from an existing solution.…”

Section: The Future Research Directionsmentioning

confidence: 99%

See 1 more Smart Citation

Learning to Solve Vehicle Routing Problems: A Survey

Bogyrbayeva¹,

Meraliyev²,

Mustakhov³

et al. 2022

Preprint

View full text Add to dashboard Cite

This paper provides a systematic overview of machine learning methods applied to solve NP-hard Vehicle Routing Problems (VRPs). Recently, there has been a great interest from both machine learning and operations research communities to solve VRPs either by pure learning methods or by combining them with the traditional hand-crafted heuristics. We present the taxonomy of the studies for learning paradigms, solution structures, underlying models, and algorithms. We present in detail the results of the state-of-the-art methods demonstrating their competitiveness with the traditional methods. The paper outlines the future research directions to incorporate learning-based solutions to overcome the challenges of modern transportation systems.

show abstract

“…In most existing research for VRP, fully connected undirected graphs are typical for modeling the mutual relationship between customers and vehicles. Such graphical representation facilitates a series of algorithms to exploit the graph neural networks (GNN) to learn the problem representations for solving VRP [9,81,102]. However, this dense topological structure is not applicable for JSSP since it can not describe the precedent constraints among operations.…”

Section: Introductionmentioning

confidence: 99%

Intelligent job shop scheduling via deep reinforcement learning over graphs

Zhang¹

View full text Add to dashboard Cite

show abstract

Learning the travelling salesperson problem requires rethinking generalization

Cited by 30 publications

References 41 publications

Deep Policy Dynamic Programming for Vehicle Routing Problems

Deep Policy Dynamic Programming for Vehicle Routing Problems

Learning to Solve Vehicle Routing Problems: A Survey

Intelligent job shop scheduling via deep reinforcement learning over graphs

Contact Info

Product

Resources

About