2022
DOI: 10.1109/tits.2021.3056120
|View full text |Cite
|
Sign up to set email alerts
|

Heterogeneous Attentions for Solving Pickup and Delivery Problem via Deep Reinforcement Learning

Abstract: Recently, there is an emerging trend to apply deep reinforcement learning to solve the vehicle routing problem (VRP), where a learnt policy governs the selection of next node for visiting. However, existing methods could not handle well the pairing and precedence relationships in the pickup and delivery problem (PDP), which is a representative variant of VRP. To address this challenging issue, we leverage a novel neural network integrated with a heterogeneous attention mechanism to empower the policy in deep r… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
31
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
2

Relationship

3
6

Authors

Journals

citations
Cited by 54 publications
(31 citation statements)
references
References 24 publications
0
31
0
Order By: Relevance
“…The node coordinates are also generated uniformly from the unit square for all three problems, following [21,23]. For CVRP, the demands of customers are generated uniformly from integers {1..9} with the capacity fixed as 40 + 0.1 × |V |, compatible with the largest CVRP (100 nodes) studied in [21].…”
Section: Experiments On Other Routing Problemsmentioning
confidence: 99%
“…The node coordinates are also generated uniformly from the unit square for all three problems, following [21,23]. For CVRP, the demands of customers are generated uniformly from integers {1..9} with the capacity fixed as 40 + 0.1 × |V |, compatible with the largest CVRP (100 nodes) studied in [21].…”
Section: Experiments On Other Routing Problemsmentioning
confidence: 99%
“…Some literature proposes special designs for the coupled challenge. For instance, Li et al [111] propose a special attention-based structure to leverage different relationships among all customer nodes in VRP with pick and delivery. Six different attention mechanisms in total are computed as a thorough measurement upon all nodes.…”
Section: Coupled Spatial-temporal Representationsmentioning
confidence: 99%
“…Recently, researchers tend to apply deep reinforcement learning (DRL) to automatically learn the searching rules in heuristic methods for solving routing problems including CVRP and TSP [19]- [24], by discovering the underlying patterns from a large number of instances. Generally, those DRL models are categorized as two classes, i.e., construction and improvement methods, respectively.…”
Section: Introductionmentioning
confidence: 99%