2021
DOI: 10.1109/access.2021.3131799
|View full text |Cite
|
Sign up to set email alerts
|

A Hybrid Reinforcement Learning-Based Model for the Vehicle Routing Problem in Transportation Logistics

Abstract: Currently, the number of deliveries handled by transportation logistics is rapidly increasing because of the significant growth of the e-commerce industry, resulting in the need for improved functional vehicle routing measures for logistic companies. The effective management of vehicle routing helps companies reduce operational costs and increases its competitiveness. The vehicle routing problem (VRP) seeks to identify optimal routes for a fleet of vehicles to deliver goods to customers while simultaneously co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(3 citation statements)
references
References 26 publications
0
3
0
Order By: Relevance
“…Solving large sized problems to optimality in reasonable time is usually unrealistic. Therefore, developing efficient heuristic or meta-heuristic algorithms to quickly obtain good enough solutions would be a better alternative [34].…”
Section: The Solution Methodsmentioning
confidence: 99%
“…Solving large sized problems to optimality in reasonable time is usually unrealistic. Therefore, developing efficient heuristic or meta-heuristic algorithms to quickly obtain good enough solutions would be a better alternative [34].…”
Section: The Solution Methodsmentioning
confidence: 99%
“…Lin et al 29 established an end-to-end DRL framework to effectively address the electric VRP considering time windows. Phiboonbanakit et al 30 formulated a novel optimization model that integrates RL with a complementary tree-based regression method to efficiently solve the VRP under changing requirements and uncertainties in the transportation environment. Fellek et al 31 introduced an attention-based end-to-end DRL model that incorporates edge information between nodes to enable comprehensive graph representation learning for solving the VRP.…”
Section: Introductionmentioning
confidence: 99%
“…In [19], the proximal policy optimization (PPO) is utilized to address the multiagent formation control with obstacle avoidance. Phiboonbanakit et al develop a hybrid optimization model via RL and a complementary tree-based regression method to solve the vehicle routing problem in transportation logistics [20]. An improved Dyna-Q algorithm is proposed to deal with the mobile robot path planning in an unknown environment [21], in which the action-selection strategy, ε -greedy policy, and heuristic reward function and actions are utilized to enhance the performance.…”
Section: Introductionmentioning
confidence: 99%