2022
DOI: 10.1287/trsc.2021.1042

Dynamic Ride-Hailing with Electric Vehicles

Abstract: We consider the problem of an operator controlling a fleet of electric vehicles for use in a ride-hailing service. The operator, seeking to maximize profit, must assign vehicles to requests as they arise as well as recharge and reposition vehicles in anticipation of future requests. To solve this problem, we employ deep reinforcement learning, developing policies whose decision making uses Q-value approximations learned by deep neural networks. We compare these policies against a reoptimizati…

Cited by 50 publications (22 citation statements) · References 40 publications
“…Multi-agent RL-based algorithms have also been introduced for the Dial-a-Ride Problem with multiple vehicles and stochastic orders (Qin et al. 2020, Kullman et al. 2020, Holler et al. 2019). Qin et al. (2020) and Tang et al. (2019) implemented Q-values in the form of one Deep Q-Network (DQN) for each vehicle and used a central combinatorial optimization problem as a coordinator to assign orders to vehicles.…”
Section: MDP-Based Solution Methods for Stochastic and Dynamic VRPs
confidence: 99%
“…Qin et al. (2020) and Tang et al. (2019) implemented Q-values in the form of one Deep Q-Network (DQN) for each vehicle and used a central combinatorial optimization problem as a coordinator to assign orders to vehicles. Kullman et al. (2020) adopted an attention encoder-decoder as the central coordinator and trained the model with Actor-Critic. For a similar problem, Holler et al. (2019) compared Actor-Critic and DQN methods without observing significant performance differences.…”
Section: MDP-Based Solution Methods for Stochastic and Dynamic VRPs
confidence: 99%
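The "per-vehicle Q-values plus central combinatorial coordinator" pattern described in these statements can be sketched in a few lines. This is a minimal illustration, not the cited papers' implementation: the Q-values are made-up numbers standing in for per-vehicle DQN outputs, and the brute-force search stands in for whatever assignment solver (e.g. Hungarian algorithm or ILP) a real system would use.

```python
# Sketch (assumption, not the cited papers' code): each vehicle carries its own
# learned Q-value estimator; a central coordinator assigns orders to vehicles
# by maximizing the total Q-value over all one-to-one assignments.
from itertools import permutations


def assign_orders(q_values):
    """q_values[v][o] = vehicle v's Q-value estimate for serving order o.

    Returns (best_total, assignment) with assignment[v] = index of the order
    given to vehicle v. Assumes at least as many orders as vehicles.
    """
    n_vehicles = len(q_values)
    n_orders = len(q_values[0])
    best_total, best_assign = float("-inf"), None
    # Brute-force the combinatorial assignment; fine for a toy instance.
    for perm in permutations(range(n_orders), n_vehicles):
        total = sum(q_values[v][o] for v, o in enumerate(perm))
        if total > best_total:
            best_total, best_assign = total, list(perm)
    return best_total, best_assign


# Two vehicles, three open orders; Q-values are illustrative numbers.
q = [[0.9, 0.2, 0.4],
     [0.3, 0.8, 0.1]]
total, assignment = assign_orders(q)
```

On this toy instance the coordinator gives order 0 to vehicle 0 and order 1 to vehicle 1, even though greedy per-vehicle choices could conflict on a shared best order.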
“…This is not the case in our multi-agent charging station search setting: here, each agent terminates her search once she has found at least one non-shareable available resource. Existing work on multi-agent settings for EVs mostly focuses on autonomous EV fleet management, such as ride-sharing planning (Al-Kanj et al. 2020) or online request matching for ride-hailing (Kullman et al. 2021a), and does not cover stochastic resource search problems.…”
Section: Related Literature
confidence: 99%
“…On the electric vehicle operations side of the problem, many studies have focused on simple myopic policies [13,3,24,4], while others have attempted to incorporate planning for future demand [1,40,20,17], though these methods do not necessarily scale to operational size.…”
Section: Literature Review
confidence: 99%
“…One approach is to use approximate dynamic programming (ADP), such as [1], which uses ADP to determine when vehicles get new passengers and whether vehicles should charge. In [40] and [20], deep reinforcement learning is used to develop policies for vehicles to determine when to accept new customers and when to charge. They suggest that the learning process allows the system to anticipate future demand.…”
Section: Literature Review
confidence: 99%
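The serve-or-charge decision these statements describe reduces, at execution time, to acting greedily with respect to a learned value function. A hedged sketch follows; the state fields, action names, and the hand-written `q_approx` are illustrative assumptions standing in for a trained neural approximator, not any cited paper's API.

```python
# Sketch: greedy action selection over a learned Q-value approximation,
# as used by the deep-RL policies discussed above. All names are illustrative.
def greedy_action(state, q_approx, actions=("serve", "charge", "idle")):
    """Return the action with the highest approximate Q-value in this state."""
    return max(actions, key=lambda a: q_approx(state, a))


def q_approx(state, action):
    """Stand-in for a trained network: favor charging when battery is low,
    serving when the vehicle is charged and demand is high."""
    if action == "charge":
        return 1.0 - state["battery"]
    if action == "serve":
        return state["battery"] * state["demand"]
    return 0.1  # idling has a small fixed value


low_battery = {"battery": 0.15, "demand": 0.6}
high_battery = {"battery": 0.9, "demand": 0.6}
```

Under this stand-in approximator, the policy sends a nearly depleted vehicle to charge and a well-charged vehicle to serve demand, which is the anticipatory behavior the cited works attribute to learning.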