Abstract:Nowadays, ridesharing has become one of the most popular services offered by online ride-hailing platforms (e.g., Uber and Didi Chuxing). Existing ridesharing platforms adopt the strategy that dispatches orders over the entire city at a uniform time interval. However, the uneven spatio-temporal order distributions in realworld ridesharing systems indicate that such an approach is suboptimal in practice. Thus, in this paper, we exploit adaptive dispatching intervals to boost the platform's profit under a guaran… Show more
“…Multi-Region UNIFORM (MRUNIFORM). The UNIF-ORM algorithm [16] is a commonly used comparison algorithm, which will do the matching for every n time slots. We modify the UNIFORM algorithm to increase its dynamic matching property, with a half probability of matching at the current time slot and a half probability of not making a match, which is called the MRUNIFORM.…”
Section: A Benchmark Approaches and Metricsmentioning
For online ride-hailing platforms, choosing the right time to match idle vehicles with passengers is one of the most important factors affecting the platform's profit. On one hand, vehicles and passengers arrive dynamically, and an appropriate delayed matching may generate a highly efficient matching result with more values. On the other hand, different regions may have different states of supply (vehicles) and demand (passengers), and the matching time should be different. At this moment, we need an efficient matching time strategy that takes into account matching time and regional differences to maximize the platform's long-term profit. In this paper, we propose a dynamic matching time algorithm based on multi-agent reinforcement learning, which is called Multi-Region Differentiated Matching Decision. Firstly, we describe the order matching process and then model it as a decentralized partially observable Markov decision process (Dec-POMDP). Secondly, considering that there are regional differences in supply and demand, we divide the overall area based on historical data and propose an algorithm based on multi-agent reinforcement learning to realize multiregion differentiated dynamic matching. Finally, we conduct extensive experiments to evaluate our matching algorithm against benchmark algorithms in a real-world dataset. The experimental results show that our algorithm can outperform benchmark algorithms.
“…Multi-Region UNIFORM (MRUNIFORM). The UNIF-ORM algorithm [16] is a commonly used comparison algorithm, which will do the matching for every n time slots. We modify the UNIFORM algorithm to increase its dynamic matching property, with a half probability of matching at the current time slot and a half probability of not making a match, which is called the MRUNIFORM.…”
Section: A Benchmark Approaches and Metricsmentioning
For online ride-hailing platforms, choosing the right time to match idle vehicles with passengers is one of the most important factors affecting the platform's profit. On one hand, vehicles and passengers arrive dynamically, and an appropriate delayed matching may generate a highly efficient matching result with more values. On the other hand, different regions may have different states of supply (vehicles) and demand (passengers), and the matching time should be different. At this moment, we need an efficient matching time strategy that takes into account matching time and regional differences to maximize the platform's long-term profit. In this paper, we propose a dynamic matching time algorithm based on multi-agent reinforcement learning, which is called Multi-Region Differentiated Matching Decision. Firstly, we describe the order matching process and then model it as a decentralized partially observable Markov decision process (Dec-POMDP). Secondly, considering that there are regional differences in supply and demand, we divide the overall area based on historical data and propose an algorithm based on multi-agent reinforcement learning to realize multiregion differentiated dynamic matching. Finally, we conduct extensive experiments to evaluate our matching algorithm against benchmark algorithms in a real-world dataset. The experimental results show that our algorithm can outperform benchmark algorithms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.