2022 China Automation Congress (CAC)
DOI: 10.1109/cac57257.2022.10055364
A Robust Motion Planning Algorithm Based on Reinforcement Learning

Cited by 2 publications (5 citation statements)
References 9 publications
“…In terms of itinerary planning, Zhou Xiao [20] proposed a dynamic-programming-based tourism route planning method that comprehensively considers distance, the number of transfers between public transportation systems, and average road conditions. Xu Ke [21] proposed a reinforcement-learning-based tourism route planning algorithm that avoids areas with frequent traffic accidents and congestion. Huang Zebin [22] constrained tourists' travel time by introducing time window coefficients into the ant colony algorithm's heuristic function.…”
Section: Research on Itinerary Planning Algorithms
confidence: 99%
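The time-window idea quoted above can be illustrated with a short sketch: the ant colony desirability term is scaled by a coefficient that penalizes arrivals outside a spot's visiting window. This is a minimal sketch under assumed definitions; the window bounds, the exponential penalty shape, and all function names are illustrative, not the formulation from the cited paper.

```python
import math

def time_window_coefficient(arrival, open_t, close_t, beta=0.1):
    """Assumed form: 1.0 inside the visiting window, decaying
    exponentially with the size of the violation outside it."""
    if open_t <= arrival <= close_t:
        return 1.0
    violation = (open_t - arrival) if arrival < open_t else (arrival - close_t)
    return math.exp(-beta * violation)

def aco_heuristic(distance, arrival, open_t, close_t):
    """Standard ACO desirability (1/distance) scaled by the
    time-window coefficient."""
    return (1.0 / max(distance, 1e-9)) * time_window_coefficient(
        arrival, open_t, close_t)

# Example: a spot 2 km away whose window is 09:00-11:00
# (times in minutes since midnight); arriving at 10:00 gives full desirability.
print(aco_heuristic(2.0, arrival=10 * 60, open_t=9 * 60, close_t=11 * 60))
```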
“…In each iteration, we start from the user-selected base node combination (line 6), select nodes to add to the current combination, and perform itinerary planning until inserting a scenic spot would exceed the cost limit or the total time limit, or the spot is inaccessible (lines 7-15). The successful itinerary from that round is then recorded as the recommendation for the current iteration, and itinerary-related parameters are calculated and propagated back to the relevant candidate nodes (lines 16-27). Finally, the recommendation with the maximum objective function value among these itinerary recommendations is returned (line 28).…”
Section: Tour Itinerary Recommendation Algorithm Based on Tourist Com…
confidence: 99%
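The loop quoted above can be sketched as a greedy insertion procedure. All names, limits, and stopping tests below are assumptions reconstructed from the quoted description, not the cited paper's code; in particular, the paper propagates itinerary parameters back to candidate nodes between iterations, which this sketch replaces with a random candidate order.

```python
import random

def recommend_itinerary(base_nodes, candidates, step_cost, step_time, reachable,
                        cost_limit, time_limit, objective, n_iterations=10):
    """Greedy insertion sketch of the quoted loop (illustrative names).
    base_nodes is assumed non-empty; step_cost/step_time/reachable take
    (last_spot, next_spot)."""
    recommendations = []
    for _ in range(n_iterations):
        # Stand-in for the paper's parameter feedback between iterations.
        order = random.sample(candidates, len(candidates))
        itinerary = list(base_nodes)      # user-selected base node combination
        total_cost = total_time = 0.0
        for spot in order:                # insert spots until a limit trips
            if spot in itinerary or not reachable(itinerary[-1], spot):
                continue
            c = step_cost(itinerary[-1], spot)
            t = step_time(itinerary[-1], spot)
            if total_cost + c > cost_limit or total_time + t > time_limit:
                break                     # cost or time limit ends this round
            itinerary.append(spot)
            total_cost += c
            total_time += t
        recommendations.append(itinerary)  # record this round's itinerary
    # Return the recommendation with the maximum objective function value.
    return max(recommendations, key=objective)
```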
“…The limitations of related work are summarized below:

Zhou Jibiao, Zhang Haisu, et al. [1,2]: risks in the areas surrounding COVID-19 were not considered.
Ma Chang-xi et al. [3]: other modes of transportation are not considered.
Tu Qiang et al. [4]: only the car network is considered.
Jia Fuqiang et al. [5]: the objective function is relatively simple.
Subramani et al. [6]: the risk-avoidance path has a high probability of error.
Liping Fu et al. [7]: models and algorithms remain to be expanded.
A. Khani et al. [8]: risk factors in travel are not considered.
Xu Ke, Liu Sijia, Luo Fei, et al. [9-11]: the algorithm and model need further improvement.
Wang Keyin et al. [12]: the model is not suitable for urban traffic path planning.
Wang A. et al. [13]: the differing preferences and requirements of passengers are not considered.
Levy S. et al. [14]: the design of the reward function needs further improvement.

Therefore, we use the SUMO simulator to build the actual road network model and design a method to extract the road network impedance matrix, which greatly improves the efficiency and accuracy of road network modeling. We establish a reinforcement learning path planning model in the urban traffic context and design a search mechanism to avoid areas at risk from the COVID-19 epidemic.…”
Section: Author Limitations
confidence: 99%
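The impedance-matrix extraction step mentioned above can be illustrated with sumolib, SUMO's Python library for reading exported road networks. The quoted text does not define its impedance, so the free-flow travel time used below is an assumption, and the function name and file name are illustrative.

```python
import sumolib  # ships with SUMO; reads .net.xml road networks

def impedance_matrix(net_file):
    """Illustrative impedance extraction: free-flow travel time between
    directly connected edges (assumed definition of 'impedance')."""
    net = sumolib.net.readNet(net_file)
    edges = net.getEdges()
    index = {e.getID(): i for i, e in enumerate(edges)}
    n = len(edges)
    INF = float("inf")
    M = [[INF] * n for _ in range(n)]    # INF marks no direct connection
    for e in edges:
        i = index[e.getID()]
        M[i][i] = 0.0
        for nxt in e.getOutgoing():      # edges reachable through a junction
            M[i][index[nxt.getID()]] = e.getLength() / e.getSpeed()
    return M, index

# Usage (assuming a SUMO network file exported beforehand):
# M, index = impedance_matrix("city.net.xml")
```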
“…We used the RRL-APF algorithm to run simulation experiments of up to 300 rounds of learning, where one complete round of learning is defined as the agent traveling from the start point to the end point through exploration. To verify the superiority of the RRL-APF algorithm in convergence speed and other respects, it was compared with the Q-Learning [27] algorithm, the Sarsa [28] algorithm, and the RLAPF [9,12] algorithm under the same start point, end point, and epidemic risk location information.…”
Section: Algorithm Verification
confidence: 99%
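The Q-Learning baseline named in the comparison above is the standard tabular method, sketched below. The environment interface, rewards, and hyperparameters are illustrative assumptions, not the cited experiment's settings.

```python
import random
from collections import defaultdict

def q_learning(env_step, start, actions, episodes=300,
               alpha=0.1, gamma=0.9, epsilon=0.1, max_steps=500):
    """Tabular Q-Learning sketch (illustrative hyperparameters).
    env_step(state, action) -> (next_state, reward, done) is assumed;
    done signals that the goal (end point) was reached."""
    Q = defaultdict(float)
    for _ in range(episodes):             # one episode = start -> end point
        state = start
        for _ in range(max_steps):
            if random.random() < epsilon:  # epsilon-greedy exploration
                action = random.choice(actions)
            else:
                action = max(actions, key=lambda a: Q[(state, a)])
            nxt, reward, done = env_step(state, action)
            best_next = max(Q[(nxt, a)] for a in actions)
            # Standard temporal-difference update.
            Q[(state, action)] += alpha * (
                reward + gamma * best_next - Q[(state, action)])
            state = nxt
            if done:
                break
    return Q
```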