2021
DOI: 10.48550/arxiv.2110.00044
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Trajectory Planning with Deep Reinforcement Learning in High-Level Action Spaces

Abstract: This paper presentsa technique for trajectory planning based on continuously parameterized high-level actions (motion primitives) of variable duration. This technique leverages deep reinforcement learning (Deep RL) to formulate a policy which is suitable for real-time implementation. There is no separation of motion primitive generation and trajectory planning: each individual short-horizon motion is formed during the Deep RL training to achieve the full-horizon objective. Effectiveness of the technique is dem… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 54 publications
(93 reference statements)
0
3
0
Order By: Relevance
“…Under such a context, continuous feasibility and control liveliness are key concerns ensuring the accomplishment of higher level tasks, such as overtaking another slow vehicle, merging into a busy fast lane, or avoiding an obstacle at high speeds [117]. When selecting mode from a large library of complicated dynamic modes become computationally demanding, learning based approach such as reinforcement learning can be leveraged to efficiently learn the proper mode arbitration decision according to the environment [118], [119].…”
Section: Referencementioning
confidence: 99%
“…Under such a context, continuous feasibility and control liveliness are key concerns ensuring the accomplishment of higher level tasks, such as overtaking another slow vehicle, merging into a busy fast lane, or avoiding an obstacle at high speeds [117]. When selecting mode from a large library of complicated dynamic modes become computationally demanding, learning based approach such as reinforcement learning can be leveraged to efficiently learn the proper mode arbitration decision according to the environment [118], [119].…”
Section: Referencementioning
confidence: 99%
“…One possible solution is to apply Deep Reinforcement Learning (RL), which has demonstrated its effectiveness in resolving a variety of optimal control problems across different őelds, such as autonomous driving and game playing. As one of emerging technologies, Deep RL exhibits great promises to enhance the efficiency of trajectory planning [15,16], especially in systems with complicated constraints and dynamics. It also demonstrates the ability to maintain good performance in stochastic environments [17].…”
Section: Introductionmentioning
confidence: 99%
“…Its thoughtfully engineered reward shaping is one main factor in taking full advantage of its Deep RL algorithmÐSoft Actor-Critic (SAC) [21]. A recent study by Williams et al [15] also incorporates signs of motion primitives. However, instead of creating a non-intuitive motion primitive planning, they proposed a technique called high-level action space, which allows variable time-stepping in controlling the vehicle.…”
Section: Introductionmentioning
confidence: 99%