2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC) 2017
DOI: 10.1109/itsc.2017.8317738
|View full text |Cite
|
Sign up to set email alerts
|

Simultaneous policy learning and latent state inference for imitating driver behavior

Abstract: Human driving depends on latent states, such as aggression and intent, that cannot be directly observed. In this work, we propose a method for learning driver models that can account for unobserved states. When trained on a synthetic dataset, our model is able to learn encodings for vehicle trajectories that distinguish between four distinct classes of driver behavior. Such encodings are learned without any knowledge of the number of driver classes or any objective that directly requires the model to learn enc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
24
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 31 publications
(25 citation statements)
references
References 17 publications
(15 reference statements)
0
24
0
Order By: Relevance
“…For memory and prediction, LSTM, a type of RNN, has been adopted by mMP studies (30,52,127) to address the impact of memory on future speed choice. Lefe`vre et al (123) conducted a comparative evaluation of parametric and non-parametric approaches for speed prediction during highway driving.…”
Section: Memory and Predictionmentioning
confidence: 99%
“…For memory and prediction, LSTM, a type of RNN, has been adopted by mMP studies (30,52,127) to address the impact of memory on future speed choice. Lefe`vre et al (123) conducted a comparative evaluation of parametric and non-parametric approaches for speed prediction during highway driving.…”
Section: Memory and Predictionmentioning
confidence: 99%
“…Some works distinguish between distracted and attentive drivers for behavior prediction and cooperative planning [14], [15]. Driving style recognition has been addressed with both unsupervised and supervised learning methods, which we will discuss in detail below [3], [8], [16], [17].…”
Section: A Driver Internal State Estimationmentioning
confidence: 99%
“…Morton et al propose a method that first encodes driving trajectories with different driving styles to a latent space. Then, the latent encodings and the current driver states are fed into a feedforward policy that produces multimodal actions [8]. The encoder and the policy are optimized jointly.…”
Section: A Driver Internal State Estimationmentioning
confidence: 99%
See 1 more Smart Citation
“…However, BC has the compounding errors [35] and requires enormous training data [36]- [38]. Even with the recent advances in deep learning techniques, BC approaches [39], [40], trained with large datasets using deep learning, still showed the compounding errors during simulations. On the contrary, IRL tries to recover the reward function followed by the experts, assuming that they follow an optimal policy.…”
Section: Introductionmentioning
confidence: 99%