2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00839
|View full text |Cite
|
Sign up to set email alerts
|

Self-Supervised Simultaneous Multi-Step Prediction of Road Dynamics and Cost Map

Abstract: While supervised learning is widely used for perception modules in conventional autonomous driving solutions, scalability is hindered by the huge amount of data labeling needed. In contrast, while end-to-end architectures do not require labeled data and are potentially more scalable, interpretability is sacrificed. We introduce a novel architecture that is trained in a fully self-supervised fashion for simultaneous multi-step prediction of space-time cost map and road dynamics. Our solution replaces the manual… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 37 publications
(44 reference statements)
0
4
0
Order By: Relevance
“…Trajectory prediction is of vital importance to many artificial intelligent applications. There is a large body of works on this topic designed to predict behavior of pedestrians [21,28,37,45,50] and vehicles [1,10,11,13,19,36].…”
Section: Trajectory Prediction Methodsmentioning
confidence: 99%
“…Trajectory prediction is of vital importance to many artificial intelligent applications. There is a large body of works on this topic designed to predict behavior of pedestrians [21,28,37,45,50] and vehicles [1,10,11,13,19,36].…”
Section: Trajectory Prediction Methodsmentioning
confidence: 99%
“…With the objective of encoding traffic scenes similar to ours (albeit not in the context of motion planning), encoder architectures for learning representations of occupancy maps have been proposed [50]- [52]. Using graphical or otherwise spatially-aware encoders similar to ours, recent works such as [53]- [57] predict occupancy grids [58] as an intermediate learning target for guiding the training of neural motion planners. However, these approaches do not provide global, low-dimensional representations appropriate for decoupled RL agents.…”
Section: Applications To Motion Planningmentioning
confidence: 99%
“…Semantic scene completion is a similar task to semantic mapping, however it is generally defined using sensor data from only a single frame. While there exist some deep learning 3D mapping methods, it is not a common task due to the lack of accurate outdoor dynamic data to supervise and quantify performance on [17,[45][46][47][48]. SSC is currently a difficult task with minimal generalizability to real life due to the lack of accurately labeled dynamic scenes, as discussed in the previous section.…”
Section: B Semantic Scene Completionmentioning
confidence: 99%
“…Dynamic occupancy maps construct binary labels for cells indicating free or occupied, and extend their domain to scenes with dynamic actors by incorporating scene dynamics [13][14][15]. While learning-based approaches have been attempted in 2D [16,17], most 3D maps rely on feature engineering which can decrease performance and efficiency.…”
Section: Introductionmentioning
confidence: 99%