2018 IEEE International Conference on Robotics and Automation (ICRA) 2018
DOI: 10.1109/icra.2018.8462979
|View full text |Cite
|
Sign up to set email alerts
|

Deep Auxiliary Learning for Visual Localization and Odometry

Abstract: Localization is an indispensable component of a robot's autonomy stack that enables it to determine where it is in the environment, essentially making it a precursor for any action execution or planning. Although convolutional neural networks have shown promising results for visual localization, they are still grossly outperformed by state-of-the-art local feature-based techniques. In this work, we propose VLocNet, a new convolutional neural network architecture for 6-DoF global pose regression and odometry es… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
155
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 234 publications
(156 citation statements)
references
References 28 publications
0
155
0
1
Order By: Relevance
“…All three methods operate on image sequences and thus use more information compared to Den-seVLAD, which only uses a single image for localization. Still, DenseVLAD outperforms VLocNet [74].…”
Section: Resultsmentioning
confidence: 93%
See 3 more Smart Citations
“…All three methods operate on image sequences and thus use more information compared to Den-seVLAD, which only uses a single image for localization. Still, DenseVLAD outperforms VLocNet [74].…”
Section: Resultsmentioning
confidence: 93%
“…The table also compares DenseVLAD and Den-seVLAD+Inter. against three sequence-based approaches, VLoc-Net [74], VLocNet++STL [53], and VLocNet++MTL [53]. All three directly fuse feature map responses from the previous time step t−1 into the CNN that predicts the pose at time t. VLocNet++MTL also integrates some form of higher-level scene understanding through semantic segmentation.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Kendall et al proposed PoseNet [22], a convolutional neural network that regresses the camera's 6 degrees of freedom pose relative to the scene from a single RGB image. A similar approach has been introduced by Valada et al with VLocNet [23]. The goal is to regress the global pose and simultaneously estimate the odometry between two frames.…”
Section: Related Workmentioning
confidence: 99%