2019 International Conference on Robotics and Automation (ICRA) 2019
DOI: 10.1109/icra.2019.8793512
|View full text |Cite
|
Sign up to set email alerts
|

GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks

Abstract: In the last decade, supervised deep learning approaches have been extensively employed in visual odometry (VO) applications, which is not feasible in environments where labelled data is not abundant. On the other hand, unsupervised deep learning approaches for localization and mapping in unknown environments from unlabelled data have received comparatively less attention in VO research. In this study, we propose a generative unsupervised learning framework that predicts 6-DoF pose camera motion and monocular d… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
78
0
4

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 141 publications
(93 citation statements)
references
References 39 publications
(60 reference statements)
0
78
0
4
Order By: Relevance
“…Years Training set Sup Semi-sup Unsup Main contributions GAN, LSTM, mask Almalioglu et al [91] Mono. sequences √ GAN, LSTM Table 2 Monocular depth results of semi-supervised and unsupervised methods on the KITTI dataset [32].…”
Section: Supervised (Sup) Manner Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Years Training set Sup Semi-sup Unsup Main contributions GAN, LSTM, mask Almalioglu et al [91] Mono. sequences √ GAN, LSTM Table 2 Monocular depth results of semi-supervised and unsupervised methods on the KITTI dataset [32].…”
Section: Supervised (Sup) Manner Methodsmentioning
confidence: 99%
“…In refs. [83,90,91], the generator consists of a pose network and a depth network, and the output of networks is used to synthesize images by view reconstruction. Then, a discriminator is designed to distinguish the real and predicted depth maps.…”
Section: Unsupervised Monocular Depth Estimationmentioning
confidence: 99%
“…Recently, another interesting approach to estimate the 6-DoF pose was published by Almalioglu et al [1]. They predict the pose of consecutive frames from different perspectives with a Generative Adversarial Network (GAN).…”
Section: Deep Learning Methodsmentioning
confidence: 99%
“…To reduce the dependency of labeled data, Zhou et al [10] propose an unsupervised method to predict image depth together with ego-motion using two networks, then compute the re-projected image residual as the loss function. After that, many enhanced works are proposed by adding additional 3D geometric loss [11], binocular loss [12] , deep feature reconstruction loss [13], dynamic and optical flow loss [14] or adversarial loss [15]. To achieve robustness of the system, Klodt et al [16] and Yang et al [17] further propose to estimate uncertainty of estimated ego-motion and depth.…”
Section: Introductionmentioning
confidence: 99%