2019 International Conference on 3D Vision (3DV) 2019
DOI: 10.1109/3dv.2019.00054
|View full text |Cite
|
Sign up to set email alerts
|

Enhancing Self-Supervised Monocular Depth Estimation with Traditional Visual Odometry

Abstract: Estimating depth from a single image represents an attractive alternative to more traditional approaches leveraging multiple cameras. In this field, deep learning yielded outstanding results at the cost of needing large amounts of data labeled with precise depth measurements for training. An issue softened by self-supervised approaches leveraging monocular sequences or stereo pairs in place of expensive ground truth depth annotations. This paper enables to further improve monocular depth estimation by integrat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
25
0
2

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 42 publications
(30 citation statements)
references
References 48 publications
2
25
0
2
Order By: Relevance
“…Supervised ACAN [46] Encoder-Decoder DenseDepth [47] Encoder-Decoder DORN [18] CNN VNL [48] Encoder-Decoder BTS [49] DeepV2D [50] Encoder-Decoder CNN LISM [51] Encoder-Decoder Self-supervised monoResMatch [38] CNN PackNet-SfM [52] CNN VOMonodepth [53] Auto-Decoder monodepth2 [42] CNN GASDA [54] CNN Semi-supervised…”
Section: Emdeom [32] Fcmentioning
confidence: 99%
“…Supervised ACAN [46] Encoder-Decoder DenseDepth [47] Encoder-Decoder DORN [18] CNN VNL [48] Encoder-Decoder BTS [49] DeepV2D [50] Encoder-Decoder CNN LISM [51] Encoder-Decoder Self-supervised monoResMatch [38] CNN PackNet-SfM [52] CNN VOMonodepth [53] Auto-Decoder monodepth2 [42] CNN GASDA [54] CNN Semi-supervised…”
Section: Emdeom [32] Fcmentioning
confidence: 99%
“…Among these, ref. [12] is the first notable attempt leveraging stereo pairs, eventually improved exploiting traditional stereo algorithm [13,14], visual odometry supervision [15,16] or 3D movies [17]. On the other hand, methods leveraging monocular videos do not even require a stereo camera at training time, at the cost of learning depth estimation up to a scale factor.…”
Section: Related Workmentioning
confidence: 99%
“…Bian et al [4] improve the scale consistency via using depth clues. To leverage the privilege of traditional 3D geometry, Andraghetti et al [2] enhance the self-supervised framework by traditional visual odometry. There also exist works [48,63,69] that constrain the network via introducing extra information (optical flows.…”
Section: Self-supervised Depth Estimationmentioning
confidence: 99%