2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.01003
|View full text |Cite
|
Sign up to set email alerts
|

Learning Monocular Depth Estimation Infusing Traditional Stereo Knowledge

Abstract: Depth estimation from a single image represents a fascinating, yet challenging problem with countless applications. Recent works proved that this task could be learned without direct supervision from ground truth labels leveraging image synthesis on sequences or stereo pairs. Focusing on this second case, in this paper we leverage stereo matching in order to improve monocular depth estimation. To this aim we propose monoResMatch, a novel deep architecture designed to infer depth from a single input image by sy… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
173
0
3

Year Published

2019
2019
2020
2020

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 202 publications
(176 citation statements)
references
References 59 publications
0
173
0
3
Order By: Relevance
“…Similar to before, we also show results for our method without pretraining ("Ours HR Resnet50 w/o pretraining"). Our non-pretrained model beats SuperDepth [26] in six out of seven metrics (tied in one), and compares favourably to the concurrent work monoResMatch [30], which makes use of a significantly more complex network compared to our encoder-decoder architecture.…”
Section: Depth From Color Tournamentmentioning
confidence: 77%
See 2 more Smart Citations
“…Similar to before, we also show results for our method without pretraining ("Ours HR Resnet50 w/o pretraining"). Our non-pretrained model beats SuperDepth [26] in six out of seven metrics (tied in one), and compares favourably to the concurrent work monoResMatch [30], which makes use of a significantly more complex network compared to our encoder-decoder architecture.…”
Section: Depth From Color Tournamentmentioning
confidence: 77%
“…The concurrent work monoResMatch by Tosi et al [30] also exploits proxy ground truth labels generated with a traditional stereo matching method [13]. The inclusion of the proxy supervision is shown to greatly improve accuracy over using a standard self-supervised loss.…”
Section: Additional Supervisionmentioning
confidence: 99%
See 1 more Smart Citation
“…Among methods tested which are dedicated to monocular cameras, we can find approaches such as SfmLearner [28], MonoResMatch [29], Monodepth [14] and Monodepth2 [30]. Except [28], which is trained on monocular image sequences by learning the structure from motion elements using three frames snippet, all of these algorithms require a pair of images from a calibrated stereoscopic camera in order to be able to do the training.…”
Section: Algorithm For Monocular Cameramentioning
confidence: 99%
“…On the other hand, while existing works leverage such knowledge at training time only, we deploy a monocular VO algorithm to obtain geometrical priors to feed our network with. Being such priors sourced by a monocular setup, they are available at inference time in contrast to others available from stereo images [40,46] and thus available at training time only.…”
Section: Related Workmentioning
confidence: 99%