2019 International Conference on 3D Vision (3DV) 2019
DOI: 10.1109/3dv.2019.00029
|View full text |Cite
|
Sign up to set email alerts
|

To Complete or to Estimate, That is the Question: A Multi-Task Approach to Depth Completion and Monocular Depth Estimation

Abstract: Robust three-dimensional scene understanding is now an ever-growing area of research highly relevant in many realworld applications such as autonomous driving and robotic navigation. In this paper, we propose a multi-task learningbased model capable of performing two tasks:-sparse depth completion (i.e. generating complete dense scene depth given a sparse depth image as the input) and monocular depth estimation (i.e. predicting scene depth from a single RGB image) via two sub-networks jointly trained end to en… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(7 citation statements)
references
References 61 publications
(193 reference statements)
0
7
0
Order By: Relevance
“…Following synthetic-to-real depth estimation methods [41,42], we use as a domain adaptation baseline a CycleGAN [1] to adapt the images. To sparsify the synthetic depth, we use the real LiDAR masks [43], shown in Table 1 to perform better than Bernoulli sampling. The performance of this domain adaptation baseline is presented in Table 3 in DA Base.…”
Section: Methods Evaluationmentioning
confidence: 99%
See 2 more Smart Citations
“…Following synthetic-to-real depth estimation methods [41,42], we use as a domain adaptation baseline a CycleGAN [1] to adapt the images. To sparsify the synthetic depth, we use the real LiDAR masks [43], shown in Table 1 to perform better than Bernoulli sampling. The performance of this domain adaptation baseline is presented in Table 3 in DA Base.…”
Section: Methods Evaluationmentioning
confidence: 99%
“…For monocular depth estimation, two domain adaptation approaches used style-transfer methods [41,42]. Sparse-to-dense methods, however, have used synthetic data without any adaptation [4,6,43] so far. Training on synthetic data requires a high rendering quality [44].…”
Section: Depth Completion From Rgb and Lidarmentioning
confidence: 99%
See 1 more Smart Citation
“…Finally, Koch et al [12] make an analysis and comparisons between all the methods mentioned before. Atapour-Abarghouei and Breckon [13] use an arrangement of eight CNN models (U-Net) [14] that first estimates the semantic segmentation of the image and then estimates depth from the segmented objects. Lin et al [15] proposed an architecture that joins a CNN that estimates depth and separately, a CNN that estimates the semantic segmentation.…”
Section: Related Workmentioning
confidence: 99%
“…Currently, CNNs have been a dominant solution for depth completion [21,27,28,29,30,31,32,19,33,34,17,18,35,36], outperforming traditional methods by a wide margin. In specific, to learn representations of the irregular and sparse LiDAR data, Uhrig et al [9] proposed the sparsity-invariant convolutional operation.…”
Section: Related Workmentioning
confidence: 99%