2021
DOI: 10.48550/arxiv.2109.06163
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

On the Sins of Image Synthesis Loss for Self-supervised Depth Estimation

Zhaoshuo Li,
Nathan Drenkow,
Hao Ding
et al.

Abstract: Scene depth estimation from stereo and monocular imagery is critical for extracting 3D information for downstream tasks such as scene understanding. Recently, learning-based methods for depth estimation have received much attention due to their high performance and flexibility in hardware choice. However, collecting ground truth data for supervised training of these algorithms is costly or outright impossible. This circumstance suggests a need for alternative learning approaches that do not require correspondi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 30 publications
0
3
0
Order By: Relevance
“…Self-supervised depth learning uses image synthesis to compute the photometric loss. However, traditional synthesis losses to regress depth are limited due to the aleatoric uncertainty of input images [50]. Poggi et al [12] are the first ones addressing such problem by including aleatoric uncertainty in the self-supervised monocular loss.…”
Section: B Single-view Depth Learningmentioning
confidence: 99%
“…Self-supervised depth learning uses image synthesis to compute the photometric loss. However, traditional synthesis losses to regress depth are limited due to the aleatoric uncertainty of input images [50]. Poggi et al [12] are the first ones addressing such problem by including aleatoric uncertainty in the self-supervised monocular loss.…”
Section: B Single-view Depth Learningmentioning
confidence: 99%
“…Learning-based stereo matching methods can be roughly divided into fullysupervised methods [15,16] and self-supervised methods [17,18]. The former uses accurate ground truth disparity maps for training, while the latter relies on formulating image synthesis loss.…”
Section: Introductionmentioning
confidence: 99%
“…13,14 Learning-based stereo matching methods could be classified into fully-supervised methods 15,16 and self-supervised methods. 17,18 The former class uses accurate ground truth disparity maps for training, whereas the latter class relies on formulating image synthesis loss. In the era of deep learning, learning-based stereo matching methods are reported to achieve high performance on several public benchmark datasets and outperform traditional methods.…”
Section: Introductionmentioning
confidence: 99%