Proceedings of the 30th ACM International Conference on Multimedia 2022
DOI: 10.1145/3503161.3548394
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Camera Collaborative Depth Prediction via Consistent Structure Estimation

Abstract: Depth map estimation from images is an important task in robotic systems. Existing methods can be categorized into two groups including multi-view stereo and monocular depth estimation. The former requires cameras to have large overlapping areas and sufficient baseline between cameras, while the latter that processes each image independently can hardly guarantee the structure consistency between cameras. In this paper, we propose a novel multicamera collaborative depth prediction method that does not require l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
17
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(17 citation statements)
references
References 69 publications
0
17
0
Order By: Relevance
“…1) Manhattan Normal Module: Depth prediction in coplanar regions can benefit from known surface normals [68], [69]. However, estimating surface normals in indoor scenes is challenging due to pervasive large untextured planes with consistent luminosity in rooms.…”
Section: A the Manhattan-constraint Network (Mcn) Branchmentioning
confidence: 99%
“…1) Manhattan Normal Module: Depth prediction in coplanar regions can benefit from known surface normals [68], [69]. However, estimating surface normals in indoor scenes is challenging due to pervasive large untextured planes with consistent luminosity in rooms.…”
Section: A the Manhattan-constraint Network (Mcn) Branchmentioning
confidence: 99%
“…In contrast, depth models are supposed to predict dense results with both accurate details and integral spatial structures. Due to insufficient structural information, supervised [35,36,53] or self-supervised [14,51,55] methods produce failure predictions with concave objects, erroneous outcomes, or noticeable artifacts on autonomous driving scenarios [6,16].…”
Section: Introductionmentioning
confidence: 99%
“…To enhance spatial structures, recent works [14,17,19,51,55] explore self-supervised manner on driving scenes [6,16]. Surround-Depth [51] employs pseudo labels from Structure-from-Motion [40] to pretrain their model.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations