2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00166
|View full text |Cite
|
Sign up to set email alerts
|

P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior

Abstract: Monocular depth estimation is vital for scene understanding and downstream tasks. We focus on the supervised setup, in which ground-truth depth is available only at training time. Based on knowledge about the high regularity of real 3D scenes, we propose a method that learns to selectively leverage information from coplanar pixels to improve the predicted depth. In particular, we introduce a piecewise planarity prior which states that for each pixel, there is a seed pixel which shares the same planar 3D surfac… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
19
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 57 publications
(19 citation statements)
references
References 73 publications
0
19
0
Order By: Relevance
“…The performance gain is significant considering the saturated performance of the dataset in recent years. We see an improvement of 9.6% and 3.5% over the recently proposed methods [23] and NeWCRFs, respectively, in terms of the RMSE error. We see in Fig.…”
Section: Methodsmentioning
confidence: 73%
See 1 more Smart Citation
“…The performance gain is significant considering the saturated performance of the dataset in recent years. We see an improvement of 9.6% and 3.5% over the recently proposed methods [23] and NeWCRFs, respectively, in terms of the RMSE error. We see in Fig.…”
Section: Methodsmentioning
confidence: 73%
“…Lee et al [17] enforce a model to learn structural information about the scene by learning the relationship between image patches close to each other. Whereas, Patil et al [23] exploit coplanar pixels to improve the predicted depth.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, transformer structures have been introduced in monocular depth estimation, where DPT [43], and PixelFormer [1] have shown improved performances. To increase the robustness of monocular depth estimation, some methods introduce additional constraints such as uncertainty (UCRDepth [47]) or piecewise planarity prior (P3Depth [41]). NewCRFs [64] proposes windowseparated Conditional Random Fields (CRF) to enhance the local space relation with neighboring pixels.…”
Section: Related Workmentioning
confidence: 99%
“…Estimating depth from a single image is challenging due to the inherent ambiguity in the mapping between the 2D image and the 3D scene. To increase the robustness, the following methods utilizing constructed additional constraints such as uncertainty (UCRDepth [47]), and piecewise planarity prior (P3Depth [41]). The NewCRFs [64] introduces window-separated Conditional Random Fields (CRF) to enhance local space relation with neighbor pixels.…”
Section: Introductionmentioning
confidence: 99%
“…An encoder-decoder transformer-based architectural block that separates the depth range into bins with estimated centers depending on the image was proposed by Bhat et al [11]. Patil et al [12] recently proposed a two head CNN network for depth estimation, the first head produces pixel-wise plane values while the second head produces a dense vector field for each pixel position. The output from the second head is adaptively fused by the output of the first head.…”
Section: Related Workmentioning
confidence: 99%