2020 International Conference on 3D Immersion (IC3D) 2020
DOI: 10.1109/ic3d51119.2020.9376327
|View full text |Cite
|
Sign up to set email alerts
|

A Novel 3D-Unet Deep Learning Framework Based on High-Dimensional Bilateral Grid for Edge Consistent Single Image Depth Estimation

Abstract: The task of predicting smooth and edge-consistent depth maps is notoriously difficult for single image depth estimation. This paper proposes a novel Bilateral Grid based 3D convolutional neural network, dubbed as 3DBG-UNet, that parameterizes high dimensional feature space by encoding compact 3D bilateral grids with UNets and infers sharp geometric layout of the scene. Further, another novel 3DBGES-UNet model is introduced that integrate 3DBG-UNet for inferring an accurate depth map given a single color view. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 38 publications
0
4
0
Order By: Relevance
“…Refs. [ 31 , 41 , 42 , 43 , 44 , 45 ] proposed wavelet transform to extract sharp edge and strong object boundaries for image dehazing, enhancing the low–light image, reducing artifacts, image segmentation, and depth map estimation. Here, they replace the down–sampling layer with a DWT and the up–sampling layer with an IWT.…”
Section: Nested Dwt Net Architecturementioning
confidence: 99%
“…Refs. [ 31 , 41 , 42 , 43 , 44 , 45 ] proposed wavelet transform to extract sharp edge and strong object boundaries for image dehazing, enhancing the low–light image, reducing artifacts, image segmentation, and depth map estimation. Here, they replace the down–sampling layer with a DWT and the up–sampling layer with an IWT.…”
Section: Nested Dwt Net Architecturementioning
confidence: 99%
“…We use standard error metrics for quantitative analysis: Absolute relative error (absrel), Squared relative error (sqrel), Root mean square error (RMSE), Average log error (log 10 ), threshold accuracy (σ i ) and perception-based Structural Similarity Index Metric (SSIM) [35] [39]. Given a predicted depth image and its corresponding ground truth, the different error metrics are calculated as follows:…”
Section: Comparative Analysismentioning
confidence: 99%
“…Sharma et al. [17] proposed a novel Bilateral Grid based 3D convolutional neural network, which pays more attention to edges and details. In this paper, we propose a novel monocular image depth estimation algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…Hu et al [16] proposed using multi-scale feature fusion module to obtain higher spatial resolution depth map. Sharma et al [17] proposed a novel Bilateral Grid based 3D convolutional neural network, which pays more attention to edges and details. In this paper, we propose a novel monocular image depth estimation algorithm.…”
Section: Introductionmentioning
confidence: 99%