2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00566
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Deep Stereo Matching on High-Resolution Images

Abstract: Figure 1: Illustration of on-demand depth sensing with a coarse-to-fine hierarchy on the proposed dataset. Our method (HSM) captures the coarse layout of the scene in 91 milliseconds, finds the far-away car (shown in the red box) in 175 ms, and recovers the details of the car given extra 255 ms. AbstractWe explore the problem of real-time stereo matching on high-res imagery. Many state-of-the-art (SOTA) methods struggle to process high-res imagery because of memory constraints or speed limitations. To address … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
196
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 230 publications
(197 citation statements)
references
References 38 publications
1
196
0
Order By: Relevance
“…It is clear that our estimated disparity maps are all closer to the ground truth and have the fewest number of bad pixels. In contrast, the estimated disparity maps from LF OCC are noisy, those from RPRF look over-smoothed and have quantification errors, both EPINET and EPINET T fail to predict disparities, HSM [21] is disturbed by the ambiguous background, EPI-Shift [26] and LBDE-E [25] both seem not able to handle the foreground well.…”
Section: Performance On Wide-baseline Datasetsmentioning
confidence: 97%
See 3 more Smart Citations
“…It is clear that our estimated disparity maps are all closer to the ground truth and have the fewest number of bad pixels. In contrast, the estimated disparity maps from LF OCC are noisy, those from RPRF look over-smoothed and have quantification errors, both EPINET and EPINET T fail to predict disparities, HSM [21] is disturbed by the ambiguous background, EPI-Shift [26] and LBDE-E [25] both seem not able to handle the foreground well.…”
Section: Performance On Wide-baseline Datasetsmentioning
confidence: 97%
“…To verify the effectiveness of our network LLF-Net on widebaseline light field datasets, we conducted experiments on the synthetic WLF dataset and real-world Google [43] and ULB [44] datasets. The proposed LLF-Net is compared with recent state-of-the-art depth estimation methods, comprising of traditional light field depth estimation methods LF OCC [3] and RPRF [13], ConvNet-based methods HSM [21], EPINET [19] and LBDE-E [25]. With respect to ConvNet-based methods, HSM [21] is designed for stereo-based depth estimation, EPINET [19] achieves top performance among published methods in narrow-baseline light field datasets and LBDE-E [25] achieves considerable performance on both narrowand wide-baseline light field datasets.…”
Section: Performance On Wide-baseline Datasetsmentioning
confidence: 99%
See 2 more Smart Citations
“…In EdgeStereo [21], the general decoder structure was replaced by using the new designed residual pyramid network with different scales. The HSM [46] computed the encoder features in a coarse‐to‐fine with 3D residual convolution and volumetric pyramid pooling, and gradually increased the output resolution. It could predict accurate disparity map with low latency.…”
Section: Related Workmentioning
confidence: 99%