2021 IEEE International Conference on Image Processing (ICIP) 2021
DOI: 10.1109/icip42928.2021.9506330
|View full text |Cite
|
Sign up to set email alerts
|

Separable Convolutions for Optimizing 3D Stereo Networks

Abstract: Deep learning based 3D stereo networks give superior performance compared to 2D networks and conventional stereo methods. However, this improvement in the performance comes at the cost of increased computational complexity, thus making these networks non-practical for the real-world applications. Specifically, these networks use 3D convolutions as a major work horse to refine and regress disparities. In this work first, we show that these 3D convolutions in stereo networks consume up to 94% of overall network … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 23 publications
(33 reference statements)
0
3
0
Order By: Relevance
“…However, end‐to‐end fashion DNN solutions still suffer from unbalancing between low computation consumption and high‐quality depth estimation. For instance, Shamsafar et al (2022) and Rahim et al (2021) both identify that 3D convolution operations consume the most computational power and they provide alternative architectures to reduce the parameters and operations. However, the oversize models still prevent them from reaching real‐time requirements even on the powerful computing platform.…”
Section: Related Workmentioning
confidence: 99%
“…However, end‐to‐end fashion DNN solutions still suffer from unbalancing between low computation consumption and high‐quality depth estimation. For instance, Shamsafar et al (2022) and Rahim et al (2021) both identify that 3D convolution operations consume the most computational power and they provide alternative architectures to reduce the parameters and operations. However, the oversize models still prevent them from reaching real‐time requirements even on the powerful computing platform.…”
Section: Related Workmentioning
confidence: 99%
“…When faced with the ill-posed region challenges, SSPCV-Net [34] extracts details from semantic segmentation sub networks, the contentaware inter-scale cost aggregation method [35] adaptively aggregates and upsamples cost volume for reliable detail recovery, and MSMD-Net [36] constructs multi-scale and multi-dimension cost volume. Moreover, on the purpose of balancing real-time performance and accuracy, Gwc-Net [37] presents a group-wise correlation module that can not only provide similarity measurement, but also maintain better performance after reducing parameters, BGNet [38] proposes upsampling module based on the learned bilateral grid to get high quality cost volume form the low-resolution feature maps, and the method [39] realizes this goal by mean of the separable convolution. Targeting at earning a high level of accuracy, AcfNet [40] directly constraints the cost volume using true disparities peaked at unimodal distribution and the adaptive filtering cost volume.…”
Section: B Volumetric Stereo Matching Modelmentioning
confidence: 99%
“…We achieve all of this without the need of costly 3D-convolutions or fullyconnected layers that are used by many popular state-of-the art methods such as GC-Net [4], PSMNet [5] or MC-CNNacrt [6]. That 3D-convolutions are a major bottleneck for stereo estimation networks has been shown by R. Rahim et al in their recent work [41].…”
Section: Introductionmentioning
confidence: 98%