2019
DOI: 10.1109/lra.2019.2894913
|View full text |Cite
|
Sign up to set email alerts
|

DispSegNet: Leveraging Semantics for End-to-End Learning of Disparity Estimation From Stereo Imagery

Abstract: Recent work has shown that convolutional neural networks (CNNs) can be applied successfully in disparity estimation, but these methods still suffer from errors in regions of low-texture, occlusions and reflections. Concurrently, deep learning for semantic segmentation has shown great progress in recent years. In this paper, we design a CNN architecture that combines these two tasks to improve the quality and accuracy of disparity estimation with the help of semantic segmentation. Specifically, we propose a net… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 60 publications
(22 citation statements)
references
References 28 publications
0
21
0
Order By: Relevance
“…Displets [18] uses object knowledge that obtained by modeling 3D vehicles to resolve the stereo ambiguities for disparity estimation. The SegStereo [19] and DispSegNet [20] incorporate prior semantic information into stereo matching. The EdgeStereo [21] borrows the network architecture of edge detector HED [22] and achieves the mutual promotion between the edge detection task and stereo matching task.…”
Section: B Complementation-based Methodsmentioning
confidence: 99%
“…Displets [18] uses object knowledge that obtained by modeling 3D vehicles to resolve the stereo ambiguities for disparity estimation. The SegStereo [19] and DispSegNet [20] incorporate prior semantic information into stereo matching. The EdgeStereo [21] borrows the network architecture of edge detector HED [22] and achieves the mutual promotion between the edge detection task and stereo matching task.…”
Section: B Complementation-based Methodsmentioning
confidence: 99%
“…To the best of our knowledge, only four methods successfully achieved this result: SegStereo [37] and DispSegNet [38] leverage on segmentation as auxiliary task, AMNet [39] on Background foreground segmentation and EdgeStereo [40] that used edges cues. To better understand the proposed con- Figure 2.11: Architecture overview of SegStereo.…”
Section: Multi-task Learning In Stereo Depthmentioning
confidence: 99%
“…As this stage, the network would be able to provide in a parallel fashion disparity and segmentation outputs, anyway, a further and deeper exploitation of the segmentation cues is possible and it can be achieved with a specific refinement module. Influenced both by the AnyNet residual structure and by previous successful works [37,38] we decided to merge disparity cost volumes and semantic embeddings into hybrid volumes. Differently from other works though, we want to keep also this part of the architecture fully residual and pyramidal.…”
Section: Architecture Overviewmentioning
confidence: 99%
See 1 more Smart Citation
“…The image is synthesized from the network outputs, following the traditional Structure-frommotion procedure. Extra constraint and additional information have been introduced to improve the performance, like the temporal depth consistency [22], the stereo matching [23] and the semantic information [34]. Godard et al [11] achieved a significant improvement by compensating for image occlusion.…”
Section: Supervised Depth Estimationmentioning
confidence: 99%