2018
DOI: 10.1007/978-3-030-01234-2_39
|View full text |Cite
|
Sign up to set email alerts
|

SegStereo: Exploiting Semantic Information for Disparity Estimation

Abstract: Disparity estimation for binocular stereo images finds a wide range of applications. Traditional algorithms may fail on featureless regions, which could be handled by high-level clues such as semantic segments. In this paper, we suggest that appropriate incorporation of semantic cues can greatly rectify prediction in commonly-used disparity estimation frameworks. Our method conducts semantic feature embedding and regularizes semantic cues as the loss term to improve learning disparity. Our unified model SegSte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
196
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 305 publications
(197 citation statements)
references
References 44 publications
1
196
0
Order By: Relevance
“…Ladicky et al [15] estimate the depth based on different canonical views and show that semantic knowledge helps to improve the prediction. This is verified for stereo-matching methods by [26,31], too. In this paper, we empirically prove that this concept also holds for depth estimation from a single monocular image with a CNN.…”
Section: Related Worksupporting
confidence: 58%
“…Ladicky et al [15] estimate the depth based on different canonical views and show that semantic knowledge helps to improve the prediction. This is verified for stereo-matching methods by [26,31], too. In this paper, we empirically prove that this concept also holds for depth estimation from a single monocular image with a CNN.…”
Section: Related Worksupporting
confidence: 58%
“…When applied to down-scaled images, these methods run faster, but gives blurry results and inaccurate disparity estimates for the far-field. Recent "deep" stereo methods perform well on low-resolution benchmarks [5,11,16,21,38], while failing to produce SOTA results on high-res benchmarks [26]. This is likely due to: 1) Their architectures are not efficiently designed to operate on high-resolution images.…”
Section: Introductionmentioning
confidence: 99%
“…On KITTI 2012 dataset "Noc" means non occluded regions and "All" mean all regions. Notice, that we perform comparable against SegStereo [27] on KITTI 2015 but way better in KITTI 2012 dataset.…”
Section: Resultsmentioning
confidence: 80%