2020
DOI: 10.1007/s11263-019-01287-w
|View full text |Cite
|
Sign up to set email alerts
|

EdgeStereo: An Effective Multi-task Learning Network for Stereo Matching and Edge Detection

Abstract: Recently, leveraging on the development of endto-end convolutional neural networks (CNNs), deep stereo matching networks have achieved remarkable performance far exceeding traditional approaches. However, state-of-theart stereo frameworks still have difficulties at finding correct correspondences in texture-less regions, detailed structures, small objects and near boundaries, which could be alleviated by geometric clues such as edge contours and corresponding constraints. To improve the quality of disparity es… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
66
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 144 publications
(73 citation statements)
references
References 67 publications
0
66
0
Order By: Relevance
“…It is tempting to use a state-of-the-art stereo algorithm instead, e.g. [3,2,29], however most modern stereo algorithms are supervised using the LiDAR ground truth from the KITTI dataset. Using one of these would cause us to be implicitly learning from laser-scanned ground-truth data.…”
Section: Computing Depth Hintsmentioning
confidence: 99%
“…It is tempting to use a state-of-the-art stereo algorithm instead, e.g. [3,2,29], however most modern stereo algorithms are supervised using the LiDAR ground truth from the KITTI dataset. Using one of these would cause us to be implicitly learning from laser-scanned ground-truth data.…”
Section: Computing Depth Hintsmentioning
confidence: 99%
“…e FlowNet provides the basic 2D encoder-decoder structure. Later, a lot of networks [23,24,26,27,32] have been proposed based on this. Optical flow estimation requires precise per-pixel localization, and it also depends on finding correspondences between two input images.…”
Section: End-to-end Stereo Matchingmentioning
confidence: 99%
“…Tons of algorithms based on this have been proposed. ese methods could roughly be categorized into two groups: 2D encode-decoder structures [23][24][25][26][27] and regularization modules composed of 3D convolutions [28][29][30][31]. DispNetC [24] computes a correlation volume from the left and right image features (encoding) and utilizes a CNN to directly regress (decoding) disparity maps.…”
Section: Introductionmentioning
confidence: 99%
“…The authors propose a feature-based matching methodology as opposed to a deep learning-based approach. The main reason for this decision is the fact that most deep learning methods demand a large amount of processing power [5][6][7]. This will be an extremely limiting factor if environments with restricted resources are considered where the resources are not abundant.…”
Section: Introductionmentioning
confidence: 99%