End-to-End Training of Hybrid CNN-CRF Models for Stereo

Knöbelreiter, Patrick; Reinbacher, Christian; Shekhovtsov, Alexander; Pock, Thomas

doi:10.1109/cvpr.2017.159

Cited by 125 publications

(77 citation statements)

References 44 publications

“…Several techniques have been developed in the field of structured support vector machines (SSVMs) [92,28,1,95] that are very relevant to the task of learning energy models, as SSVMs can be understood as bi-level problems with a lower-level energy that is linear in θ and often a noncontinuous higher-level loss. Various strategies such as margin rescaling [92], slack rescaling [95,97], softmax-margins [40] exist and have also been applied recently in the training of computer vision models in [54,29], we will later return to their connection to the investigated strategies.…”

Section: Related Work (mentioning)

confidence: 99%

Parametric Majorization for Data-Driven Energy Minimization Methods

Geiping

Moeller

2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

Energy minimization methods are a classical tool in a multitude of computer vision applications. While they are interpretable and well-studied, their regularity assumptions are difficult to design by hand. Deep learning techniques on the other hand are purely data-driven, often provide excellent results, but are very difficult to constrain to predefined physical or safety-critical models. A possible combination between the two approaches is to design a parametric energy and train the free parameters in such a way that minimizers of the energy correspond to desired solution on a set of training examples. Unfortunately, such formulations typically lead to bi-level optimization problems, on which common optimization algorithms are difficult to scale to modern requirements in data processing and efficiency. In this work, we present a new strategy to optimize these bi-level problems. We investigate surrogate single-level problems that majorize the target problems and can be implemented with existing tools, leading to efficient algorithms without collapse of the energy function. This framework of strategies enables new avenues to the training of parameterized energy minimization models from large data.
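The bi-level structure described in this abstract, and the margin-rescaling strategy named in the citation statement above, can be written out as follows. This is a sketch in assumed notation that does not come from the page itself: $E(x, y_i; \theta)$ is the parametric energy, $y_i$ the input data, $x_i^{*}$ the desired solution for training example $i$, and $l$ the higher-level loss. The first line is the bi-level training problem; the second is the standard margin-rescaled single-level surrogate, with any regularizer on $\theta$ omitted.

```latex
\begin{align}
\min_{\theta}\; & \sum_{i=1}^{N} l\bigl(x_i(\theta),\, x_i^{*}\bigr)
  \quad \text{s.t.} \quad
  x_i(\theta) \in \operatorname*{arg\,min}_{x} E(x, y_i; \theta), \\
\min_{\theta}\; & \sum_{i=1}^{N} \Bigl[ E(x_i^{*}, y_i; \theta)
  - \min_{x} \bigl( E(x, y_i; \theta) - l(x, x_i^{*}) \bigr) \Bigr].
\end{align}
```

Each summand of the second problem bounds the corresponding loss $l(x_i(\theta), x_i^{*})$ from above, because $E(x_i^{*}, y_i; \theta) \ge E(x_i(\theta), y_i; \theta)$ for any minimizer $x_i(\theta)$. Whether this matches the exact surrogates investigated in the cited paper is not stated here.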

“…(ii) This positive trend is transferred to the test set for the average error and the RMS error. (iii) The bad{0.5, 1} errors on the test set are reduced and (iv) the bad{2, 4} errors slightly increase on the test set compared to [11]. One reason for this is the limited amount of training data for these very high-resolution images.…”

Section: Benchmark Performance (mentioning)

confidence: 99%
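The error measures named in the statement above (average error, RMS error, and bad{x}) can be illustrated with a short sketch. It assumes the common stereo-benchmark convention that bad{x} is the percentage of valid pixels whose absolute disparity error exceeds x pixels, and that invalid ground-truth pixels are marked as non-finite values. The function name `stereo_errors` is hypothetical and is not taken from any benchmark toolkit.

```python
import numpy as np

def stereo_errors(disp_est, disp_gt, thresholds=(0.5, 1.0, 2.0, 4.0)):
    """Compute average, RMS, and bad-x disparity errors over valid pixels.

    Assumption: pixels without ground truth are stored as inf or NaN.
    """
    est = np.asarray(disp_est, dtype=float)
    gt = np.asarray(disp_gt, dtype=float)
    valid = np.isfinite(gt)                    # mask of pixels with ground truth
    abs_err = np.abs(est[valid] - gt[valid])   # absolute disparity error in pixels

    metrics = {
        "avg": float(abs_err.mean()),
        "rms": float(np.sqrt((abs_err ** 2).mean())),
    }
    for t in thresholds:
        # Percentage of valid pixels whose error is larger than t pixels.
        metrics[f"bad{t:g}"] = float(100.0 * (abs_err > t).mean())
    return metrics

# Toy example: four pixels, the last one has no ground truth.
est = np.array([10.0, 12.0, 20.0, 33.0])
gt = np.array([10.0, 11.4, 23.0, np.inf])
result = stereo_errors(est, gt)
```

In the toy example one of the three valid pixels is off by 3 pixels, so it counts toward bad0.5, bad1, and bad2 but not toward bad4.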

“…We use our method on top of the CNN-CRF [11] stereo method for the official test set evaluation (see Table 2). We set the temperature parameter η = 0.075 in all experiments.…”

Section: Benchmark Performance (mentioning)

confidence: 99%

Learned Collaborative Stereo Refinement

Knöbelreiter

Pock

2019

Lecture Notes in Computer Science

Self Cite

In this work, we propose a learning-based method to denoise and refine disparity maps of a given stereo method. The proposed variational network arises naturally from unrolling the iterates of a proximal gradient method applied to a variational energy defined in a joint disparity, color, and confidence image space. Our method allows learning a robust collaborative regularizer leveraging the joint statistics of the color image, the confidence map and the disparity map. Due to the variational structure of our method, the individual steps can be easily visualized, thus enabling interpretability of the method. We can therefore provide interesting insights into how our method refines and denoises disparity maps. The efficiency of our method is demonstrated on the publicly available stereo benchmarks Middlebury 2014 and KITTI 2015.
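To make "unrolling the iterates of a proximal gradient method" concrete, here is a minimal sketch on a stand-in problem. It does not use the paper's energy: the assumed toy energy is 0.5 * ||x - b||^2 + lam * ||x||_1, whose proximal step is elementwise soft-thresholding. The names `soft_threshold` and `unrolled_prox_grad` and the parameters `lam`, `tau`, and `num_steps` are hypothetical. In a learned variational network the per-step parameters would presumably be trained rather than fixed as they are here.

```python
import numpy as np

def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1 (elementwise shrinkage toward zero)."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def unrolled_prox_grad(b, lam=0.5, tau=0.5, num_steps=50):
    """Run a fixed number of proximal gradient steps on the toy energy
    0.5 * ||x - b||^2 + lam * ||x||_1, starting from x = b."""
    b = np.asarray(b, dtype=float)
    x = b.copy()
    for _ in range(num_steps):
        grad = x - b                                   # gradient of the smooth data term
        x = soft_threshold(x - tau * grad, tau * lam)  # gradient step, then proximal step
    return x

b = np.array([2.0, -0.3, 0.1, -1.5])
x_hat = unrolled_prox_grad(b)
```

For this toy energy the iterates approach the closed-form minimizer `soft_threshold(b, lam)`, which gives a simple check that the unrolled loop behaves as intended.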

“…Supervised learning techniques include models which are trained on stereo images, but which can infer depth maps on monocular images. [12] propose an approach which follows this paradigm. They use the correlations of CNN feature maps of stereo images and derive the unary matching costs.…”

Section: Related Workmentioning

confidence: 99%

SDNet: Semantically Guided Depth Estimation Network

Ochs

Kretz

Mester

2019

Lecture Notes in Computer Science

Autonomous vehicles and robots require a full scene understanding of the environment to interact with it. Such a perception typically incorporates pixel-wise knowledge of the depths and semantic labels for each image from a video sensor. Recent learning-based methods estimate both types of information independently using two separate CNNs. In this paper, we propose a model that is able to predict both outputs simultaneously, which leads to improved results and even reduced computational costs compared to independent estimation of depth and semantics. We also empirically prove that the CNN is capable of learning more meaningful and semantically richer features. Furthermore, our SD-Net estimates the depth based on ordinal classification. On the basis of these two enhancements, our proposed method achieves state-of-the-art results in semantic segmentation and depth estimation from single monocular input images on two challenging datasets.
