The Production of Ground Truths for Evaluating Highly Accurate Stereovision Algorithms

Dagobert, Tristan

doi:10.5201/ipol.2018.187

Cited by 2 publications

(4 citation statements)

References 18 publications

(23 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For example, the KITTI datasets were mainly developed for self-driving vehicle use-cases, while the few scenes in Middlebury are all captured in a laboratory setting. Other real-world stereo datasets include Make3D [47], ETH3D [49], CMLA [9], and Cityscapes [8] -each focused on a specific domain. More recent datasets such as Flickr1024 [57] and WSVD [55] provide more diverse scenes, however, Flickr1024 is relatively small compared to our dataset and WSVD images score significantly lower on quality metrics, as shown in Table 1.…”

Section: Stereo Datasetsmentioning

confidence: 99%

“…A majority of state-of-the-art methods in this domain are based on deep learning methods, where accuracy and quality scale with the amount of data available for training. Current datasets either cover a limited subset of real-world scenarios [16] [8] or are taken in a laboratory setting [48] [9]. Further, there is an increased need for a large-scale stereo dataset representative of the diversity of real-world scenarios to enable generalization.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Holopix50k: A Large-Scale In-the-wild Stereo Image Dataset

Hua,

Kohli,

Uplavikar

et al. 2020

Preprint

View full text Add to dashboard Cite

With the mass-market adoption of dual-camera mobile phones, leveraging stereo information in computer vision has become increasingly important. Current state-of-the-art methods utilize learningbased algorithms, where the amount and quality of training samples heavily influence results. Existing stereo image datasets are limited either in size or subject variety. Hence, algorithms trained on such datasets do not generalize well to scenarios encountered in mobile photography. We present Holopix50k, a novel in-the-wild stereo image dataset, comprising 49,368 image pairs contributed by users of the Holopix™ mobile social platform. In this work, we describe our data collection process and statistically compare our dataset to other popular stereo datasets. We experimentally show that using our dataset significantly improves results for tasks such as stereo super-resolution and self-supervised monocular depth estimation. Finally, we showcase practical applications of our dataset to motivate novel works and use cases. The dataset is available at http://github.com/leiainc/holopix50k.

show abstract

Section: Stereo Datasetsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Holopix50k: A Large-Scale In-the-wild Stereo Image Dataset

Hua,

Kohli,

Uplavikar

et al. 2020

Preprint

View full text Add to dashboard Cite

show abstract

“…The sequences used in this section belong to the CMLA dataset [5] and were produced by simulating a fronto-parallel camera motion. The images have near-zero noise and their disparity accuracy is in the order of 10 −6 .…”

Section: Experimental Protocolmentioning

confidence: 99%

“…On the one hand, this study implies the use of image databases with negligible noise levels and very accurate ground truths. Dagobert [5] found that commonly used databases poorly met these requirements and proposed a new one, the CMLA dataset. This database has the advantage of providing pairs of images created with different baselines, which have virtually no residual noise and dense disparity maps, whose accuracy is very high.…”

Section: Introductionmentioning

confidence: 99%

Comparison of Optical Flow Methods under Stereomatching with Short Baselines

Dagobert¹,

Monzón²,

Sánchez³

2019

Image Processing on Line

Self Cite

View full text Add to dashboard Cite

This article studies the effectiveness of optical flow methods applied to short baseline image pairs under different noise levels. New metrics have been developed to analyze the results because the usual metrics are inadequate in a subpixel context. We have used the implementation of some standard optical flow methods adapted to the stereo problem. Our experiments show that the Brox et al. method produces the least errors, with a 60% success rate and a relative precision at 1/100th of a pixel. On the other hand, our comparison shows that a discontinuity preserving method, derived from Brox et al., also provides competitive results at the same time that it yields disparities with more details and correct contours. Source Code Source codes of Lucas-Kanade 1D, Robust Optical Flow 1D and Robust Discontinuity Preserving 1D algorithms are provided in the web page of the article 1 .

show abstract

The Production of Ground Truths for Evaluating Highly Accurate Stereovision Algorithms

Cited by 2 publications

References 18 publications

Holopix50k: A Large-Scale In-the-wild Stereo Image Dataset

Holopix50k: A Large-Scale In-the-wild Stereo Image Dataset

Comparison of Optical Flow Methods under Stereomatching with Short Baselines

Contact Info

Product

Resources

About