2021 IEEE International Conference on Multimedia and Expo (ICME) 2021
DOI: 10.1109/icme51207.2021.9428423
|View full text |Cite
|
Sign up to set email alerts
|

IRS: A Large Naturalistic Indoor Robotics Stereo Dataset to Train Deep Models for Disparity and Surface Normal Estimation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(14 citation statements)
references
References 21 publications
0
14
0
Order By: Relevance
“…For many fundamental scene understanding tasks, it is difficult or impossible to obtain per-pixel ground truth labels from real images. In response to this challenge, the computer vision community has developed several photorealistic synthetic datasets and interactive simulation environments that have spurred rapid progress towards the goal of holistic indoor scene understanding [5,6,8,9,13,14,17,20,22,29,31,34,35,37,41,42,43,44,47,50,57,58,59,61,66,68,71,72,75,79,80]. tations (d,e); diffuse reflectance (f); diffuse illumination (g); and a non-diffuse residual image that captures view-dependent lighting effects like glossy surfaces and specular highlights (h).…”
Section: Introductionmentioning
confidence: 99%
“…For many fundamental scene understanding tasks, it is difficult or impossible to obtain per-pixel ground truth labels from real images. In response to this challenge, the computer vision community has developed several photorealistic synthetic datasets and interactive simulation environments that have spurred rapid progress towards the goal of holistic indoor scene understanding [5,6,8,9,13,14,17,20,22,29,31,34,35,37,41,42,43,44,47,50,57,58,59,61,66,68,71,72,75,79,80]. tations (d,e); diffuse reflectance (f); diffuse illumination (g); and a non-diffuse residual image that captures view-dependent lighting effects like glossy surfaces and specular highlights (h).…”
Section: Introductionmentioning
confidence: 99%
“…To better show the difference, we compare our dataset with the existing datasets under zero-shot cross-dataset setting. As shown in Table 4 (a), we train our NVDS with existing video depth datasets [33,39,40] on Sintel [6] dataset. With both quantity and diversity, using VDW as the training data yields the best accuracy and consistency.…”
Section: Comparisons With Other Video Depth Methodsmentioning
confidence: 99%
“…Video Depth Datasets According to the scenes of samples, existing video depth datasets can be categorized into closed-domain datasets and natural-scene datasets. Closeddomain datasets only contain samples in certain scenes, e.g., indoor scenes [9,33,39], office scenes [34], and autonomous driving [11]. To enhance the diversity of samples, natural-scene datasets are proposed, which use computerrendered videos [6,40] or crawl stereoscopic videos from YouTube [38].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations