2019
DOI: 10.1111/cgf.13820
Active Scene Understanding via Online Semantic Reconstruction

Abstract: We propose a novel approach to robot‐operated active understanding of unknown indoor scenes, based on online RGBD reconstruction with semantic segmentation. In our method, the exploratory robot scanning is both driven by and targeted at the recognition and segmentation of semantic objects in the scene. Our algorithm is built on top of a volumetric depth fusion framework and performs real‐time voxel‐based semantic labeling over the online reconstructed volume. The robot is guided by an online estimated discr…
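The abstract describes fusing per-frame semantic predictions into a reconstructed volume via voxel-based labeling. The following is a minimal illustrative sketch of one common way such fusion can be done (a per-voxel Bayesian update of class probabilities); it is an assumption for clarity, not the authors' implementation, and the function name `fuse_labels` is hypothetical.

```python
import numpy as np

# Each voxel keeps a probability distribution over C semantic classes.
# Every new frame's per-pixel network predictions are fused into the
# voxels they project to by a multiplicative (Bayesian) update.

def fuse_labels(voxel_probs, frame_probs):
    """Fuse one frame of per-voxel class likelihoods into running posteriors.

    voxel_probs: (V, C) current class distributions, rows sum to 1.
    frame_probs: (V, C) this frame's predicted likelihoods for the same voxels.
    """
    fused = voxel_probs * frame_probs           # elementwise Bayes update
    fused /= fused.sum(axis=1, keepdims=True)   # renormalize per voxel
    return fused

# Toy usage: two voxels, three classes, starting from uniform priors.
prior = np.full((2, 3), 1.0 / 3.0)
obs = np.array([[0.7, 0.2, 0.1],
                [0.1, 0.1, 0.8]])
post = fuse_labels(prior, obs)
print(post.argmax(axis=1))   # → [0 2], most likely class per voxel
```

Repeating this update over many frames is what lets noisy single-frame predictions converge to a consistent per-voxel labeling.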

Cited by 29 publications (21 citation statements) · References 34 publications
“…With few exceptions, however, RGB and 3D data are generally analyzed separately in the reconstruction process, most of the time exploiting RGB analysis for 3D data densification prior to the application of a pure geometric processing pipeline. Performing data fusion to combine visual and depth cues into multi‐modal feature descriptors on which to base further analysis is an important avenue for future work: such a joint analysis makes it possible to better cope with heavily cluttered and partial acquisitions, as demonstrated by early results on boundary surface reconstruction [LWF18a] and indoor object reconstruction [SFCH12, ZZZ∗19, JDN19].…”
Section: Discussion
confidence: 99%
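The statement above argues for fusing visual and depth cues into one multi-modal descriptor. A minimal sketch of that direction, under the assumption of a simple normalize-and-concatenate scheme (the function name and feature choices are illustrative, not an API from the cited works):

```python
import numpy as np

def multimodal_descriptor(rgb_feat, depth_feat):
    """L2-normalize each modality so neither dominates, then concatenate
    into a single joint feature vector for downstream analysis."""
    rgb = rgb_feat / (np.linalg.norm(rgb_feat) + 1e-8)
    depth = depth_feat / (np.linalg.norm(depth_feat) + 1e-8)
    return np.concatenate([rgb, depth])

# Toy usage: a 4-D color histogram plus a 3-D surface-normal histogram.
rgb = np.array([3.0, 1.0, 0.0, 0.0])
geo = np.array([0.0, 4.0, 0.0])
desc = multimodal_descriptor(rgb, geo)
print(desc.shape)   # (7,)
```

Per-modality normalization is one simple way to keep cues with different scales (e.g. color statistics vs. geometric measurements) comparable before joint analysis.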
“…While the above approaches perform object recognition on images or inside the reconstruction step, 3D object segmentation can also be performed over the 3D reconstruction of scene geometry, in order to facilitate 3D spatial and structural reasoning [ZXTZ15, XHS∗15], at least when a dense input is available. In this context, Hou et al [JDN19] and Zheng et al [ZZZ∗19] have recently proposed methods for active scene understanding based on online RGB‐D reconstruction with volumetric segmentation. In those approaches, a deep neural network is leveraged to perform real‐time voxel‐based semantic labeling.…”
Section: Bounding Surfaces Reconstruction
confidence: 99%
“…However, all of the above systems suffer from incorrect single‐frame segmentation and error accumulation, resulting in noisy and inconsistent reconstructed objects. Zheng et al [ZZZ*19] achieve semantic understanding of indoor scenes based on online RGBD reconstruction with volumetric semantic segmentation. However, their method has difficulty distinguishing different instances.…”
Section: Related Work
confidence: 99%
“…It is considered a complex task due to the multiple sub-tasks that are involved, such as object recognition, scene classification, geometric reasoning, semantic segmentation, pose estimation, 3D reconstruction, saliency detection, physics-based reasoning, and affordance prediction. The scene understanding task has been addressed in different ways: through the parsing of single images [7,20,89,91,107,176,195,201,205], and by considering the understanding of the whole environment in which a robot moves [84,147,200]. In the literature, depending on the final goal pursued, the approaches emphasize some sub-tasks more than others.…”
Section: Related Work
confidence: 99%
“…Figure 2.6: Results of applying the approach of [200], showing the semantic maps generated considering only scene information. In column (d) are the semantic maps built through only object information.…”
Section: Introduction
confidence: 99%