2019
DOI: 10.1109/lra.2019.2923960
|View full text |Cite
|
Sign up to set email alerts
|

Volumetric Instance-Aware Semantic Mapping and 3D Object Discovery

Abstract: To autonomously navigate and plan interactions in real-world environments, robots require the ability to robustly perceive and map complex, unstructured surrounding scenes. Besides building an internal representation of the observed scene geometry, the key insight toward a truly functional understanding of the environment is the usage of higher-level entities during mapping, such as individual object instances. This work presents an approach to incrementally build volumetric objectcentric maps during online sc… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
168
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 211 publications
(182 citation statements)
references
References 27 publications
3
168
0
Order By: Relevance
“…Representation FQ FPS SemanticFusion [11] Dense Every 10 frames Under 8 Hz Hermans et al [34] Dense Every 6 frames 3 Hz PanopicFusion [44] Dense Every 10 frames 4.3 Hz Voxblox++ [16] Instance-oriented Every frame 1 Hz Pham et al [45] Instance-oriented Every frame 1 Hz Fusion++ [29] Instance-oriented progressively integrating predictions from partial observations into a global map but can learn from the entire 3D layout of the scene, they are not directly comparable with our work. Among the frameworks that study online, incremental instance-aware semantic mapping, we chose Grinvald et al [16] as a comparison. Because we relied on a Mask R-CNN model trained on the 80 Microsoft COCO [38] object classes to get the instance IDs, we evaluated the segmentation accuracy on the nine object categories that were common to the SceneNN dataset [21].…”
Section: Methodsmentioning
confidence: 65%
See 4 more Smart Citations
“…Representation FQ FPS SemanticFusion [11] Dense Every 10 frames Under 8 Hz Hermans et al [34] Dense Every 6 frames 3 Hz PanopicFusion [44] Dense Every 10 frames 4.3 Hz Voxblox++ [16] Instance-oriented Every frame 1 Hz Pham et al [45] Instance-oriented Every frame 1 Hz Fusion++ [29] Instance-oriented progressively integrating predictions from partial observations into a global map but can learn from the entire 3D layout of the scene, they are not directly comparable with our work. Among the frameworks that study online, incremental instance-aware semantic mapping, we chose Grinvald et al [16] as a comparison. Because we relied on a Mask R-CNN model trained on the 80 Microsoft COCO [38] object classes to get the instance IDs, we evaluated the segmentation accuracy on the nine object categories that were common to the SceneNN dataset [21].…”
Section: Methodsmentioning
confidence: 65%
“…us, the approach in [11] does not provide any information about the geometry and relative placement of individual objects in the scene. A number of other works have addressed the task of detecting and segmenting individual semantically meaningful objects in 3D scenes without predefined shape templates [10,16,17,27,[29][30][31][32][33][34]. Runz et al [32] employed the object detector for the first step and then updated the class probabilities of each element consisting of the reconstructed 3D map.…”
Section: Semantic Instance-awarementioning
confidence: 99%
See 3 more Smart Citations