3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera

Armeni, Iro; He, Zhimin; Zamir, Amir; Gwak, JunYoung; Malik, Jitendra; Fischer, Martin; Savarese, Silvio

doi:10.1109/iccv.2019.00576

Cited by 191 publications

(158 citation statements)

References 49 publications

(68 reference statements)

Supporting

Mentioning

158

Contrasting

Order By: Relevance

“…Expressing the environment with 3D information preserved supports the scene graph to record the environment detail, but it requires powerful computation capabilities. In [13], a framework for constructing a 3D scene semantic map is proposed. The map constructed by this framework is composed of four layers, which is more in line with human thinking and perception.…”

Section: Semantic Mapmentioning

confidence: 99%

“…Among the scene graph, each node represents the object and attributes, and each edge represents the relation between the objects. Based on [12], [13], MIT SPARK laboratory combines with the previous semantic mapping work, visual-inertial odometry, deep learning, and other methods to construct a scene graph of a dynamic 3D environment [11]. They propose a more comprehensive 3D semantic SGG framework, which adds the detection and tracking modules for dynamic targets, thus some of the impacts of dynamic changes is eliminated.…”

Section: Semantic Mapmentioning

confidence: 99%

“…Since the scene graph constructed from a single image is not specific enough, the scene graph generated from multiple images may miss some objects or repeatedly detect some objects. We refer to the method mentioned in [11]- [13] to build the scene graph from RGB-D videos.…”

Section: Semantic Mapmentioning

confidence: 99%

“…The 3D semantic map construction framework proposed in [11] is a masterpiece of semantic map research, which expresses the environment with five levels: Metric-Semantic Mesh, Objects and Agents, Places and Structures, Rooms, Building. Besides, the framework applies the tracking and detection module for dynamic targets, which eliminate the effects of dynamic changes [12], [13]. However, this framework has strict requirements on hardware and the construction process of the map is complicated.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

TSM: Topological Scene Map for Representation in Indoor Environment Understanding

et al. 2020

View full text Add to dashboard Cite

In the field of robotics, it is crucial to obtain a comprehensive semantic understanding of a scene for many applications. Based on the behavioral topological map and scene graph, we propose to employ a semantic map named Topological Scene Map (TSM) for representation in indoor environment understanding. The behavioral topological map we constructed expresses the spatial connection relations and semantically describes the navigation behavior between adjacent topological nodes. The scene graph promotes the TSM to record the objects that appear in the scene and the relations between objects. The addition of spatial and semantic relations makes the expression of the scene more specific, which improves the robot's abilities of scene understanding and human-robotic interaction. In this paper, we design a method for topological map construction and apply a novel approach to generate a scene graph from RGB-D data. The semantic representation of the environment generated in the experiments verifies that the TSM construction framework models the scene efficiently and the TSM is conducive to the realization of humanrobotic interaction.

show abstract

Section: Semantic Mapmentioning

confidence: 99%

Section: Semantic Mapmentioning

confidence: 99%

Section: Semantic Mapmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

TSM: Topological Scene Map for Representation in Indoor Environment Understanding

et al. 2020

View full text Add to dashboard Cite

show abstract

“…There are also works that use variety of approaches to scene description and generation, such as domain specific languages [23], scene graphs [24], stochastic grammars [25] for scenes description and generation.…”

Section: Related Workmentioning

confidence: 99%

Variable photorealistic image synthesis for training dataset generation

Sanzharov¹,

Frolov

Voloboy

2020

CPT2020 the 8th International Scientific Conference on Computing in Physics and Technology Proceedings

View full text Add to dashboard Cite

Photorealistic rendering systems have recently found new applications in artificial intelligence, specifically in computer vision for the purpose of generation of image and video sequence datasets. The problem associated with this application is producing large number of photorealistic images with high variability of 3d models and their appearance. In this work, we propose an approach based on combining existing procedural texture generation techniques and domain randomization to generate large number of highly variative digital assets during the rendering process. This eliminates the need for a large pre-existing database of digital assets (only a small set of 3d models is required), and generates objects with unique appearance during rendering stage, reducing the needed post-processing of images and storage requirements. Our approach uses procedural texturing and material substitution to rapidly produce large number of variations of digital assets. The proposed solution can be used to produce training datasets for artificial intelligence applications and can be combined with most of state-of-the-art methods of scene generation.

show abstract