“…Among the scene graph, each node represents the object and attributes, and each edge represents the relation between the objects. Based on [12], [13], MIT SPARK laboratory combines with the previous semantic mapping work, visual-inertial odometry, deep learning, and other methods to construct a scene graph of a dynamic 3D environment [11]. They propose a more comprehensive 3D semantic SGG framework, which adds the detection and tracking modules for dynamic targets, thus some of the impacts of dynamic changes is eliminated.…”