2019
DOI: 10.1007/978-3-030-20893-6_21
|View full text |Cite
|
Sign up to set email alerts
|

Visual Graphs from Motion (VGfM): Scene Understanding with Object Geometry Reasoning

Abstract: Recent approaches on visual scene understanding attempt to build a scene graph -a computational representation of objects and their pairwise relationships. Such rich semantic representation is very appealing, yet difficult to obtain from a single image, especially when considering complex spatial arrangements in the scene. Differently, an image sequence conveys useful information using the multi-view geometric relations arising from camera motions. Indeed, object relationships are naturally related to the 3D s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
24
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 20 publications
(24 citation statements)
references
References 38 publications
0
24
0
Order By: Relevance
“…Only a few works have explored scene graphs in 3D. Gay et al (2018) propose a 2.5D graph dataset based on ScanNet (Dai et al 2017), Armeni et al (2019), on the other hand, suggest hierarchical 3D scene graphs. They split the different components of a scene into 4 different layers: cameras, objects, buildings and rooms.…”
Section: D Object Context and Scene Layoutmentioning
confidence: 99%
See 3 more Smart Citations
“…Only a few works have explored scene graphs in 3D. Gay et al (2018) propose a 2.5D graph dataset based on ScanNet (Dai et al 2017), Armeni et al (2019), on the other hand, suggest hierarchical 3D scene graphs. They split the different components of a scene into 4 different layers: cameras, objects, buildings and rooms.…”
Section: D Object Context and Scene Layoutmentioning
confidence: 99%
“…Additionally to the data, Armeni et al (2019) and Gay et al (2018) propose graph prediction methods. Armeni et al (2019) sample images from a panoramic camera and apply a regularization technique to 2D mask predictions aiming to obtain improved 3D object nodes.…”
Section: D Object Context and Scene Layoutmentioning
confidence: 99%
See 2 more Smart Citations
“…Additionally, we integrate the results of classifiers by using rule-based architecture for estimating multi-modal information. This rule-based integration architecture is inspired by the methods which utilize structured knowledge to learn from small amounts of experience for scene understanding [7]. Fig.…”
Section: Introduction mentioning
confidence: 99%