2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.01891
|View full text |Cite
|
Sign up to set email alerts
|

Spatial Commonsense Graph for Object Localisation in Partial Scenes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
22
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(22 citation statements)
references
References 22 publications
0
22
0
Order By: Relevance
“…These predictions also highlight the difficulty of the task, as many predictions are plausible (such as a drawer keeping its state or the second chairs also changing) but are not realized. architectures, the GNN of 3DSSG [7], where we replace the PointNet input features with the SG embeddings of [7] to operate directly on a SG, and the recent architecture of Spatial Commonsense Graphs (SCG) [19] based on graph transformers [50]. We find that GNNs perform significantly better than equivalent fully connected architectures, as the MLP is not able to identify local context and barely predicts any state or instance changes.…”
Section: B Variable Scene Graph Prediction Performancementioning
confidence: 96%
See 1 more Smart Citation
“…These predictions also highlight the difficulty of the task, as many predictions are plausible (such as a drawer keeping its state or the second chairs also changing) but are not realized. architectures, the GNN of 3DSSG [7], where we replace the PointNet input features with the SG embeddings of [7] to operate directly on a SG, and the recent architecture of Spatial Commonsense Graphs (SCG) [19] based on graph transformers [50]. We find that GNNs perform significantly better than equivalent fully connected architectures, as the MLP is not able to identify local context and barely predicts any state or instance changes.…”
Section: B Variable Scene Graph Prediction Performancementioning
confidence: 96%
“…Rosinol et al [8] introduce dynamic SGs, providing rich hierarchical abstractions of the SG and accounting for short-term moving agents, such as humans. Recently, Giuliari et al [19] presented Spatial Commonsense Graphs (SCG), embedding additional nodes from knowledge graphs such as "used for reading", which they call commonsense concepts. Different approaches have been proposed to estimate SGs from images [20,21] or dense reconstructions [7,8].…”
Section: A 3d Semantic Scene Representationsmentioning
confidence: 99%
“…• We present extensive evaluation of our proposed solution, as well as an in-depth analysis of the internal working of the proposed solution to explain the utility of commonsense reasoning. This work is an extended version of [4]. The novel contributions with respect to our previous findings are: i) We propose D-SCG that describes the spatial information on the proximity edges via directional relative positions, which improves the previous formulation of the SCG that represents the proximity edges as distances.…”
Section: Introductionmentioning
confidence: 93%
“…It then converts the relative positions into absolute ones which agree on a single final position of the target object. Differently from the previous work that localises object with Spatial Commonsense Graph (SCG-OL) [4], our new D-SCG represents the proximity edges with directional relative positions, instead of the undirected relative distances. This allows us to regress and estimate the target position in an end-to-end trainable manner, without requiring a non-differentiable multilateration procedure for localising the target object, thus contributing to a more effective loss calculation and training procedure.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation