2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.01138
|View full text |Cite
|
Sign up to set email alerts
|

Fully Convolutional Scene Graph Generation

Abstract: Scene Graph Generation (SGG) has achieved significant progress recently. However, most previous works rely heavily on fixed-size entity representations based on bounding box proposals, anchors, or learnable queries. As each representation's cardinality has different trade-offs between performance and computation overhead, extracting highly representative features efficiently and dynamically is both challenging and crucial for SGG. In this work, a novel architecture called RepSGG is proposed to address the afor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 64 publications
(29 citation statements)
references
References 92 publications
(133 reference statements)
0
18
0
Order By: Relevance
“…than another one-stage model FCSGG [60]. Our model is also competitive compared with recent two-stage models, and outperforms state-of-the-art visual-based methods.…”
Section: Visual Genomementioning
confidence: 63%
See 3 more Smart Citations
“…than another one-stage model FCSGG [60]. Our model is also competitive compared with recent two-stage models, and outperforms state-of-the-art visual-based methods.…”
Section: Visual Genomementioning
confidence: 63%
“…Compared to the boom of two-stage approaches, onestage approaches are still in their infancy and have the advantage of being simple, fast and easy to train. To the best of our knowledge, FCSGG [60] is currently the only one-stage scene graph generation framework that encodes objects as box center points and relationships as 2D vector fields. While FCSGG model being lightweight and fast speed, it has a significant performance gap compared to other twostage methods.…”
Section: Scene Graph Generationmentioning
confidence: 99%
See 2 more Smart Citations
“…Scene graphs With the aforementioned goal, scene graphs, were pioneered by [4], where detected objects from input images are modeled as nodes and semantic relations between them are modeled as edges with semantic relational labels. Recently, more approaches [8], [19], [20], [21] are proposed on top of this and deliver state-of-the-art performance and efficiency on public benchmarks like Visual Genome [12]. Naturally, all of them rely on ground truth provided by the dataset for training deep scene graph generation models, and this holds for each new domain-specific task.…”
Section: Related Workmentioning
confidence: 99%