2018
DOI: 10.48550/arxiv.1811.06410
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

LinkNet: Relational Embedding for Scene Graph

Abstract: Objects and their relationships are critical contents for image understanding. A scene graph provides a structured description that captures these properties of an image. However, reasoning about the relationships between objects is very challenging and only a few recent works have attempted to solve the problem of generating a scene graph from an image. In this paper, we present a method that improves scene graph generation by explicitly modeling inter-dependency among the entire object instances. We design a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 29 publications
(50 reference statements)
0
1
0
Order By: Relevance
“…Each scene graph encodes high-level visual concepts, including objects, attributes, and relations, as nodes and edges. This type of structured representation has led to many state-of-the-art models in image captioning [43], image retrieval [17], relationship modeling [40], and image generation [16]. In addition, it promotes the publishing of several datasets containing well-annotated scene graphs, such as GQA [14] and Visual Genome [19], to facilitate research on the use of scene graphs for visual and textual content understanding.…”
Section: Introductionmentioning
confidence: 99%
“…Each scene graph encodes high-level visual concepts, including objects, attributes, and relations, as nodes and edges. This type of structured representation has led to many state-of-the-art models in image captioning [43], image retrieval [17], relationship modeling [40], and image generation [16]. In addition, it promotes the publishing of several datasets containing well-annotated scene graphs, such as GQA [14] and Visual Genome [19], to facilitate research on the use of scene graphs for visual and textual content understanding.…”
Section: Introductionmentioning
confidence: 99%