Knowledge-Embedded Routing Network for Scene Graph Generation

Chen, Tianshui; Yu, Weihao; Chen, Riquan; Li, Lin

doi:10.1109/cvpr.2019.00632

Cited by 369 publications

(358 citation statements)

References 34 publications

Supporting

Mentioning

358

Contrasting

“…After the definition of the scene graph, the possibility of generating a scene graph from an image I can be composed by three components as similar to [28]:…”

Section: Methods (mentioning)

confidence: 99%

“…Ref. [28] uses a gated graph neural network to model the fully connected scene graph. Each node will be affected equally by all other nodes in the graph.…”

Section: Related Work (mentioning)

confidence: 99%

“…In this way, each node can aggregate messages from the other nodes and transfer its message to the other nodes in the meantime, enabling interactions among all nodes in the graph. After T timesteps, we can obtain the final hidden state for each node, which can be represented by a set of subnode [28], we use a fully connected layer that takes the initial hidden state and final hidden state as input to compute the output feature for each subnode…”

Section: Attentive Gated Graph Neural Network (mentioning)

confidence: 99%
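The propagation described in the statement above can be illustrated with a minimal NumPy sketch. It is an assumption-laden illustration, not the cited authors' implementation: the GRU-style gate equations, the parameter names (`Wz`, `Uz`, `Wr`, `Ur`, `Wh`, `Uh`, `Wo`, `bo`), the dimensions, and the uniform fully connected adjacency are all hypothetical choices made here for clarity.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def ggnn_propagate(h0, adj, params, T=3):
    """Run T gated (GRU-style) message-passing steps.

    h0:  (N, D) initial hidden state per node
    adj: (N, N) edge weights; adj[i, j] weights the message from node j to node i
    """
    h = h0
    for _ in range(T):
        a = adj @ h  # each node aggregates messages from its neighbours
        z = sigmoid(a @ params["Wz"] + h @ params["Uz"])  # update gate
        r = sigmoid(a @ params["Wr"] + h @ params["Ur"])  # reset gate
        h_tilde = np.tanh(a @ params["Wh"] + (r * h) @ params["Uh"])
        h = (1.0 - z) * h + z * h_tilde
    return h

def node_output(h0, hT, Wo, bo):
    """Fully connected layer over the concatenated initial and final states."""
    return np.concatenate([h0, hT], axis=1) @ Wo + bo

# Hypothetical sizes and randomly initialised parameters, for illustration only.
rng = np.random.default_rng(0)
N, D, D_out = 4, 8, 5
params = {name: rng.normal(scale=0.1, size=(D, D))
          for name in ("Wz", "Uz", "Wr", "Ur", "Wh", "Uh")}
h0 = rng.normal(size=(N, D))
adj = np.ones((N, N)) - np.eye(N)  # fully connected graph without self-loops

hT = ggnn_propagate(h0, adj, params, T=3)
out = node_output(h0, hT, rng.normal(scale=0.1, size=(2 * D, D_out)), np.zeros(D_out))
```

With the uniform `adj` used here, every node is affected equally by all other nodes; replacing `adj` with learned attention weights would be one way to vary the strength of each interaction.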

Attentive Gated Graph Neural Network for Image Scene Graph Generation

Tang

Zhang

et al. 2020

Symmetry

Image scene graph is a semantic structural representation which can not only show what objects are in the image, but also infer the relationships and interactions among them. Despite the recent success in object detection using deep neural networks, automatically recognizing social relations of objects in images remains a challenging task due to the significant gap between the domains of visual content and social relation. In this work, we translate the scene graph into an Attentive Gated Graph Neural Network which can propagate a message by visual relationship embedding. More specifically, nodes in gated neural networks can represent objects in the image, and edges can be regarded as relationships among objects. In this network, an attention mechanism is applied to measure the strength of the relationship between objects. It can increase the accuracy of object classification and reduce the complexity of relationship classification. Extensive experiments on the widely adopted Visual Genome Dataset show the effectiveness of the proposed method.

“…Inspired by the current graph propagation works [18,1,27,3], we adopt a gated recurrent update mechanism to propagate message through the graph and learn contextualized node-level features. Specifically, for each node v_c ∈ V, it has a hidden state h_c^t at timestep t. In this work, as each node corresponds to a specific category and our model aims to explore the interactions among the semantic-specific features, we initialize the hidden state at t = 0 with the feature vector that relates to the corresponding category, formulated as…”

Section: Semantic Interaction (mentioning)

confidence: 99%

Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition

Chen

Hui

et al. 2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

Self Cite

244

188

Recognizing multiple labels of images is a practical and challenging task, and significant progress has been made by searching semantic-aware regions and modeling label dependency. However, current methods cannot locate the semantic regions accurately due to the lack of part-level supervision or semantic guidance. Moreover, they cannot fully explore the mutual interactions among the semantic regions and do not explicitly model the label co-occurrence.

“…2) MSDN [30], IMP [62], TFR [20], MOTIFS [73], Graph-RCNN [65], GPI [17], KER [6] are joint inference models, which adopt message passing to encode the context. All these models are optimized by XE based training objective.…”

mentioning

confidence: 99%

Counterfactual Critic Multi-Agent Training for Scene Graph Generation

Chen

Zhang

Xiao

et al. 2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

149

105

Scene graphs - objects as nodes and visual relationships as edges - describe the whereabouts and interactions of objects in an image for comprehensive scene understanding. To generate coherent scene graphs, almost all existing methods exploit the fruitful visual context by modeling message passing among objects. For example, "person" on "bike" can help to determine the relationship "ride", which in turn contributes to the confidence of the two objects. However, we argue that the visual context is not properly learned by using the prevailing cross-entropy based supervised learning paradigm, which is not sensitive to graph inconsistency: errors at the hub or non-hub nodes should not be penalized equally. To this end, we propose a Counterfactual critic Multi-Agent Training (CMAT) approach. CMAT is a multi-agent policy gradient method that frames objects into cooperative agents, and then directly maximizes a graph-level metric as the reward. In particular, to assign the reward properly to each agent, CMAT uses a counterfactual baseline that disentangles the agent-specific reward by fixing the predictions of other agents. Extensive validations on the challenging Visual Genome benchmark show that CMAT achieves a state-of-the-art performance by significant gains under various settings and metrics.
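The counterfactual baseline mentioned in this abstract can be illustrated with a small sketch. It is only one plausible reading, not the CMAT implementation: it assumes the baseline is the expected reward when a single agent's prediction is marginalized under its own policy while the other agents' predictions stay fixed, and it uses a hypothetical stand-in reward (`graph_reward`, the fraction of correctly labelled objects) in place of a real graph-level metric.

```python
import numpy as np

def graph_reward(pred, gold):
    """Hypothetical graph-level reward: fraction of objects labelled correctly."""
    return float(np.mean(np.asarray(pred) == np.asarray(gold)))

def counterfactual_advantage(probs, actions, gold, agent):
    """Reward of the joint prediction minus a counterfactual baseline.

    The baseline averages the reward over every alternative label for `agent`,
    weighted by that agent's policy, with all other agents' labels held fixed.
    """
    actual = graph_reward(actions, gold)
    baseline = 0.0
    for label, p in enumerate(probs[agent]):
        alt = list(actions)
        alt[agent] = label  # change only this agent's prediction
        baseline += p * graph_reward(alt, gold)
    return actual - baseline

# Toy example: three object agents, three candidate labels each (made-up numbers).
probs = np.array([[0.7, 0.2, 0.1],
                  [0.1, 0.8, 0.1],
                  [0.3, 0.3, 0.4]])
gold = [0, 1, 2]
actions = [0, 1, 0]  # the third agent is wrong
advantages = [counterfactual_advantage(probs, actions, gold, i) for i in range(3)]
```

In this toy setup the two correct agents receive a positive advantage and the incorrect agent a negative one, even though all three share the same graph-level reward.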
