2022
DOI: 10.1109/tnnls.2020.3027575

Configurable Graph Reasoning for Visual Relationship Detection

Cited by 9 publications (3 citation statements) | References 30 publications

“…In recent years, Zhu et al. [12] proposed configurable graph reasoning (CGR), which decomposes the reasoning path of visual relationships and the incorporation of external knowledge, achieving configurable knowledge selection and personalized graph reasoning for each relationship type in each image. In addition, given a common-sense knowledge graph, CGR adaptively configures the reasoning path based on the knowledge graph, bridging the semantic gap between common-sense knowledge and real-world scenes and achieving better knowledge generalization [12]. Hung et al. [9] proposed a context-augmented translation embedding model that can capture both common and rare relationships.…”
Section: Related Work, A. Relationship Detection (mentioning)
confidence: 99%
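
The configurable-selection idea quoted above can be illustrated with a small sketch. The toy knowledge graph, the `configure_path` function, and the threshold-based edge selection below are hypothetical illustrations of decomposing a per-relationship reasoning path over a common-sense graph, not the exact CGR formulation from Zhu et al. [12].

```python
# Minimal sketch of a "configurable reasoning path" over a common-sense
# knowledge graph. The graph, scoring, and selection rule are hypothetical
# illustrations, not the CGR model from Zhu et al. [12].
from typing import Dict, List, Tuple

# Toy knowledge graph: predicate -> related predicates with relevance weights.
KG: Dict[str, List[Tuple[str, float]]] = {
    "ride": [("sit on", 0.8), ("on", 0.6)],
    "sit on": [("on", 0.9)],
    "on": [],
}

def configure_path(predicate: str, depth: int = 2, threshold: float = 0.5) -> List[str]:
    """Select a per-relationship reasoning path by walking the knowledge
    graph and keeping only edges whose relevance exceeds a threshold."""
    path, frontier = [predicate], [predicate]
    for _ in range(depth):
        nxt = []
        for node in frontier:
            for neighbor, weight in KG.get(node, []):
                if weight >= threshold and neighbor not in path:
                    path.append(neighbor)
                    nxt.append(neighbor)
        frontier = nxt
    return path

# Each relationship type in each image gets its own configured path:
print(configure_path("ride"))  # ['ride', 'sit on', 'on']
```

In the actual model the selection would be learned per image rather than fixed by a hand-set threshold; the sketch only shows the structure of the idea.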
“…VRD contains 5,000 images (4,000 for training and 1,000 for testing) with 100 object categories and 70 predicates. In total, it contains 37,993 relationship instances spanning 6,672 unique relationship types, with an average of 24.25 predicates per object category [12]. Visual Genome (VG), released by Stanford University in 2015, contains a large number of images whose content semantic information is richer than that of ImageNet.…”
Section: A. Datasets (mentioning)
confidence: 99%
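
As a rough illustration of how per-category statistics like those quoted above could be derived, the sketch below computes analogous counts over a handful of hypothetical (subject, predicate, object) annotations; the annotation format and variable names are assumptions, not the VRD release format.

```python
# Hedged sketch: computing relationship-dataset statistics analogous to the
# VRD figures quoted above. The triplet annotation format is an assumption.
from collections import defaultdict

# Hypothetical annotations: (subject_category, predicate, object_category).
annotations = [
    ("person", "ride", "horse"),
    ("person", "wear", "hat"),
    ("horse", "on", "grass"),
]

relationship_types = set(annotations)        # unique (subject, predicate, object) types
predicates_per_category = defaultdict(set)   # object category -> predicates it appears with
for subj, pred, obj in annotations:
    predicates_per_category[subj].add(pred)
    predicates_per_category[obj].add(pred)

n_categories = len(predicates_per_category)
avg_predicates = sum(len(p) for p in predicates_per_category.values()) / n_categories
print(len(annotations), len(relationship_types), round(avg_predicates, 2))
# -> 3 instances, 3 unique types, 1.5 predicates per category for this toy data
```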
“…Visual language navigation removes the dependence of mobile robots on map-based navigation while moving, and can guide robot movement by integrating language descriptions with visual scenes. Visual relationship detection (VRD) [3] is applied to extract the scene features of objects in order to determine landmarks and select which action to execute. At present, visual language navigation systems are based on a single language description, which is suitable only for short-distance navigation.…”
Section: Introduction (mentioning)
confidence: 99%
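
To make the landmark-determination step concrete, here is a minimal sketch in which VRD-style triplets are matched against a language instruction; the `select_landmark` function, the detection format, and the confidence-based choice are hypothetical illustrations, not taken from [3].

```python
# Minimal sketch: using VRD-style output to pick a navigation landmark from
# a language instruction. Names, format, and scoring are hypothetical.
from typing import List, Optional, Tuple

# VRD-style output for a scene: (subject, predicate, object, confidence).
detections: List[Tuple[str, str, str, float]] = [
    ("door", "next to", "window", 0.9),
    ("chair", "in front of", "table", 0.7),
]

def select_landmark(instruction: str,
                    detections: List[Tuple[str, str, str, float]]) -> Optional[str]:
    """Return the subject of the most confident detected relationship
    whose subject or object is mentioned in the instruction."""
    mentioned = [d for d in detections
                 if d[0] in instruction or d[2] in instruction]
    best = max(mentioned, key=lambda d: d[3], default=None)
    return best[0] if best else None

print(select_landmark("go to the door next to the window", detections))  # door
```

A real system would ground phrases to detections with learned embeddings rather than substring matching; the sketch only shows where VRD output plugs into landmark selection.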