2020
DOI: 10.1016/j.patcog.2020.107563
|View full text |Cite
|
Sign up to set email alerts
|

Cross-modal knowledge reasoning for knowledge-based visual question answering

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
39
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 87 publications
(39 citation statements)
references
References 5 publications
0
39
0
Order By: Relevance
“…Visual symbolic information in MMKG with graph-structured information conveying relations between visual concepts provides strong evidence to reason about the questions over graph network. Besides, the explicit semantic knowledge preserved in MMKG help refine the answers with more interpretability and generality [154]. The representations of different modalities preserved and unified in MMKG greatly benefit for relational reasoning across modalities.…”
Section: Visual Question Answeringmentioning
confidence: 99%
“…Visual symbolic information in MMKG with graph-structured information conveying relations between visual concepts provides strong evidence to reason about the questions over graph network. Besides, the explicit semantic knowledge preserved in MMKG help refine the answers with more interpretability and generality [154]. The representations of different modalities preserved and unified in MMKG greatly benefit for relational reasoning across modalities.…”
Section: Visual Question Answeringmentioning
confidence: 99%
“…Semantic relations features and additional commonsense knowledge answer the complex questions for natural language reasoning. J. Yu et al [111] proposed a framework in which visual contents of an image is extracted and processed in multiple perspectives of knowledge graph like semantic, visual, and factual perspectives.…”
Section: Multimodal External Knowledge Bases Models (Mmekm)mentioning
confidence: 99%
“…Once the particular type of question exceeds the scope of the question templates, the accuracy of the model decreases. Yu et al [29] formulated knowledge-based visual question answering as a recurrent reasoning process for obtaining complementary evidence from multimodal information. Marino et al [30] addressed the task of knowledgebased visual question answering and provided a benchmark where the image features relied on external knowledge resources.…”
Section: Knowledge Basementioning
confidence: 99%