2022 IEEE International Conference on Multimedia and Expo (ICME)
DOI: 10.1109/icme52920.2022.9859591

Question-Driven Graph Fusion Network for Visual Question Answering

Cited by 12 publications
(1 citation statement)
References 15 publications
“…Recently, several models using multimodal techniques for VQA have been proposed. For instance, Qian et al [43] proposed a QD-GFN model that utilizes question information to guide the aggregation of semantic, spatial, and implicit graphs. Their model also incorporates an object filtering mechanism to remove irrelevant objects.…”
Section: Multimodal Interaction
Type: mentioning
confidence: 99%