The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2016
DOI: 10.48550/arxiv.1612.00837
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

2
108
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 25 publications
(110 citation statements)
references
References 0 publications
2
108
0
Order By: Relevance
“…The task of visual question answering (VQA) relates visual concepts with elements of language and, occasionally, common-sense or general knowledge. Examples of training questions and their correct answer from the VQA v2 dataset [14].…”
Section: What Is On the Coffee Table ? What Color Is The Hydrant ? Ca...mentioning
confidence: 99%
See 4 more Smart Citations
“…The task of visual question answering (VQA) relates visual concepts with elements of language and, occasionally, common-sense or general knowledge. Examples of training questions and their correct answer from the VQA v2 dataset [14].…”
Section: What Is On the Coffee Table ? What Color Is The Hydrant ? Ca...mentioning
confidence: 99%
“…on the VQA v2 benchmark [14]. Admittedly, a large part of such a search is necessarily guided by empirical exploration and validation.…”
Section: What Is On the Coffee Table ? What Color Is The Hydrant ? Ca...mentioning
confidence: 99%
See 3 more Smart Citations