2020
DOI: 10.1007/978-3-030-66096-3_8
|View full text |Cite
|
Sign up to set email alerts
|

A Dataset and Baselines for Visual Question Answering on Art

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 33 publications
(21 citation statements)
references
References 46 publications
0
21
0
Order By: Relevance
“…NoisyArt [5] is a dataset composed of artwork images collected from Google Images and Flickr containing also metadata (e.g., artwork title, comments, description and creation location) gathered from DBpedia. A recent work presented the AQUA [10] (Art QUestion Answering) dataset which contains question-answer pairs automatically generated using state-ofthe-art question generation methods on the basis of paintings and comments provided by the SemArt [10] dataset. EGO-CH [28] is a dataset of egocentric videos for visitors' behaviour understanding in cultural sites.…”
Section: Datasets In Cultural Sitesmentioning
confidence: 99%
“…NoisyArt [5] is a dataset composed of artwork images collected from Google Images and Flickr containing also metadata (e.g., artwork title, comments, description and creation location) gathered from DBpedia. A recent work presented the AQUA [10] (Art QUestion Answering) dataset which contains question-answer pairs automatically generated using state-ofthe-art question generation methods on the basis of paintings and comments provided by the SemArt [10] dataset. EGO-CH [28] is a dataset of egocentric videos for visitors' behaviour understanding in cultural sites.…”
Section: Datasets In Cultural Sitesmentioning
confidence: 99%
“…In [ 36 ], the authors annotated a subset of the ArtPedia dataset with visual and contextual question–answer pairs and introduced a question classifier that discriminates between visual and contextual questions and a model that is able to answer both types of questions. In [ 37 ], the authors introduce a novel dataset AQUA (Art QUestion Answering), which consists of automatically generated visual and knowledge-based question-answer pairs, and also present a two-branch model where the visual and knowledge questions are handled independently.…”
Section: Related Workmentioning
confidence: 99%
“…They introduced a question classifier that discriminates between visual and contextual questions and a model capable of answering both types of questions. Garcia et al [49] presented a novel dataset AQUA, which consists of automatically generated visual-and knowledge-based QA pairs, and introduced a two-branch model where the visual and knowledge questions are managed independently. Apart from VAQ, a few recent works addressed the task of image captioning where the goal is to automatically generate accurate textual descriptions of images.…”
Section: Multimodal Tasksmentioning
confidence: 99%