Arijit Ray scite author profile

Visual Question Answering (VQA) is the task of answering natural-language questions about images. We introduce the novel problem of determining the relevance of questions to images in VQA. Current VQA models do not reason about whether a question is even related to the given image (e.g., What is the capital of Argentina?) or if it requires information from external resources to answer correctly. This can break the continuity of a dialogue in human-machine interaction. Our approaches for determining relevance are composed of two stages. Given an image and a question, (1) we first determine whether the question is visual or not, (2) if visual, we determine whether the question is relevant to the given image or not. Our approaches, based on LSTM-RNNs, VQA model uncertainty, and caption-question similarity, are able to outperform strong baselines on both relevance tasks. We also present human studies showing that VQA models augmented with such question relevance reasoning are perceived as more intelligent, reasonable, and human-like.

show abstract

Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation

Ray¹,

Sikka²,

Divakaran³

et al. 2019

View full text Add to dashboard Cite

While models for Visual Question Answering (VQA) have steadily improved over the years, interacting with one quickly reveals that these models lack consistency. For instance, if a model answers "red" to "What color is the balloon?", it might answer "no" if asked, "Is the balloon red?". These responses violate simple notions of entailment and raise questions about how effectively VQA models ground language. In this work, we introduce a dataset, ConVQA, and metrics that enable quantitative evaluation of consistency in VQA. For a given observable fact in an image (e.g. the balloon's color), we generate a set of logically consistent question-answer (QA) pairs (e.g. Is the balloon red?) and also collect a humanannotated set of common-sense based consistent QA pairs (e.g. Is the balloon the same color as tomato sauce?). Further, we propose a consistency-improving data augmentation module, a Consistency Teacher Module (CTM). CTM automatically generates entailed (or similar-intent) questions for a source QA pair and fine-tunes the VQA model if the VQA's answer to the entailed question is consistent with the source QA pair. We demonstrate that our CTM-based training improves the consistency of VQA models on the Con-VQA datasets and is a strong baseline for further research.

show abstract

Carbonate geochemistry across the Eocene/Oligocene boundary of Kutch, western India: implications to oceanic O2-poor condition and foraminiferal extinction

et al. 2003

View full text Add to dashboard Cite

Petrology, geochemistry and paleomagnetism of the earliest magmatic rocks of Deccan Volcanic Province, Kutch, Northwest India

Paul

Ray

Das

et al. 2008

Lithos

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Arijit Ray

Deccan plume, lithosphere rifting, and volcanism in Kutch, India

Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions

Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation

Carbonate geochemistry across the Eocene/Oligocene boundary of Kutch, western India: implications to oceanic O2-poor condition and foraminiferal extinction

Petrology, geochemistry and paleomagnetism of the earliest magmatic rocks of Deccan Volcanic Province, Kutch, Northwest India

Contact Info

Product

Resources

About