Amanpreet Singh scite author profile

Every natural text is written in some style. The style is formed by a complex combination of different stylistic factors, including formality markers, emotions, metaphor, etc. Some factors implicitly reflect the author's personality, while others are explicitly controlled by the author's choices in order to achieve some personal or social goal. One cannot form a complete understanding of a text and its author without considering these factors. The factors combine and co-vary in complex ways to form styles. Studying the nature of the covarying combinations sheds light on stylistic language in general, sometimes called crossstyle language understanding. This paper provides a benchmark corpus (xSLUE) with an online platform (http://xslue.com) for crossstyle language understanding and evaluation. The benchmark contains text in 15 different styles and 23 classification tasks. For each task, we provide the fine-tuned classifier for further analysis. Our analysis shows that some styles are highly dependent on each other (e.g., impoliteness and offense), and some domains (e.g., tweets, political debates) are stylistically more diverse than others (e.g., academic manuscripts). We discuss the technical challenges of cross-style understanding and potential directions for future research: crossstyle modeling which shares the internal representation for low-resource or low-performance styles and other applications such as crossstyle generation.

show abstract

GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

Wang

Singh

Michael

et al. 2018

Preprint

450

606

View full text Add to dashboard Cite

Neural Network Acceptability Judgments

Warstadt

Singh

Bowman

2019

Transactions of the Association for Computational Linguistics

520

345

View full text Add to dashboard Cite

This paper investigates the ability of artificial neural networks to judge the grammatical acceptability of a sentence. Machine learning research of this kind is well placed to answer important open questions about the role of prior linguistic bias in language acquisition by providing a test for the Poverty of the Stimulus Argument. In service of this goal, we introduce the Corpus of Linguistic Acceptability (CoLA), a set of 10,657 English sentences labeled as grammatical or ungrammatical from published linguistics literature. As baselines, we train several recurrent neural network models for acceptability classification. These models show promise on the task, and error-analysis on specific grammatical phenomena reveals that they learn some systematic generalizations like subject-verbobject word order without any grammatical supervision. However, human-like performance across a wide range of grammatical constructions remains far off.

show abstract

Towards VQA Models That Can Read

et al. 2019

View full text Add to dashboard Cite

Studies have shown that a dominant class of questions asked by visually impaired users on images of their surroundings involves reading text in the image. But today's VQA models can not read! Our paper takes a first step towards addressing this problem. First, we introduce a new "TextVQA" dataset to facilitate progress on this important problem. Existing datasets either have a small proportion of questions about text (e.g., the VQA dataset) or are too small (e.g., the VizWiz dataset). TextVQA contains 45,336 questions on 28,408 images that require reasoning about text to answer. Second, we introduce a novel model architecture that reads text in the image, reasons about it in the context of the image and the question, and predicts an answer which might be a deduction based on the text and the image or is composed of the strings found in the image. Consequently, we call our approach Look, Read, Reason & Answer (LoRRA) 1 . We show that LoRRA outperforms existing state-of-the-art VQA models on our TextVQA dataset. We find that the gap between human performance and machine performance is significantly larger on TextVQA than on VQA 2.0, suggesting that TextVQA is well-suited to benchmark progress along directions complementary to VQA 2.0. VQA ComponentSimilar to many VQA models [7,17], we first embed the question words w 1 , w 2 , . . . , w L of the question q with a pre-trained embedding function (e.g. GloVe [36]) and then encode the resultant word embeddings iteratively with a re-

show abstract

Automatically generating test data from a Boolean specification

Weyuker

Goradia

Singh

1994

IIEEE Trans. Software Eng.

233

220

View full text Add to dashboard Cite

Iterative Answer Prediction With Pointer-Augmented Multimodal Transformers for TextVQA

et al. 2020

View full text Add to dashboard Cite

TextCaps: A Dataset for Image Captioning with Reading Comprehension

et al. 2020

View full text Add to dashboard Cite

Neural Network Acceptability Judgments

Warstadt¹,

Singh²,

Bowman³

2018

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Amanpreet Singh

GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

Neural Network Acceptability Judgments

Towards VQA Models That Can Read

Automatically generating test data from a Boolean specification

Iterative Answer Prediction With Pointer-Augmented Multimodal Transformers for TextVQA

TextCaps: A Dataset for Image Captioning with Reading Comprehension

Neural Network Acceptability Judgments

Contact Info

Product

Resources

About