Pegah Alipoormolabashi scite author profile

Pegah Alipoormolabashi

5Publications

48Citation Statements Received

64Citation Statements Given

How they've been cited

How they cite others

Affiliations

Southern California University for Professional Studies, University of Southern California

Publications

Order By: Most citations

Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks

Wang¹,

Mishra²,

Alipoormolabashi³

et al. 2022

Preprint

View full text Add to dashboard Cite

How can we measure the generalization of models to a variety of unseen tasks when provided with their language instructions? To facilitate progress in this goal, we introduce NATURAL-INSTRUCTIONS v2 , a benchmark of 1,600+ diverse language tasks and their expertwritten instructions. It covers 70+ distinct task types, such as tagging, in-filling, and rewriting. These tasks are collected with contributions of NLP practitioners in the community and through an iterative peer review process to ensure their quality. With this large and diverse collection of tasks, we are able to rigorously benchmark cross-task generalization of models-training on a subset of tasks and evaluating on the remaining unseen ones. For instance, we quantify generalization as a function of various scaling parameters, such as the number of observed tasks, the number of instances, and model sizes. Based on these insights, we introduce Tk-INSTRUCT, an encoder-decoder Transformer that is trained to follow a variety of in-context instructions (plain language task definitions or k-shot examples) which outperforms existing larger models on our benchmark. We hope this benchmark facilitates future progress toward more general-purpose language understanding models. 1

show abstract

Understanding Multimodal Procedural Knowledge by Sequencing Multimodal Instructional Manuals

Wu¹,

Spangher²,

Alipoormolabashi³

et al. 2022

View full text Add to dashboard Cite

Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Wang¹,

Mishra²,

Alipoormolabashi³

et al. 2022

View full text Add to dashboard Cite

COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences

Singh

Wen

Hou

et al. 2021

View full text Add to dashboard Cite

Commonsense reasoning is intuitive for humans but has been a long-term challenge for artificial intelligence (AI). Recent advancements in pretrained language models have shown promising results on several commonsense benchmark datasets. However, the reliability and comprehensiveness of these benchmarks towards assessing model's commonsense reasoning ability remains unclear. To this end, we introduce a new commonsense reasoning benchmark dataset comprising natural language true/false statements, with each sample paired with its complementary counterpart, resulting in 4k sentence pairs. We propose a pairwise accuracy metric to reliably measure an agent's ability to perform commonsense reasoning over a given situation. The dataset is crowdsourced and enhanced with an adversarial model-in-the-loop setup to incentivize challenging samples. To facilitate a systematic analysis of commonsense capabilities, we design our dataset along the dimensions of knowledge domains, reasoning scenarios and numeracy. Experimental results demonstrate that our strongest baseline (UnifiedQA-3B), after fine-tuning, achieves~71% standard accuracy and~51% pairwise accuracy, well below human performance (~95% for both metrics).

show abstract

COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences

Singh

Wen

Hou

et al. 2021

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.