Zaid Alyafeai scite author profile

Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a consequence of implicit multitask learning in language model training . Can zero-shot generalization instead be directly induced by explicit multitask learning? To test this question at scale, we develop a system for easily mapping general natural language tasks into a human-readable prompted form. We convert a large set of supervised datasets, each with multiple prompts using varying natural language. These prompted datasets allow for benchmarking the ability of a model to perform completely unseen tasks specified in natural language. We fine-tune a pretrained encoder-decoder model on this multitask mixture covering a wide variety of tasks. The model attains strong zero-shot performance on several standard datasets, often outperforming models up to 16× its size. Further, our approach attains strong performance on a subset of tasks from the BIG-Bench benchmark, outperforming models up to 6× its size. All prompts and trained models are available at github.com/bigscience-workshop/promptsource/ and huggingface.co/bigscience/T0pp.

show abstract

A fully-automated deep learning pipeline for cervical cancer classification

Alyafeai

Ghouti

2020

Expert Systems with Applications

118

View full text Add to dashboard Cite

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts

Bach¹,

Sanh²,

Yong³

et al. 2022

View full text Add to dashboard Cite

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Scao¹,

Fan²,

Akiki³

et al. 2022

Preprint

View full text Add to dashboard Cite

Crosslingual Generalization through Multitask Finetuning

Muennighoff¹,

Wang²,

Sutawika³

et al. 2023

View full text Add to dashboard Cite

Evaluating Various Tokenizers for Arabic Text Classification

Alyafeai

Al-shaibani

Ghaleb

et al. 2022

Neural Process Lett

View full text Add to dashboard Cite

Meter classification of Arabic poems using deep bidirectional recurrent neural networks

Al-shaibani

Alyafeai

Ahmad

2020

Pattern Recognition Letters

View full text Add to dashboard Cite

ARBML: Democritizing Arabic Natural Language Processing Tools

Alyafeai¹,

Al-shaibani²

2020

View full text Add to dashboard Cite

Automating natural language understanding is a lifelong quest addressed for decades. With the help of advances in machine learning and particularly, deep learning, we are able to produce state of the art models that can imitate human interactions with languages. Unfortunately, these advances are controlled by the availability of language resources. Arabic advances in this field , although it has a great potential, are still limited. This is apparent in both research and development. In this paper, we showcase some NLP models we trained for Arabic. We also present our methodology and pipeline to build such models from data collection, data preprocessing, tokenization and model deployment. These tools help in the advancement of the field and provide a systematic approach for extending NLP tools to many languages.

show abstract

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Zaid Alyafeai

Multitask Prompted Training Enables Zero-Shot Task Generalization

A fully-automated deep learning pipeline for cervical cancer classification

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Crosslingual Generalization through Multitask Finetuning

Evaluating Various Tokenizers for Arabic Text Classification

Meter classification of Arabic poems using deep bidirectional recurrent neural networks

ARBML: Democritizing Arabic Natural Language Processing Tools

Contact Info

Product

Resources

About