Timo Schick scite author profile

Some NLP tasks can be solved in a fully unsupervised fashion by providing a pretrained language model with "task descriptions" in natural language (e.g., Radford et al., 2019). While this approach underperforms its supervised counterpart, we show in this work that the two ideas can be combined: We introduce Pattern-Exploiting Training (PET), a semi-supervised training procedure that reformulates input examples as cloze-style phrases to help language models understand a given task. These phrases are then used to assign soft labels to a large set of unlabeled examples. Finally, standard supervised training is performed on the resulting training set. For several tasks and languages, PET outperforms supervised training and strong semi-supervised approaches in lowresource settings by a large margin. 1

show abstract

It’s Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

Schick¹,

Schütze²

2021

353

292

View full text Add to dashboard Cite

When scaled to hundreds of billions of parameters, pretrained language models such as GPT-3 (Brown et al., 2020) achieve remarkable few-shot performance on challenging natural language understanding benchmarks. In this work, we show that performance similar to GPT-3 can be obtained with language models whose parameter count is several orders of magnitude smaller. This is achieved by converting textual inputs into cloze questions that contain some form of task description, combined with gradient-based optimization; additionally exploiting unlabeled data gives further improvements. Based on our findings, we identify several key factors required for successful natural language understanding with small language models. 1

show abstract

Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference

Schick¹,

Schütze²

2020

Preprint

108

View full text Add to dashboard Cite

Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification

Schick¹,

Schmid²,

Schütze³

2020

110

View full text Add to dashboard Cite

A recent approach for few-shot text classification is to convert textual inputs to cloze questions that contain some form of task description, process them with a pretrained language model and map the predicted words to labels. Manually defining this mapping between words and labels requires both domain expertise and an understanding of the language model's abilities. To mitigate this issue, we devise an approach that automatically finds such a mapping given small amounts of training data. For a number of tasks, the mapping found by our approach performs almost as well as hand-crafted label-to-word mappings. 1

show abstract

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Scao¹,

Fan²,

Akiki³

et al. 2022

Preprint

114

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Timo Schick

Exploiting Cloze-Questions for Few-Shot Text Classification and Natural Language Inference

It’s Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference

Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Contact Info

Product

Resources

About