Francis Song scite author profile

Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world. In this paper, we present an analysis of Transformer-based language model performance across a wide range of model scales, from models with tens of millions of parameters up to a 280 billion parameter model called Gopher. These models are evaluated on 152 diverse tasks, achieving state-of-the-art performance across the majority. Gains from scale are largest in areas such as reading comprehension, fact-checking, and the identification of toxic language, but logical and mathematical reasoning see less benefit. We provide a holistic analysis of the training dataset and model's behaviour, covering the intersection of model scale with bias and toxicity. Finally, we discuss the application of language models to AI safety and the mitigation of downstream harms.

Red Teaming Language Models with Language Models

Perez, Huang, Song et al. 2022

Preprint

A Molecular Variant of Angiotensinogen Is Associated With Idiopathic Intrauterine Growth Restriction

Zhang, Varner, Dizon‐Townson et al. 2003

Obstetrics & Gynecology

Teaching language models to support answers with verified quotes

Menick, Trebacz, Mikulik et al. 2022

Preprint

Recent large language models often answer factual questions correctly. But users can't trust any given claim a model makes without fact-checking, because language models can hallucinate convincing nonsense. In this work we use reinforcement learning from human preferences (RLHP) to train "open-book" QA models that generate answers whilst also citing specific evidence for their claims, which aids in the appraisal of correctness. Supporting evidence is drawn from multiple documents found via a search engine, or from a single user-provided document. Our 280 billion parameter model, GopherCite, is able to produce answers with high-quality supporting evidence and abstain from answering when unsure. We measure the performance of GopherCite by conducting human evaluation of answers to questions in a subset of the Natural Questions and ELI5 datasets. The model's response is found to be high-quality 80% of the time on this Natural Questions subset, and 67% of the time on the ELI5 subset. Abstaining from the third of questions for which it is most unsure improves performance to 90% and 80% respectively, approaching human baselines. However, analysis on the adversarial TruthfulQA dataset shows why citation is only one part of an overall strategy for safety and trustworthiness: not all claims supported by evidence are true.

Alchemy: A benchmark and analysis toolkit for meta-reinforcement learning agents

Wang, King, Porcel et al. 2021

Preprint

Scaling Language Models: Methods, Analysis & Insights from Training Gopher
