Miles Brundage scite author profile

We introduce Codex, a GPT language model finetuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J solves 11.4%. Furthermore, we find that repeated sampling from the model is a surprisingly effective strategy for producing working solutions to difficult prompts. Using this method, we solve 70.2% of our problems with 100 samples per problem. Careful investigation of our model reveals its limitations, including difficulty with docstrings describing long chains of operations and with binding operations to variables. Finally, we discuss the potential broader impacts of deploying powerful code generation technologies, covering safety, security, and economics.

show abstract

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims

Brundage¹,

Avin²,

Wang³

et al. 2020

Preprint

100

View full text Add to dashboard Cite

Limitations and risks of machine ethics

Brundage

2014

Journal of Experimental & Theoretical Artificial Intelligen

View full text Add to dashboard Cite

All the News That’s Fit to Fabricate: AI-Generated Text as a Tool of Media Misinformation

2020

View full text Add to dashboard Cite

Online misinformation has become a constant; only the way actors create and distribute that information is changing. Advances in artificial intelligence (AI) such as GPT-2 mean that actors can now synthetically generate text in ways that mimic the style and substance of human-created news stories. We carried out three original experiments to study whether these AI-generated texts are credible and can influence opinions on foreign policy. The first evaluated human perceptions of AI-generated text relative to an original story. The second investigated the interaction between partisanship and AI-generated news. The third examined the distributions of perceived credibility across different AI model sizes. We find that individuals are largely incapable of distinguishing between AI- and human-generated text; partisanship affects the perceived credibility of the story; and exposure to the text does little to change individuals’ policy views. The findings have important implications in understanding AI in online misinformation campaigns.

show abstract

All the News that’s Fit to Fabricate: AI-Generated Text as a Tool of Media Misinformation

Kreps

McCain²,

Brundage³

2020

SSRN Journal

View full text Add to dashboard Cite

Artificial Intelligence and Responsible Innovation

Brundage

2016

View full text Add to dashboard Cite

Filling gaps in trustworthy development of AI

Avin

Belfield

Brundage

et al. 2021

Science

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Miles Brundage

Deep Reinforcement Learning: A Brief Survey

Evaluating Large Language Models Trained on Code

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims

Limitations and risks of machine ethics

All the News That’s Fit to Fabricate: AI-Generated Text as a Tool of Media Misinformation

All the News that’s Fit to Fabricate: AI-Generated Text as a Tool of Media Misinformation

Artificial Intelligence and Responsible Innovation

Filling gaps in trustworthy development of AI

Contact Info

Product

Resources

About