David R. So scite author profile

We present Meena, a multi-turn open-domain chatbot trained end-to-end on data mined and filtered from public domain social media conversations. This 2.6B parameter neural network is simply trained to minimize perplexity of the next token. We also propose a human evaluation metric called Sensibleness and Specificity Average (SSA), which captures key elements of a human-like multi-turn conversation. Our experiments show strong correlation between perplexity and SSA. The fact that the best perplexity end-to-end trained Meena scores high on SSA (72% on multi-turn evaluation) suggests that a human-level SSA of 86% is potentially within reach if we can better optimize perplexity. Additionally, the full version of Meena (with a filtering mechanism and tuned decoding) scores 79% SSA, 23% higher in absolute SSA than the existing chatbots we evaluated.

show abstract

Carbon Emissions and Large Neural Network Training

Patterson¹,

Le²,

Chen³

et al. 2021

Preprint

156

180

View full text Add to dashboard Cite

Classification of crystallization outcomes using deep convolutional neural networks

et al. 2018

View full text Add to dashboard Cite

The Machine Recognition of Crystallization Outcomes (MARCO) initiative has assembled roughly half a million annotated images of macromolecular crystallization experiments from various sources and setups. Here, state-of-the-art machine learning algorithms are trained and tested on different parts of this data set. We find that more than 94% of the test images can be correctly labeled, irrespective of their experimental origin. Because crystal recognition is key to high-density screening and the systematic analysis of crystallization experiments, this approach opens the door to both industrial and fundamental research applications.

show abstract

The Carbon Footprint of Machine Learning Training Will Plateau, Then Shrink

et al. 2022

View full text Add to dashboard Cite

The Carbon Footprint of Machine Learning Training Will Plateau, Then Shrink

Patterson¹,

Hölzle²,

Le³

et al. 2022

Preprint

View full text Add to dashboard Cite

<div> <div> <div> <p>Machine Learning (ML) workloads have rapidly grown in importance, but raised concerns about their carbon footprint. Four best practices can reduce ML training energy by up to 100x and CO2 emissions up to 1000x. By following best practices, overall ML energy use (across research, development, and production) held steady at <15% of Google’s total energy use for the past three years. If the whole ML field were to adopt best practices, total carbon emissions from training would reduce. Hence, we recommend that ML papers include emissions explicitly to foster competition on more than just model quality. As estimates of emissions in papers that omitted them have been off 100x–100,000x, publishing emissions has the added benefit of ensuring accurate accounting. Given the importance of climate change, we must get the numbers right to make certain that we work on its biggest challenges.<br></p> </div> </div> </div>

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

David R. So

Towards a Human-like Open-Domain Chatbot

Carbon Emissions and Large Neural Network Training

Classification of crystallization outcomes using deep convolutional neural networks

The Carbon Footprint of Machine Learning Training Will Plateau, Then Shrink

The Carbon Footprint of Machine Learning Training Will Plateau, Then Shrink

Contact Info

Product

Resources

About