Kevin Ro Wang scite author profile

Kevin Ro Wang

1Publication

16Citation Statements Received

29Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Geva¹,

Caciularu²,

Wang³

et al. 2022

Preprint

View full text Add to dashboard Cite

Transformer-based language models (LMs) are at the core of modern NLP, but their internal prediction construction process is opaque and largely not understood. In this work, we make a substantial step towards unveiling this underlying prediction process, by reverseengineering the operation of the feed-forward network (FFN) layers, one of the building blocks of transformer models. We view the token representation as a changing distribution over the vocabulary, and the output from each FFN layer as an additive update to that distribution. Then, we analyze the FFN updates in the vocabulary space, showing that each update can be decomposed to sub-updates corresponding to single FFN parameter vectors, each promoting concepts that are often humaninterpretable. We then leverage these findings for controlling LM predictions, where we reduce the toxicity of GPT2 by almost 50%, and for improving computation efficiency with a simple early exit rule, saving 20% of computation on average. 1 * Equal contribution. † Work done during an internship at AI2. 1 https://github.com/aviclu/ffn-values.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Kevin Ro Wang

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Contact Info

Product

Resources

About