Simpler and effective radiological evaluations for modiolar proximity of a slim modiolar cochlear implant electrode

Lee, Sang Yeon; Han, Jin Hee; Carandang, Marge; Bae, Yun Jung; Choi, Byung Yoon

doi:10.1038/s41598-020-74738-x

Neel Nanda

1Publication

8Citation Statements Received

22Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

Progress measures for grokking via mechanistic interpretability

Nanda¹,

Chan²,

Tom³

et al. 2023

Preprint

View full text Add to dashboard Cite

Neural networks often exhibit emergent behavior, where qualitatively new capabilities arise from scaling up the amount of parameters, training data, or training steps. One approach to understanding emergence is to find continuous progress measures that underlie the seemingly discontinuous qualitative changes. We argue that progress measures can be found via mechanistic interpretability: reverseengineering learned behaviors into their individual components. As a case study, we investigate the recently-discovered phenomenon of "grokking" exhibited by small transformers trained on modular addition tasks. We fully reverse engineer the algorithm learned by these networks, which uses discrete Fourier transforms and trigonometric identities to convert addition to rotation about a circle. We confirm the algorithm by analyzing the activations and weights and by performing ablations in Fourier space. Based on this understanding, we define progress measures that allow us to study the dynamics of training and split training into three continuous phases: memorization, circuit formation, and cleanup. Our results show that grokking, rather than being a sudden shift, arises from the gradual amplification of structured mechanisms encoded in the weights, followed by the later removal of memorizing components.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Neel Nanda

Progress measures for grokking via mechanistic interpretability

Contact Info

Product

Resources

About