Michael R. Zhang scite author profile

Michael R. Zhang

3 Publications

40 Citation Statements Received

42 Citation Statements Given


Publications


Lookahead Optimizer: k steps forward, 1 step back

Zhang, Lucas, Hinton et al. 2019

Preprint


The vast majority of successful deep neural networks are trained using variants of stochastic gradient descent (SGD) algorithms. Recent attempts to improve SGD can be broadly categorized into two approaches: (1) adaptive learning rate schemes, such as AdaGrad and Adam, and (2) accelerated schemes, such as heavy-ball and Nesterov momentum. In this paper, we propose a new optimization algorithm, Lookahead, that is orthogonal to these previous approaches and iteratively updates two sets of weights. Intuitively, the algorithm chooses a search direction by looking ahead at the sequence of "fast weights" generated by another optimizer. We show that Lookahead improves the learning stability and lowers the variance of its inner optimizer with negligible computation and memory cost. We empirically demonstrate Lookahead can significantly improve the performance of SGD and Adam, even with their default hyperparameter settings on ImageNet, CIFAR-10/100, neural machine translation, and Penn Treebank.
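The two sets of weights described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It assumes plain gradient descent as the inner optimizer and a linear interpolation of the slow weights toward the fast weights after every k inner steps, which is how the update is commonly described. The names `lookahead_sgd`, `quadratic_grad`, `lr`, `k`, `alpha`, and `outer_steps` are hypothetical and chosen only for illustration.

```python
def lookahead_sgd(grad, w0, lr=0.1, k=5, alpha=0.5, outer_steps=50):
    """Illustrative Lookahead loop around plain gradient descent.

    grad: function returning the gradient (list of floats) at a point.
    k: number of inner ("fast") steps per outer step (assumed name).
    alpha: interpolation rate for the slow weights (assumed name).
    """
    slow = list(w0)
    for _ in range(outer_steps):
        # The fast weights start each outer step at the slow weights.
        fast = list(slow)
        # k steps forward with the inner optimizer.
        for _ in range(k):
            g = grad(fast)
            fast = [w - lr * gi for w, gi in zip(fast, g)]
        # 1 step back: move the slow weights part of the way toward
        # where the fast weights ended up.
        slow = [s + alpha * (f - s) for s, f in zip(slow, fast)]
    return slow


def quadratic_grad(w):
    # Gradient of (w0 - 3)^2 + (w1 + 1)^2, a toy objective with
    # its minimum at (3, -1).
    return [2.0 * (w[0] - 3.0), 2.0 * (w[1] + 1.0)]


w = lookahead_sgd(quadratic_grad, [0.0, 0.0])
```

On this toy quadratic the slow weights approach the minimum at (3, -1). The example says nothing about the stability or variance claims in the abstract, which concern stochastic training of neural networks.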


Learning Domain Invariant Representations in Goal-conditioned Block MDPs

Han, Zheng, Chan et al. 2021

Preprint


Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve

Bae, Zhang, Ruan et al. 2022

Preprint


