This paper considers a new variant of AMSGrad called Optimistic-AMSGrad. AMSGrad [31] is a popular adaptive gradient-based optimization algorithm that is widely used in training deep neural networks. The new variant assumes that minibatch gradients in consecutive iterations have some underlying structure that makes them sequentially predictable. By exploiting this predictability together with ideas from optimistic online learning, the proposed algorithm accelerates convergence and enjoys a tighter regret bound. We evaluate Optimistic-AMSGrad and AMSGrad in terms of various performance measures (i.e., training loss, testing loss, and classification accuracy on training/testing data), and the results demonstrate that Optimistic-AMSGrad improves upon AMSGrad. The code for reproducing the experiments is available at https://github.com/jimwang123/optimistic-amsgrad.
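To make the optimistic idea concrete, below is a minimal sketch, not the paper's exact algorithm, of how an optimistic step can be layered on an AMSGrad-style update: after the usual step with the observed gradient, the optimizer takes an additional provisional step using a guess of the *next* gradient. Here the guess simply reuses the most recent gradient; the function name, the `g_pred` hint, and the state layout are all illustrative assumptions, not identifiers from the paper or its repository.

```python
import numpy as np

def optimistic_amsgrad_step(w_half, state, grad, lr=1e-3,
                            beta1=0.9, beta2=0.999, eps=1e-8):
    """One illustrative Optimistic-AMSGrad-style step (a sketch).

    `state` holds the first moment m, second moment v, and its running
    maximum v_hat. The prediction of the next gradient is simply the
    current one, a natural choice when gradients change slowly across
    consecutive minibatches.
    """
    m, v, v_hat = state
    # Standard AMSGrad moment updates with the observed gradient.
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad ** 2
    v_hat = np.maximum(v_hat, v)  # the max keeps per-coordinate steps non-increasing
    denom = np.sqrt(v_hat) + eps

    # AMSGrad step from the intermediate iterate.
    w_next_half = w_half - lr * m / denom

    # Optimistic step: move ahead using the predicted next gradient
    # (assumption: reuse `grad` as the hint for the next minibatch gradient).
    g_pred = grad
    m_pred = beta1 * m + (1 - beta1) * g_pred
    w_next = w_next_half - lr * m_pred / denom

    return w_next, w_next_half, (m, v, v_hat)
```

The two-step structure is what distinguishes the optimistic variant: when the hint is close to the gradient that actually arrives, the provisional step has already done most of the work, which is the intuition behind the accelerated convergence and tighter regret bound claimed above; when the hint is poor, the subsequent corrected step limits the damage.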