Elena Sukmanova scite author profile

Elena Sukmanova

2Publications

1Citation Statement Received

24Citation Statements Given

How they've been cited

How they cite others

Affiliations

National Research University Higher School of Economics

Publications

Order By: Most citations

Guided Layer-Wise Learning for Deep Models Using Side Information

Sulimov

Sukmanova

Chereshnev

et al. 2020

View full text Add to dashboard Cite

Training of deep models for classification tasks is hindered by local minima problems and vanishing gradients, while unsupervised layerwise pretraining does not exploit information from class labels. Here, we propose a new regularization technique, called diversifying regularization (DR), which applies a penalty on hidden units at any layer if they obtain similar features for different types of data. For generative models, DR is defined as divergence over the variational posteriori distributions and included in the maximum likelihood estimation as a prior. Thus, DR includes class label information for greedy pretraining of deep belief networks which result in a better weight initialization for fine-tuning methods. On the other hand, for discriminative training of deep neural networks, DR is defined as a distance over the features and included in the learning objective. With our experimental tests, we show that DR can help the backpropagation to cope with vanishing gradient problems and to provide faster convergence and smaller generalization errors.

show abstract

Guided Layer-wise Learning for Deep Models using Side Information

Sulimov¹,

Sukmanova²,

Chereshnev³

et al. 2019

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Elena Sukmanova

Guided Layer-Wise Learning for Deep Models Using Side Information

Guided Layer-wise Learning for Deep Models using Side Information

Contact Info

Product

Resources

About