Abstract. Given an irreducible subshift of finite type X, a subshift Y , a factor map π : X → Y , and an ergodic invariant measure ν on Y , there can exist more than one ergodic measure on X which projects to ν and has maximal entropy among all measures in the fiber, but there is an explicit bound on the number of such maximal entropy preimages.
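In symbols (the notation below is added here for clarity and is not taken from the abstract), the object studied is the fiber of invariant measures over ν and the entropy maximization within it:

\[
\mathcal{F}(\nu) = \{\, \mu \in M(X,\sigma) : \pi_*\mu = \nu \,\}, \qquad \sup_{\mu \in \mathcal{F}(\nu)} h(\mu),
\]

and the bound mentioned above concerns the number of ergodic µ attaining this supremum.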
Training with adversarial examples, which are generated by adding small but worst-case perturbations to input examples, has recently been shown to improve the generalization performance of neural networks. In contrast to such perturbations of individual inputs, this paper introduces adversarial dropout: a minimal set of dropped units chosen to maximize the divergence between (1) the training supervision and (2) the output of the network with those units dropped. The identified adversarial dropouts are used to automatically reconfigure the neural network during training, and we demonstrate that training simultaneously on the original and the reconfigured network improves the generalization performance of supervised and semi-supervised learning tasks on MNIST, SVHN, and CIFAR-10. We analyze the trained models to identify the reasons for the improvement and find that adversarial dropout increases the sparsity of the network more than standard dropout does. Finally, we prove that adversarial dropout corresponds to a regularization term whose strength is controlled by a rank-valued hyper-parameter, in contrast to the continuous-valued coefficient that usually specifies the strength of a regularizer.
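A minimal sketch of how such an adversarial dropout mask could be searched for, assuming a PyTorch-style model split into a feature extractor (producing hidden activations h) and a classifier head; the function and argument names and the greedy flip rule below are illustrative assumptions, not the authors' exact procedure:

```python
import torch
import torch.nn.functional as F

def adversarial_dropout_mask(h, head, target_probs, base_mask, budget=0.05):
    """Return a dropout mask close to base_mask that increases the divergence
    between the supervision (target_probs) and the masked network output.
    All names here are illustrative; `budget` caps how many units may flip."""
    mask = base_mask.clone().requires_grad_(True)
    logits = head(h * mask)                      # forward pass with a relaxed mask
    div = F.kl_div(F.log_softmax(logits, dim=1),
                   target_probs, reduction="batchmean")
    grad, = torch.autograd.grad(div, mask)       # sensitivity of the divergence to each unit's mask
    # Gain from flipping each unit: turning a unit off helps when grad < 0,
    # turning it on helps when grad > 0.
    flip_gain = torch.where(base_mask > 0, -grad, grad)
    k = max(1, int(budget * mask.shape[1]))      # rank-valued budget on the number of flips
    idx = flip_gain.topk(k, dim=1).indices
    adv_mask = base_mask.clone()
    adv_mask.scatter_(1, idx, 1.0 - adv_mask.gather(1, idx))  # flip the selected units
    return adv_mask.detach()
```

A training step would then, following the description above, combine the usual loss computed under base_mask with a divergence term computed under the returned adversarial mask; the fraction budget plays the role of the rank-valued hyper-parameter mentioned at the end of the abstract.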
Sujin Shin (2001). Measures that maximize weighted entropy for factor maps between subshifts of finite type. http://journals.cambridge.org/abstract_S0143385701001584
Abstract. Let X, Y be topologically mixing subshifts of finite type and π : X → Y a factor map. For each α ≥ 0, the weighted entropy function φ_α is defined by φ_α(µ) = h(µ) + αh(πµ) for each invariant measure µ on X. To investigate whether, for a given α > 0, there is a unique measure achieving sup_µ φ_α(µ), we use compensation functions, a concept first considered by Boyle and Tuncel and developed further by Walters. We prove that if there is a compensation function of a certain kind (more general than summable variation), then for each α ≥ 0 the shift-invariant measure maximizing the weighted entropy is unique. In particular, if the compensation function is locally constant, then the unique measure is Markov and mixing. We classify the 1-block codes from a 3-symbol subshift of finite type to a 2-symbol subshift in terms of which types of compensation functions exist or fail to exist, providing examples of factor maps that do and do not satisfy the hypothesis. We also study general properties of compensation functions and of the maximal weighted entropy as a function of the weight.
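Stated compactly (writing M(X) for the set of shift-invariant Borel probability measures on X, a symbol not fixed in the abstract itself), the problem is

\[
\varphi_\alpha(\mu) = h(\mu) + \alpha\, h(\pi\mu), \qquad \text{maximize } \varphi_\alpha \text{ over } \mu \in M(X), \quad \alpha \ge 0.
\]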
Abstract. Let (X, S) and (Y, T) be topological dynamical systems and π : X → Y a factor map. For a factor code between subshifts of finite type, we analyze the associated relative entropy function and give a necessary condition for the existence of saturated compensation functions. Necessary and sufficient conditions for a map to be a saturated compensation function are also provided.
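For orientation, Walters' definition of a compensation function, to which these conditions refer, can be recalled as follows (background from the literature, not part of the abstract): a continuous F : X → R is a compensation function for π if

\[
P_X(F + \phi \circ \pi) = P_Y(\phi) \quad \text{for all } \phi \in C(Y),
\]

where P denotes topological pressure, and F is called saturated if F = G ∘ π for some G ∈ C(Y).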