Yoon Kim scite author profile

Yoon Kim

5 Publications

10,550 Citation Statements Received

51 Citation Statements Given

How they've been cited

14,534

10,484

How they cite others

Affiliations

University of Seoul, Seoul National University, Seoul National University Hospital

Publications

Convolutional Neural Networks for Sentence Classification

Kim

2014

11,136

7,297

We report on a series of experiments with convolutional neural networks (CNN) trained on top of pre-trained word vectors for sentence-level classification tasks. We show that a simple CNN with little hyperparameter tuning and static vectors achieves excellent results on multiple benchmarks. Learning task-specific vectors through fine-tuning offers further gains in performance. We additionally propose a simple modification to the architecture to allow for the use of both task-specific and static vectors. The CNN models discussed herein improve upon the state of the art on 4 out of 7 tasks, which include sentiment analysis and question classification.
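The abstract above does not spell out the architecture, so the following is only a minimal sketch of the general idea: slide a filter over windows of pre-trained word vectors and pool the responses into one feature per filter. The max-over-time pooling, the ReLU, the zero padding, the toy two-dimensional vectors, and every function name here are assumptions made for illustration, not the authors' code.

```python
def conv_max_over_time(embeddings, filt, bias=0.0):
    """Apply one filter to every window of consecutive word vectors,
    pass each response through ReLU, and keep the largest value."""
    width = len(filt)
    dim = len(filt[0])
    # Assumption: pad short sentences with zero vectors so one window exists.
    padded = list(embeddings) + [[0.0] * dim] * max(0, width - len(embeddings))
    responses = []
    for start in range(len(padded) - width + 1):
        window = padded[start:start + width]
        act = bias + sum(
            w * x
            for filt_row, vec in zip(filt, window)
            for w, x in zip(filt_row, vec)
        )
        responses.append(max(0.0, act))
    return max(responses)


def sentence_features(tokens, vectors, filters):
    """Map a tokenized sentence to one pooled feature per filter."""
    embeddings = [vectors[t] for t in tokens]
    return [conv_max_over_time(embeddings, f) for f in filters]


# Toy static word vectors (hypothetical values, not real embeddings).
toy_vectors = {"good": [1.0, 0.0], "bad": [0.0, 1.0], "movie": [0.5, 0.5]}
# Two width-2 filters: one responds to "good", the other to "bad".
toy_filters = [
    [[1.0, 0.0], [0.0, 0.0]],
    [[0.0, 1.0], [0.0, 0.0]],
]

features = sentence_features(["good", "movie"], toy_vectors, toy_filters)
```

In a full classifier these pooled features would feed a final classification layer. Keeping `toy_vectors` fixed during training corresponds to the "static" setting mentioned in the abstract, while updating them corresponds to fine-tuning.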

Convolutional Neural Networks for Sentence Classification

Kim

2014

Preprint

1,042

1,196

OpenNMT: Open-Source Toolkit for Neural Machine Translation

et al. 2017

We describe an open-source toolkit for neural machine translation (NMT). The toolkit prioritizes efficiency, modularity, and extensibility with the goal of supporting NMT research into model architectures, feature representations, and source modalities, while maintaining competitive performance and reasonable training requirements. The toolkit consists of modeling and translation support, as well as detailed pedagogical documentation about the underlying techniques.

Sequence-Level Knowledge Distillation

2016

Neural machine translation (NMT) offers a novel alternative formulation of translation that is potentially simpler than statistical approaches. However, to reach competitive performance, NMT models need to be exceedingly large. In this paper we consider applying knowledge distillation approaches (Bucila et al., 2006; Hinton et al., 2015) that have proven successful for reducing the size of neural models in other domains to the problem of NMT. We demonstrate that standard knowledge distillation applied to word-level prediction can be effective for NMT, and also introduce two novel sequence-level versions of knowledge distillation that further improve performance, and somewhat surprisingly, seem to eliminate the need for beam search (even when applied on the original teacher model). Our best student model runs 10 times faster than its state-of-the-art teacher with little loss in performance. It is also significantly better than a baseline model trained without knowledge distillation: by 4.2/1.7 BLEU with greedy decoding/beam search. Applying weight pruning on top of knowledge distillation results in a student model that has 13× fewer parameters than the original teacher model, with a decrease of 0.4 BLEU.
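The two flavors of distillation named in the abstract can be sketched roughly as follows. This is an illustrative approximation under stated assumptions, not the paper's implementation. The word-level loss is written as the cross-entropy between the teacher's and the student's per-position distributions. The sequence-level variant is reduced to its simplest reading, training the student on the teacher's own decoded outputs. `teacher_decode` is a hypothetical callable that stands in for whatever decoding procedure the teacher uses.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def word_level_kd_loss(teacher_logits, student_logits):
    """Average cross-entropy between the teacher's and the student's
    vocabulary distributions, one distribution per target position."""
    loss = 0.0
    for t_row, s_row in zip(teacher_logits, student_logits):
        q = softmax(t_row)  # teacher distribution (soft targets)
        p = softmax(s_row)  # student distribution
        loss -= sum(qi * math.log(pi) for qi, pi in zip(q, p))
    return loss / len(teacher_logits)


def sequence_level_targets(sources, teacher_decode):
    """Build a distilled training set by pairing each source sentence
    with the teacher's decoded translation (teacher_decode is hypothetical)."""
    return [(src, teacher_decode(src)) for src in sources]


# Toy logits over a two-word vocabulary at a single target position.
teacher = [[2.0, 0.0]]
matching_student = [[2.0, 0.0]]
mismatched_student = [[0.0, 2.0]]
```

With these toy values, `word_level_kd_loss(teacher, matching_student)` is smaller than `word_level_kd_loss(teacher, mismatched_student)`, since the loss reaches its minimum when the student reproduces the teacher's distribution.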

Temporal Analysis of Language through Neural Language Models

Kim, Chiu, Hanaki et al. 2014

222

282

We provide a method for automatically detecting change in language across time through a chronologically trained neural language model. We train the model on the Google Books Ngram corpus to obtain word vector representations specific to each year, and identify words that have changed significantly from 1900 to 2009. The model identifies words such as cell and gay as having changed during that time period. The model simultaneously identifies the specific years during which such words underwent change.
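One plausible way to turn year-specific word vectors into a list of changed words is to compare each word's vector at the start and end of the period and rank words by how far they moved. The sketch below assumes cosine similarity as the comparison measure. The two-dimensional vectors are made-up toy values and the function names are hypothetical. Nothing here reproduces the model or the data described in the abstract.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def most_changed(vectors_by_year, start, end, top_k=1):
    """Rank words present in both years by ascending similarity between
    their start-year and end-year vectors (lowest similarity first)."""
    early, late = vectors_by_year[start], vectors_by_year[end]
    shared = sorted(set(early) & set(late))
    ranked = sorted(shared, key=lambda w: cosine(early[w], late[w]))
    return ranked[:top_k]


# Toy year-specific vectors (invented for illustration only).
toy_vectors_by_year = {
    1900: {"cell": [1.0, 0.0], "tree": [1.0, 0.1]},
    2009: {"cell": [0.0, 1.0], "tree": [1.0, 0.0]},
}

changed = most_changed(toy_vectors_by_year, 1900, 2009, top_k=1)
```

Running the same comparison between consecutive years, rather than only the endpoints, would be one way to locate the specific years in which a word's usage shifted.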
