Ran Wang scite author profile

Current predominant neural machine translation (NMT) models often have a deep structure with large amounts of parameters, making these models hard to train and easily suffering from over-fitting. A common practice is to utilize a validation set to evaluate the training process and select the best checkpoint. Average and ensemble techniques on checkpoints can lead to further performance improvement. However, as these methods do not affect the training process, the system performance is restricted to the checkpoints generated in the original training procedure. In contrast, we propose an online knowledge distillation method. Our method on-the-fly generates a teacher model from checkpoints, guiding the training process to obtain better performance. Experiments on several datasets and language pairs show steady improvement over a strong self-attention-based baseline system. We also provide analysis on data-limited setting against over-fitting. Furthermore, our method leads to an improvement on a machine reading experiment as well.

show abstract

Assessing effects of economic factors on construction cost estimation using deep neural networks

Wang

Asghari

Cheung

et al. 2022

Automation in Construction

View full text Add to dashboard Cite

A novel approach for segmentation of touching characters on the license plate

Wang

Liu

et al. 2013

View full text Add to dashboard Cite

A real-time tennis level evaluation and strokes classification system based on the Internet of Things

Fan

et al. 2022

Internet of Things

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ran Wang

A review of artificial fish swarm algorithms: recent advances and applications

Online Distilling from Checkpoints for Neural Machine Translation

Assessing effects of economic factors on construction cost estimation using deep neural networks

A novel approach for segmentation of touching characters on the license plate

A real-time tennis level evaluation and strokes classification system based on the Internet of Things

Contact Info

Product

Resources

About