Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Main Conference (HLT-NAACL 2006)
DOI: 10.3115/1220835.1220855
Effective self-training for parsing

Abstract: We present a simple, but surprisingly effective, method of self-training a two-phase parser-reranker system using readily available unlabeled data. We show that this type of bootstrapping is possible for parsing when the bootstrapped parses are processed by a discriminative reranker. Our improved model achieves an f-score of 92.1%, an absolute 1.1% improvement (12% error reduction) over the previous best result for Wall Street Journal parsing. Finally, we provide some analysis to better understand the phenomenon.
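The abstract describes a two-stage pipeline: a generative first-stage parser produces n-best parses, a discriminative reranker selects among them, and the reranker's output on unlabeled text is folded back into the training data. The following Python sketch illustrates that loop under assumed interfaces; the parser and reranker objects and their methods (train, parse_nbest, rerank) are hypothetical placeholders, not the authors' implementation (which used the Charniak parser with the Charniak-Johnson reranker).

def self_train(labeled_trees, unlabeled_sents, parser, reranker, n_best=50):
    # Step 1: train the first-stage parser on the labeled treebank (WSJ).
    parser.train(labeled_trees)

    # Step 2: parse the unlabeled sentences and let the discriminative
    # reranker choose the best parse from each n-best list.
    self_labeled = []
    for sent in unlabeled_sents:
        candidates = parser.parse_nbest(sent, n=n_best)
        self_labeled.append(reranker.rerank(candidates))

    # Step 3: retrain the first-stage parser on the union of gold and
    # self-labeled parses. The paper's key finding is that this helps
    # when the added parses come from the reranker, whereas naive
    # self-training on the parser's own 1-best output does not.
    parser.train(labeled_trees + self_labeled)
    return parser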

Cited by 397 publications (339 citation statements)
References 16 publications (24 reference statements)
“…The difficulty of providing sufficient supervision has motivated work on semi-supervised and unsupervised learning for many of these tasks (McClosky et al., 2006; Spitkovsky et al., 2010; Subramanya et al., 2010; Stratos and Collins, 2015; Marinho et al., 2016; Tran et al., 2016), including several that also used autoencoders (Ammar et al., 2014; Lin et al., 2015; Miao and Blunsom, 2016; Kociský et al., 2016; Cheng et al., 2017). In this paper we expand on these works and suggest a neural CRF autoencoder that can leverage both labeled and unlabeled data.…”
Section: Related Work (citation type: mentioning)
confidence: 99%
“…Although naively adding self-labeled material to extend training data is normally not successful, there have been successful variants of self-learning for parsing as well. For instance, in [16] self-learning is used to improve a two-phase parser-reranker, with very good results for the classical Wall Street Journal parsing task.…”
Section: Previous Research (citation type: mentioning)
confidence: 99%
“…McClosky et al. (2006a) introduce self-training techniques for two-step parsers. In McClosky et al. (2006b), these methods are then used to adapt a parser trained on Wall Street Journal data to a new domain, without using labeled data from that domain.…”
Section: Translation Examples (citation type: mentioning)
confidence: 99%