Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2016
DOI: 10.18653/v1/p16-1040
Transition-Based Neural Word Segmentation

Abstract: Character-based and word-based methods are the two main types of statistical models for Chinese word segmentation, the former exploiting sequence labeling models over characters and the latter typically exploiting a transition-based model, with the advantage that word-level features can be easily utilized. Neural models have been exploited for character-based Chinese word segmentation, giving high accuracies by making use of external character embeddings, yet requiring less feature engineering. In this paper, we …
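The transition-based view of word segmentation mentioned in the abstract can be sketched with two actions, SEP (start a new word) and APP (append the character to the current word). This is a minimal illustration of the transition mechanics only; the paper scores actions with a neural model, which is omitted here, and the action names are common conventions rather than quotes from the paper.

```python
# Minimal sketch of a transition system for word segmentation.
# SEP starts a new word; APP appends the character to the last word.
SEP, APP = "SEP", "APP"

def segment(chars, actions):
    """Apply a SEP/APP action sequence to a list of characters."""
    words = []
    for ch, act in zip(chars, actions):
        if act == SEP or not words:
            words.append(ch)      # start a new word
        else:
            words[-1] += ch       # append to the current word
    return words

def oracle_actions(words):
    """Derive the gold action sequence from a segmented sentence."""
    actions = []
    for word in words:
        actions.append(SEP)
        actions.extend([APP] * (len(word) - 1))
    return actions

gold = ["中国", "外企", "业务"]
chars = [c for w in gold for c in w]
assert segment(chars, oracle_actions(gold)) == gold
```

Because actions are predicted left to right over the partially built output, word-level features (e.g. the last complete word) are available at each step, which is the advantage the abstract attributes to word-based models.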


Cited by 109 publications (106 citation statements)
References 28 publications (51 reference statements)
“…But both of them rely heavily on massive handcrafted features. Contemporary to this work, some neural models (Zhang et al., 2016a; Liu et al., 2016) … Another notable exception is Ma and Hinrichs (2015), which is also an embedding-based model, but which models CWS as configuration-action matching. However, again, this method only uses the context information within limited-sized windows.…”
Section: Related Work
confidence: 99%
“…Recently, deep learning methods have been widely used in many natural language processing tasks, such as named entity recognition (Lample et al., 2016), zero pronoun resolution (Yin et al., 2017) and word segmentation (Zhang et al., 2016). The effectiveness of neural features has also been studied for this framework (Watanabe and Sumita, 2015; Andor et al., 2016).…”
Section: Related Work
confidence: 99%
“…The neural network with its non-linearity is in theory able to learn bigrams by conjoining unigrams, but it has been shown that explicitly using character bigram features leads to better accuracy (Zhang et al., 2016; Pei et al., 2014). Zhang et al. (2016) suggest that embedding manually specified feature conjunctions further improves accuracy ('Zhang et al. (2016)-combo' in Table 4). [Footnote 4: Our calculation of BTS FLOPs is very conservative and favorable to BTS, as detailed in the supplementary material.]…”
Section: Segmentation
confidence: 99%
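The last citation statement notes that explicit character bigram features outperform leaving unigram conjunctions to the network. A minimal sketch of what such features look like for a sequence-labeling segmenter is below; the feature names and padding symbol are hypothetical, not taken from the cited papers.

```python
# Illustrative character unigram/bigram features around position i,
# as commonly used in sequence-labeling word segmenters.
def char_features(chars, i, pad="#"):
    """Return a small feature dict for the character at index i."""
    left = chars[i - 1] if i > 0 else pad
    right = chars[i + 1] if i + 1 < len(chars) else pad
    cur = chars[i]
    return {
        "uni[-1]": left,
        "uni[0]": cur,
        "uni[+1]": right,
        "bi[-1,0]": left + cur,    # explicit bigram conjunction
        "bi[0,+1]": cur + right,
    }

feats = char_features(list("中国人"), 1)
assert feats["bi[-1,0]"] == "中国"
assert feats["bi[0,+1]"] == "国人"
```

In a neural model these bigram strings would each be mapped to an embedding, rather than relying on the network to conjoin the two unigram embeddings on its own.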