Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers (NAACL '06), 2006
DOI: 10.3115/1614049.1614098
Subword-based tagging by conditional random fields for Chinese word segmentation

Abstract: We proposed two approaches to improve Chinese word segmentation: a subword-based tagging and a confidence measure approach. We found the former achieved better performance than the existing character-based tagging, and the latter improved segmentation further by combining the former with a dictionary-based segmentation. In addition, the latter can be used to balance out-of-vocabulary rates and in-vocabulary rates. By these techniques we achieved higher F-scores in CITYU, PKU and MSR corpora than the best resul…

Cited by 33 publications (26 citation statements)
References 2 publications
“…5 https://code.google.com/p/word2vec/

Model                        PKU    MSRA
Best05                       95.0   96.0
Best05 (Tseng et al, 2005)   95.0   96.4
(Zhang et al, 2006)          95.1   97.1
(Zhang and Clark, 2007)      94.5   97.2
                             95.2   97.3
(Sun et al, 2012)            95.4   97.4
(Zhang et al, 2013)          96…

A very common feature in Chinese word segmentation is the character bigram feature. Formally, at the i-th character of a sentence c[1:n], the bigram features are c_k c_{k+1} (i − 3 < k < i + 2).…”
Section: Minimal Feature Engineering
confidence: 99%
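The bigram window in the quoted passage (features c_k c_{k+1} for i − 3 < k < i + 2, i.e. k ∈ {i−2, …, i+1}) can be sketched as follows; the function name and the `#` padding sentinel are illustrative assumptions, not taken from the cited papers.

```python
def char_bigram_features(chars, i, pad="#"):
    """Character bigram features c_k c_{k+1} for k in {i-2, ..., i+1}.

    Positions outside the sentence are padded with a sentinel character,
    a common convention in feature-template implementations.
    """
    n = len(chars)

    def at(k):
        # Return the character at position k, or the pad symbol if out of range.
        return chars[k] if 0 <= k < n else pad

    return [at(k) + at(k + 1) for k in range(i - 2, i + 2)]

# Example with a toy 5-character "sentence", features at position i = 2:
print(char_bigram_features(list("ABCDE"), 2))  # ['AB', 'BC', 'CD', 'DE']
```

At sentence boundaries the sentinel fills in, e.g. at i = 0 the features become `['##', '#A', 'AB', 'BC']`.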
“…There are two primary classes of models: character-based, where the foundational units for processing are individual Chinese characters (Xue, 2003; Tseng et al, 2005; Zhang et al, 2006; Wang et al, 2010), and word-based, where the units are full words based on some dictionary or training lexicon (Andrew, 2006; Zhang and Clark, 2007). Sun (2010) details their respective theoretical strengths: character-based approaches better model the internal compositional structure of words and are therefore more effective at inducing new OOV words; word-based approaches are better at reproducing the words of the training lexicon and can capture information from significantly larger contextual spans.…”
Section: Introduction
confidence: 99%
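Character-based models such as those cited above cast segmentation as per-character sequence labeling. A minimal sketch of the widely used BMES encoding follows; the label names reflect common convention and may differ from the exact tag sets used in the cited papers.

```python
def words_to_bmes(words):
    """Convert a segmented sentence (a list of words) to per-character tags:
    S = single-character word; B/M/E = begin/middle/end of a multi-character word."""
    tags = []
    for w in words:
        if len(w) == 1:
            tags.append("S")
        else:
            tags.extend(["B"] + ["M"] * (len(w) - 2) + ["E"])
    return tags

# Example: a segmentation with two two-character words and one single-character word.
print(words_to_bmes(["北京", "大学", "生"]))  # ['B', 'E', 'B', 'E', 'S']
```

A tagger (e.g. a CRF) predicts one such label per character, and decoding the label sequence recovers the word boundaries.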
“…Another approach has been that of Cherry and Guo (2015) and Peng and Dredze (2015), who relied on training unsupervised lexical embeddings in place of these upstream systems and achieved state-of-the-art results for English and Chinese social media, respectively. The same approach was also found helpful for NER in the news domain (Collobert and Weston, 2008; Passos et al, 2014). In Asian languages like Chinese, Japanese and Korean, word segmentation is a critical first step for many tasks (Gao et al, 2005; Zhang et al, 2006; Mao et al, 2008). Peng and Dredze (2015) showed the value of word segmentation to Chinese NER in social media by using character positional embeddings, which encoded word segmentation information.…”
Section: Introduction
confidence: 99%