2011
DOI: 10.1007/s11390-011-9411-z
|View full text |Cite
|
Sign up to set email alerts
|

Chinese New Word Identification: A Latent Discriminative Model with Global Features

Abstract: Sun X, Huang DG, Song HY et al. Chinese new word identification: a latent discriminative model with global features. AbstractChinese new words are particularly problematic in Chinese natural language processing. With the fast development of Internet and information explosion, it is impossible to get a complete system lexicon for applications in Chinese natural language processing, as new words out of dictionaries are always being created. The procedure of new words identification and POS tagging are usually se… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 14 publications
0
5
0
Order By: Relevance
“…They transformed this problem into an equivalent sequence tagging problem. Some other researchers [11][12] proposed their methods along this direction. They also used the conditional random filed model, but they improved the feature computation processes to get better results.…”
Section: Combination Of the Rule-based Methods And The Statistical-mentioning
confidence: 99%
See 1 more Smart Citation
“…They transformed this problem into an equivalent sequence tagging problem. Some other researchers [11][12] proposed their methods along this direction. They also used the conditional random filed model, but they improved the feature computation processes to get better results.…”
Section: Combination Of the Rule-based Methods And The Statistical-mentioning
confidence: 99%
“…We regard the new words extraction process as a tagging process like other researchers do in [10][11][12][13], and since the CRF model has archived excellent results in [10][11][12][13], we also choose it as the tagging model.…”
Section: A Overall Frameworkmentioning
confidence: 99%
“…There are also many hybrid methods combined statistical metrics with linguistic knowledge and machine Learning algorithms, such as Part-of-Speech filters (Smadja, 1994;Asanee, 1997), roles tagging based (Zhang et al, 2002), syntactic discriminators (Chen & Ma 2002), max-margin Markov networks (Qiao and Sun, 2010;Li and Chang, 2010), Unsupervised Learning Strategy (Sun et al, 2004), Latent Discriminative Model (Sun et al, 2011), boostingbased ensemble learning (TeCho et al, 2012). But POS filters, roles tagging, machine learning algorithms does not work for Tibetan UWI.…”
Section: Related Workmentioning
confidence: 99%
“…Each domain category contains positive and negative documents. We use our Chinese lexical analysis tools [13], [14] to extracted all the Multiword Expressions of two words that match the three patterns we predefined: a noun (subject) and an adjective (predicate), a verb and a noun, an adverb and an adjective, from "ChnSentiCorp". Then all the Multiword Expressions are manually labeled with corresponding semantic polarity labels(negative, neutral or positive).…”
Section: Experimental Settingsmentioning
confidence: 99%