Proceedings of the 2006 SIAM International Conference on Data Mining 2006
DOI: 10.1137/1.9781611972764.61
|View full text |Cite
|
Sign up to set email alerts
|

A Systematic Cross-Comparison of Sequence Classifiers

Abstract: In the CoNLL 2003 NER shared task, more than two thirds of the submitted systems used a feature-rich representation of the task. Most of them used the maximum entropy principle to combine the features together. Others used large margin linear classifiers, such as SVM and RRM. In this paper, we compare several common classifiers under exactly the same conditions, demonstrating that the ranking of systems in the shared task is due to feature selection and other causes and not due to inherent qualities of the alg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
9
0

Year Published

2007
2007
2012
2012

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 11 publications
(9 citation statements)
references
References 10 publications
0
9
0
Order By: Relevance
“…CRFs are a statistical sequence modeling framework that is reported to outperform other popular learning models, including MaxEnt (maximum entropy), in a number of natural language processing applications [35]. CRFs modeling is first applied to Chinese word segmentation in [31], treating it as a binary decision task to determine whether a character is the beginning of a word in a sentence.…”
Section: Baseline Systemmentioning
confidence: 99%
“…CRFs are a statistical sequence modeling framework that is reported to outperform other popular learning models, including MaxEnt (maximum entropy), in a number of natural language processing applications [35]. CRFs modeling is first applied to Chinese word segmentation in [31], treating it as a binary decision task to determine whether a character is the beginning of a word in a sentence.…”
Section: Baseline Systemmentioning
confidence: 99%
“…CRFs often outperform maximum entropy Markov model (MEMM) [2], another popular structure learning method. The main reason is that, among directed graphical models, CRFs do not suffer from the label bias problem as much as MEMM and other conditional Markov models do [1].…”
Section: Introductionmentioning
confidence: 99%
“…It often outperforms a maximum entropy (MaxEnt) model [2] , another popular machine learning method in NLP. The main reason is that, among directed graphical models, CRF does not suffer from the label bias problem as much as the MaxEnt and other conditional Markov models do [1] .…”
Section: Introductionmentioning
confidence: 99%
“…(2) L-BFGS [4] is a typical algorithm for CRFs training. To label an unseen sequence, we compute the most likely labeling Y * as in (3) by Viterbi algorithm [5] .…”
Section: Introductionmentioning
confidence: 99%