Proceedings of the ACL-02 Workshop on Natural Language Processing in the Biomedical Domain - 2002
DOI: 10.3115/1118149.1118150
|View full text |Cite
|
Sign up to set email alerts
|

Tuning support vector machines for biomedical named entity recognition

Abstract: We explore the use of Support Vector Machines (SVMs) for biomedical named entity recognition. To make the SVM training with the available largest corpus -the GENIA corpus -tractable, we propose to split the non-entity class into sub-classes, using part-of-speech information. In addition, we explore new features such as word cache and the states of an HMM trained by unsupervised learning. Experiments on the GENIA corpus show that our class splitting technique not only enables the training with the GENIA corpus … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
154
1
1

Year Published

2005
2005
2020
2020

Publication Types

Select...
5
2
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 183 publications
(156 citation statements)
references
References 15 publications
(13 reference statements)
0
154
1
1
Order By: Relevance
“…In this regards, few researchers have proposed the utilization of named entity in their own particular way. The authors [13], [17] has gathered most common prefix and suffix from training data. Whilst [12], [18] the author gathered 23 categories of prefix and suffix data using statistical methods as their own distribution.…”
Section: ) Affixesmentioning
confidence: 99%
“…In this regards, few researchers have proposed the utilization of named entity in their own particular way. The authors [13], [17] has gathered most common prefix and suffix from training data. Whilst [12], [18] the author gathered 23 categories of prefix and suffix data using statistical methods as their own distribution.…”
Section: ) Affixesmentioning
confidence: 99%
“…Kazama et al [25] used SVMs to identify proteins, DNA, cell types, cell lines and lipids, with a F-measure of 73.6%. Tsai et al [26] constructed a NER framework with CRF.…”
Section: The Machine-learning Ner Approachmentioning
confidence: 99%
“…The second study contrasts two models in the combined identification and classification task (Kazama et al, 2002). Word frequency, part-of-speech tags, inflectional morphology and lexical inclusion are used as input to a SVM and Maximum Entropy (ME) model.…”
Section: Related Workmentioning
confidence: 99%