Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and Its Applications - JNLPBA '04 2004
DOI: 10.3115/1567594.1567614
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting context for biomedical entity recognition

Abstract: We describe a machine learning system for the recognition of names in biomedical texts. The system makes extensive use of local and syntactic features within the text, as well as external resources including the web and gazetteers. It achieves an Fscore of 70% on the Coling 2004 NLPBA/BioNLP shared task of identifying five biomedical named entities in the GENIA corpus.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
86
0

Year Published

2005
2005
2020
2020

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 102 publications
(86 citation statements)
references
References 7 publications
0
86
0
Order By: Relevance
“…We selected one of the feature sets, F2, as an orthographic feature that was utilized as the best indicator in biomedical NER. There are many studies based on orthographic features [8][9][10][11][12][13][14][15][16][17][18][19][20] and we found the best to be [11]. Another set of features, F1, was extracted as a context-based feature set with lexicons, which provides domain knowledge in a consistent manner.…”
Section: Data Preprocessing and Feature Processing For Biomedical Nermentioning
confidence: 99%
See 1 more Smart Citation
“…We selected one of the feature sets, F2, as an orthographic feature that was utilized as the best indicator in biomedical NER. There are many studies based on orthographic features [8][9][10][11][12][13][14][15][16][17][18][19][20] and we found the best to be [11]. Another set of features, F1, was extracted as a context-based feature set with lexicons, which provides domain knowledge in a consistent manner.…”
Section: Data Preprocessing and Feature Processing For Biomedical Nermentioning
confidence: 99%
“…Since the machine learning approach was adopted, significant progress in biomedical NER has been achieved with methods like the Markov Model [7], the Support Vector Machine (SVM) [8][9][10][11][12][13] the Maximum Entropy Markov Model [14], and Conditional Random Fields (CRF) [15][16][17][18][19].…”
Section: Related Workmentioning
confidence: 99%
“…Head nouns were used in [37] and [38]; word lists that are highly associated to classes are extracted as lexicons in [52]; keyword lexicons are statistically computed in [39]; keyword and boundary lists in [50].…”
Section: Jnlpba'04 Corpus and Current Solutionsmentioning
confidence: 99%
“…In 2004, the JNLPBA [20] open challenge task for bio-NER simplified the 36 entity classes in the GENIA corpus [21] and used only five classes, namely protein, DNA, RNA, cell line, and cell type, to evaluate the performance of the participating systems. Unlike the earliest rule-based NER system [14], the following four types of classification models were applied by the participating teams: Support Vector Machines (SVMs) [16,34], Hidden Markov Models (HMMs) [50], Maximum Entropy Markov Models (MEMMs) [13] and Conditional Random Fields (CRFs) [38]. The most frequently applied models were SVMs.…”
Section: Biological Named Entity Recognitionmentioning
confidence: 99%