2011
DOI: 10.1093/bioinformatics/btr452
|View full text |Cite
|
Sign up to set email alerts
|

OrganismTagger: detection, normalization and grounding of organism entities in biomedical documents

Abstract: witte@semanticsoftware.info.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
35
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 40 publications
(36 citation statements)
references
References 14 publications
1
35
0
Order By: Relevance
“…The OrganismTagger performance has previously been evaluated on two corpora, where it showed a precision of 95%-99%, a recall of 94%-97%, and a grounding accuracy of 97.4%-97.5% [22]. Since its results here are lower, we examined the error cases in more detail.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…The OrganismTagger performance has previously been evaluated on two corpora, where it showed a precision of 95%-99%, a recall of 94%-97%, and a grounding accuracy of 97.4%-97.5% [22]. Since its results here are lower, we examined the error cases in more detail.…”
Section: Discussionmentioning
confidence: 99%
“…The OrganismTagger is a hybrid rule-based/machine-learning system that extracts organism mentions from the biomedical literature, normalizes them to their scientific name, and provides grounding to the NCBI Taxonomy database [22]. …”
Section: Introductionmentioning
confidence: 99%
“…Typically, TM system that are based on algorithms such as hidden Markov models (HMM) [57,58], maximum entropy Markov models (MEMM) [59], conditional random fields (CRF) [60,61], and support vector machines (SVM) [62,63], need to be trained on a carefully constructed annotated training data set that is representative for the real life data set before the actual NER task. Machine learning based TM systems are used for instance to identify chemical entities in text [64], or are used in combination with rule-based and lexical methods to identify organism names in text [65] or used for extraction of cancer staging information from health records to improve clinical decision making [66].…”
Section: Named Entity Recognitionmentioning
confidence: 99%
“…On the one hand are tools for the fundamental task of recognising names within text, such as the dictionary-based TaxonFinder [12], Linnaneus [13] and OrganismTagger [14]; the machine learning-based NetiNeti [15]; and hybrid systems such as TaxonGrab [16] and SPECIES [17]. On the other hand are name-linking tools which are aimed at resolving taxonomic names mentioned in text to their preferred names, accomplished at three various levels: (1) orthographic or spelling, i.e, linking misspelt names to the correct ones; (2) nomenclature, i.e., taking into account scientific names with or without author, date and annotations; and (3) semantic, i.e., taking into account both scientific names and vernacular names.…”
Section: Introductionmentioning
confidence: 99%