2011
DOI: 10.1186/2041-1480-2-s5-s11
Assessment of NER solutions against the first and second CALBC Silver Standard Corpus

Abstract: Background: Competitions in text mining have been used to measure the performance of automatic text processing solutions against a manually annotated gold standard corpus (GSC). The preparation of the GSC is time-consuming and costly and the final corpus consists at the most of a few thousand documents annotated with a limited set of semantic groups. To overcome these shortcomings, the CALBC project partners (PPs) have produced a large-scale annotated biomedical corpus with four different semantic groups through…

Cited by 48 publications (45 citation statements)
References 9 publications
“…However, dictionary-based approaches tend to miss undefined terms that are not mentioned in the dictionary [12]. The overall results of dictionary-based approaches rely heavily on a predefined dictionary.…”
Section: Introduction (mentioning)
confidence: 99%
“…For example, “4-hydroxybenzoate polyprenyltransferase” (UniProt:COQ2_YEAST) is an enzyme that requires the substrate “4-hydroxybenzoate” (ChEBI:17879). Many more such references can be expected across the Lexeome as has been demonstrated by the CALBC project [5].…”
Section: Introduction (mentioning)
confidence: 84%
“…Some of the best known are BioCreative (4), BioNLP (5) and CALBC (6). The 2012 BioCreative edition included, in particular, a task aiming at supporting the triage process for the Comparative Toxicogenomics Database.…”
Section: Introduction (mentioning)
confidence: 99%