2014
DOI: 10.1093/database/bau086
|View full text |Cite
|
Sign up to set email alerts
|

Overview of the gene ontology task at BioCreative IV

Abstract: Gene Ontology (GO) annotation is a common task among model organism databases (MODs) for capturing gene function data from journal articles. It is a time-consuming and labor-intensive task, and is thus often considered as one of the bottlenecks in literature curation. There is a growing need for semiautomated or fully automated GO curation techniques that will help database curators to rapidly and accurately identify gene function information in full-length articles. Despite multiple attempts in the past, few … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
71
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
6
2
2

Relationship

0
10

Authors

Journals

citations
Cited by 56 publications
(74 citation statements)
references
References 48 publications
(44 reference statements)
2
71
0
Order By: Relevance
“…ER methods, typically divide the task into two steps, (1) identify the entities and their location in the context, and (2) assign unique identifiers to the entities [23]. Fortunately, multiple terminological databases, such as Gene Ontology [25], UMBLS [26], BioLexicon [26], and Biothesaurus [26] provide information on biological entities and name variations and can be used to detect biological entities such as genes or proteins [27,28].…”
Section: Overview and Related Workmentioning
confidence: 99%
“…ER methods, typically divide the task into two steps, (1) identify the entities and their location in the context, and (2) assign unique identifiers to the entities [23]. Fortunately, multiple terminological databases, such as Gene Ontology [25], UMBLS [26], BioLexicon [26], and Biothesaurus [26] provide information on biological entities and name variations and can be used to detect biological entities such as genes or proteins [27,28].…”
Section: Overview and Related Workmentioning
confidence: 99%
“…This resource has drawn a remarkable attention from the TM and NLP community. However on close examination, the extraction of GO concepts from text content is still a research subject since it has proven to be challenging (59). …”
Section: Knowledge Encodingmentioning
confidence: 99%
“…The foremost of these difficulties is that challenge tasks are often simplified or abstracted versions of the real-world problems. For example, although biocurators routinely use the full text of an article (56, 57), challenge tasks often only utilize the abstract due to difficulties in accessing full text articles and processing full text. A consequence of this simplification of the real-world problem is that even systems that perform well on challenge tasks yield significantly lower results when evaluated in practical real-world settings.…”
Section: Future Roles Of Researchers Publishers and Curatorsmentioning
confidence: 99%