2014
DOI: 10.1093/database/bau094
|View full text |Cite
|
Sign up to set email alerts
|

Hybrid curation of gene–mutation relations combining automated extraction and crowdsourcing

Abstract: Background: This article describes capture of biological information using a hybrid approach that combines natural language processing to extract biological entities and crowdsourcing with annotators recruited via Amazon Mechanical Turk to judge correctness of candidate biological relations. These techniques were applied to extract gene– mutation relations from biomedical abstracts with the goal of supporting production scale capture of gene–mutation–disease findings as an open source resource for personalized… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
38
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
5
2
1
1

Relationship

2
7

Authors

Journals

citations
Cited by 39 publications
(38 citation statements)
references
References 38 publications
0
38
0
Order By: Relevance
“…National and international Research Infrastructures should play a central role to promote optimal data stewardship practices across the databases they support. Similarly, innovative curation models should emerge by combining the quality and richness of curation workfl ows, more cost-effective crowd-based triage, and the scalability of text mining instruments [ 58 ].…”
Section: Resultsmentioning
confidence: 99%
“…National and international Research Infrastructures should play a central role to promote optimal data stewardship practices across the databases they support. Similarly, innovative curation models should emerge by combining the quality and richness of curation workfl ows, more cost-effective crowd-based triage, and the scalability of text mining instruments [ 58 ].…”
Section: Resultsmentioning
confidence: 99%
“…Crowd Sourced Assessment of Technical Skills (CSATS): A Scalable Assessment Tool for the Nursing Workforce utilize the 'power of the crowd' to perform discrete tasks [8], create content frameworks like Wikipedia [9], produce repositories of public data [10]. Khatib, et al, used crowdsourcing as a method to create an online game in which participants created an accurate model of a protein structure that had previously eluded scientists [11].…”
Section: Research Articlementioning
confidence: 99%
“…Application of crowdsourcing in biology, particularly for data curation, is not new and comprises a variety of areas including: name entity recognition (NER) for drug and diseases (25-27), identification of medically-relevant terms from patient online posts (28), annotation of diseases described in PubMed (29), systematic examination of databases and other resources for drug indications, biomedical ontologies and gene-disease interactions (26,30-33), identification of the relationship between genes and mutations (34), and annotation of medical data for electronic health records (35). Interestingly, in most of the studies, crowdsourcing has proven to be as effective as expert curation (26,36).…”
Section: Introductionmentioning
confidence: 99%