2018
DOI: 10.1038/s41598-018-28330-z
|View full text |Cite
|
Sign up to set email alerts
|

Using machine learning tools for protein database biocuration assistance

Abstract: Biocuration in the omics sciences has become paramount, as research in these fields rapidly evolves towards increasingly data-dependent models. As a result, the management of web-accessible publicly-available databases becomes a central task in biological knowledge dissemination. One relevant challenge for biocurators is the unambiguous identification of biological entities. In this study, we illustrate the adequacy of machine learning methods as biocuration assistance tools using a publicly available protein … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
1
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 54 publications
0
6
0
Order By: Relevance
“…To address these issues, automated and augmented curation systems for extracting protein functional data from scientific literature are becoming increasingly desired. In particular, Machine Learning and Natural Language Processing techniques are beginning to be employed for biocuration efforts 1 , 2 for extracting and organising unstructured biological information into a structured form that is accessible to biologists. Central to these automated systems, is the process of unambiguously extracting semantic relationships between two or more biological entities in the literature 3 .…”
Section: Introductionmentioning
confidence: 99%
“…To address these issues, automated and augmented curation systems for extracting protein functional data from scientific literature are becoming increasingly desired. In particular, Machine Learning and Natural Language Processing techniques are beginning to be employed for biocuration efforts 1 , 2 for extracting and organising unstructured biological information into a structured form that is accessible to biologists. Central to these automated systems, is the process of unambiguously extracting semantic relationships between two or more biological entities in the literature 3 .…”
Section: Introductionmentioning
confidence: 99%
“…To address these issues, automated and augmented curation systems for extracting protein functional data from scientific literature is becoming increasingly desired. In particular, Machine Learning and Natural Language Processing techniques are beginning to be employed for large scale biocuration efforts 1,2 . Biocuration refers to the process of extracting and organising unstructured biological information into a structured form that is accessible to biologists.…”
Section: Introductionmentioning
confidence: 99%
“…2 One reason for this may be the lack of the type of biocuration standards that begin to be common in other life sciences fields such as genomics and, to a lesser extent, proteomics. 3 Further reasons include the fact that MRS data in this area are scarce and fragmented. Fragmentation is both geographical and institutional, as the effort of gathering multi-center and international data is hindered by different barriers.…”
mentioning
confidence: 99%