2016
DOI: 10.1007/978-1-4939-3743-1_5
|View full text |Cite
|
Sign up to set email alerts
|

Computational Methods for Annotation Transfers from Sequence

Abstract: Surveys of public sequence resources show that experimentally supported functional information is still completely missing for a considerable fraction of known proteins and is clearly incomplete for an even larger portion. Bioinformatics methods have long made use of very diverse data sources alone or in combination to predict protein function, with the understanding that different data types help elucidate complementary biological roles. This chapter focuses on methods accepting amino acid sequences as input … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
33
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 43 publications
(35 citation statements)
references
References 78 publications
(32 reference statements)
0
33
0
Order By: Relevance
“…The prediction of gene function generally proceeds by the transfer of function from genes with experimental evidence to unannotated, or less-annotated, genes that are similar by some measure [42]. While several methods use multiple data types to carry out predictions [31,47,11], many solely rely on evolutionary relationships [16,24,6,10] and are the focus of the current study.…”
Section: Introductionmentioning
confidence: 99%
“…The prediction of gene function generally proceeds by the transfer of function from genes with experimental evidence to unannotated, or less-annotated, genes that are similar by some measure [42]. While several methods use multiple data types to carry out predictions [31,47,11], many solely rely on evolutionary relationships [16,24,6,10] and are the focus of the current study.…”
Section: Introductionmentioning
confidence: 99%
“…4 and 6 [ 8 , 9 ]) and predicted (Chap. 5 [ 10 ]), for all known proteins, noncoding RNA sequences, and cellular components. In other words, carrying out comprehensive functional annotation is what drives the project, not the ontology itself.…”
Section: Database Cross-referencementioning
confidence: 99%
“…Such a dataset should not be used to train the prediction algorithms (refer to Chap. 5 [ 7 ]) and can therefore be used to test them. In the current literature, the validation sets mimic the gold standard dataset, but they are biased:proteins that are prioritized for experimental characterization and curation are often selected for their medical or agricultural relevance, and may not be representative of the full function space that the computational methods address.…”
Section: Challenges Of Assessing Computational Prediction Of Functionmentioning
confidence: 99%
“…Once the pipeline for the computational prediction has been setup-a task which is by no means trivial-it can be relatively straightforward to obtain computational prediction of function across a large number of biological sequences. Chapter 5 [ 7 ] contains a detailed introduction to the methods used in computational annotation.…”
Section: Introductionmentioning
confidence: 99%