2015
DOI: 10.1186/1471-2105-16-s6-s4
|View full text |Cite
|
Sign up to set email alerts
|

Computational algorithms to predict Gene Ontology annotations

Abstract: BackgroundGene function annotations, which are associations between a gene and a term of a controlled vocabulary describing gene functional features, are of paramount importance in modern biology. Datasets of these annotations, such as the ones provided by the Gene Ontology Consortium, are used to design novel biological experiments and interpret their results. Despite their importance, these sources of information have some known issues. They are incomplete, since biological knowledge is far from being defini… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
28
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 27 publications
(29 citation statements)
references
References 27 publications
0
28
0
Order By: Relevance
“…We also plan to try alternative prediction models, like probabilistic latent semantic analysis (Pinoli, Chicco & Masseroli, 2015). Finally, we plan to extend our computational system by adding a feature selection step, able to state the most relevant features among the dataset.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We also plan to try alternative prediction models, like probabilistic latent semantic analysis (Pinoli, Chicco & Masseroli, 2015). Finally, we plan to extend our computational system by adding a feature selection step, able to state the most relevant features among the dataset.…”
Section: Resultsmentioning
confidence: 99%
“…We validated the performance of traditional machine learning models such as support vector machines (SVM) with radial basis function kernel (Scholkopf et al, 1997), k-nearest neighbors (Peterson, 2009), and decision trees (Quinlan, 1986). In general, the proposed models surpassed the performance of the classical methodologies in terms of area under the PR curve.…”
Section: Notesmentioning
confidence: 99%
“…We used the annotation prediction and prioritization pipeline described in [20], which includes the prediction methods truncated singular value decomposition (tSVD), semantically improved tSVD with gene clustering (SIM1), and Semantically IMproved tSVD with gene clustering and feature term similarity weights (SIM2), all described in [12] [13]. Figure 1 shows the used computational pipeline and its extension with the novelty indicator proposed in this paper.…”
Section: Prediction Pipeline and Datasetsmentioning
confidence: 99%
“…We tested the use of the GOscore BM measure as a novelty indicator on all the GO annotations predicted for the Homo sapiens genes, based on the gene GO annotations available in the GPDW dataset of July 2009 and using the (tSVD), Semantically IMproved tSVD with gene clustering (SIM1), Semantically IMproved tSVD with gene clustering and feature term similarity weights (SIM2) tSVD, SIM1 and SIM2 methods described in [12]. The GOscore BM measure resulted in concordance with the visual evaluation of the gene annotations performed by an expert.…”
Section: Novelty Indicator Testmentioning
confidence: 99%
“…Genome annotation is a central task in modern biology, and a great deal of effort has been dedicated to the construction of controlled vocabularies to describe cellular organization and function, and to the association of terms in these controlled vocabularies to specific genes. Because annotating genes empirically is costly and time consuming, computational methods have been developed to infer gene annotations from existing annotations and additional data, such as gene sequences, interaction network connectivity, and gene-expression profiles [1][2][3][4][5] . To date, the aim of such methods has been to decide whether a gene should or should not be annotated with a particular term.…”
Section: Introductionmentioning
confidence: 99%