2016
DOI: 10.1016/j.ymeth.2015.09.029
|View full text |Cite
|
Sign up to set email alerts
|

Protein function annotation using protein domain family resources

Abstract: As a result of the genome sequencing and structural genomics initiatives, we have a wealth of protein sequence and structural data. However, only about 1% of these proteins have experimental functional annotations. As a result, computational approaches that can predict protein functions are essential in bridging this widening annotation gap. This article reviews the current approaches of protein function prediction using structure and sequence based classification of protein domain family resources with a spec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
27
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
3
3
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 37 publications
(29 citation statements)
references
References 69 publications
2
27
0
Order By: Relevance
“…The large difference between the selected number of genes might be due to the large difference in the number of samples available (and therefore used in analysis) for each species. Functional enrichment analysis of the variable genes revealed pathways related to the heart functions (supplementary table 4), confirming that at least part of the gene expression variability was reflecting biological differences. Hierarchical clustering of the mouse and rat dataset separately revealed clusters of probes and clusters of experiments (supplementary figure 5).…”
Section: Functional Annotation Transfer Across Rat and Mouse Using Hementioning
confidence: 64%
See 1 more Smart Citation
“…The large difference between the selected number of genes might be due to the large difference in the number of samples available (and therefore used in analysis) for each species. Functional enrichment analysis of the variable genes revealed pathways related to the heart functions (supplementary table 4), confirming that at least part of the gene expression variability was reflecting biological differences. Hierarchical clustering of the mouse and rat dataset separately revealed clusters of probes and clusters of experiments (supplementary figure 5).…”
Section: Functional Annotation Transfer Across Rat and Mouse Using Hementioning
confidence: 64%
“…An experimental validation of each gene is impractical to this end as it demands high financial and time cost. It is estimated that only one percent of proteins have experimental functional annotations [4]. Bioinformatic approaches therefore provide an attractive alternative [5].…”
Section: Introductionmentioning
confidence: 99%
“…Current computational approaches for protein family classification include methods such as BLASTp and profile hidden Markov models (pHMMs) which compare sequences to a large database of pre-annotated sequences. However, inference using these alignment-based methods is computationally inefficient, as they require repeated comparison of sequences to an exponentially growing database of labeled family profiles and are limited by expensive, manually-tuned processing pipelines [27]. With the exponential growth of protein discovery, the development of more scalable approaches is required to overcome traditional bottlenecks [28].…”
Section: Task: Protein Family Classificationmentioning
confidence: 99%
“…Proteins are often composed of discrete domains, and these can either be conceptualized as sub-sequences of independent protein sequences which share homology (and by extension evolutionary origin), 6 or alternatively, domains may be considered structurally, where they are subsections of the proteins which are compact, independently folding and observed to be shared between a variety of proteins. [7][8][9] An extension of this observation, that proteins can be decomposed into sets of domains, is the hypothesis that domains act as sub-functional units and when composed together, a protein's given combination of domains is what gives rise to the protein's overall specific function 10,11 In the following study, we show that protein domains can be embedded in a "semantically" meaningful vector space and that this embedding space reflects meaningful information about the functional roles (in terms of GO term assignments) of the individual protein domains.…”
Section: Introductionmentioning
confidence: 99%