2007
DOI: 10.1371/journal.pcbi.0030160
|View full text |Cite
|
Sign up to set email alerts
|

Automated Protein Subfamily Identification and Classification

Abstract: Function prediction by homology is widely used to provide preliminary functional annotations for genes for which experimental evidence of function is unavailable or limited. This approach has been shown to be prone to systematic error, including percolation of annotation errors through sequence databases. Phylogenomic analysis avoids these errors in function prediction but has been difficult to automate for high-throughput application. To address this limitation, we present a computationally efficient pipeline… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
131
0

Year Published

2008
2008
2021
2021

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 109 publications
(133 citation statements)
references
References 73 publications
2
131
0
Order By: Relevance
“…Others apply sensitive profile-profile based comparison strategies with residue conservation scores finely tuned to recognising functional homologues [64][65][66]. One of the challenges in developing methods for large scale application is the lack of any large datasets to optimise thresholds for safe functional inheritance.…”
Section: How Can We Predict Function From Structure?mentioning
confidence: 99%
“…Others apply sensitive profile-profile based comparison strategies with residue conservation scores finely tuned to recognising functional homologues [64][65][66]. One of the challenges in developing methods for large scale application is the lack of any large datasets to optimise thresholds for safe functional inheritance.…”
Section: How Can We Predict Function From Structure?mentioning
confidence: 99%
“…However, without known related sequences or structures their detection is difficult. 89 Innovative novel approaches have been proposed, that is, the use of hydrophobicity distribution on protein structures using the fuzzy oil drop model, 59 the destabilization of limited protein regions, 90 phylogenomic classification of protein sequences, 91 or the classification of known protein catalytic sites. 92 Prediction of protein functional sites is an important step in identifying small-molecule interactions for drug discovery 93 and to optimize the drugs targeting these sites.…”
Section: Methods To Compare Binding Sitesmentioning
confidence: 99%
“…from a set of sequences with different domain compositions) using the SCI-PHY algorithm based on HMMs. 99 Finally, unlike other methods that calculate their similarity matrices based on alignments, CLUSS 100 (http://prospectus.usherbrooke.ca/CLUSS/) performs clustering based on a matching amino acid subsequences, which makes it applicable both to alignable and unalignable sequences, e.g. products of circular permutation etc.…”
Section: Sequence Clusteringmentioning
confidence: 99%
“…As an example, the SCI-PHY server (http://phylogenomics.berkeley.edu/SCI-PHY/) allows users to upload a MSA for subfamily identification and subfamily HMM construction. 99 Further, analysis of the phylogenetic tree in connection with the known (or assumed) tree of hosts (organisms) can be used to deduce major evolutionary events in the protein family, e.g. gene duplications, gene losses, which provide the basis for discrimination between orthologs and paralogs and may guide functional predictions (review: 156 ).…”
Section: Relationship Of Multiple Sequence Alignments To Phylogeneticmentioning
confidence: 99%