2000
DOI: 10.1093/nar/28.1.49

ProtoMap: automatic classification of protein sequences and hierarchy of protein families

Abstract: The ProtoMap site offers an exhaustive classification of all proteins in the SWISS-PROT database, into groups of related proteins. The classification is based on analysis of all pairwise similarities among protein sequences. The analysis makes essential use of transitivity to identify homologies among proteins. Within each group of the classification, every two members are either directly or transitively related. However, transitivity is applied restrictively in order to prevent unrelated proteins from cluster…

Cited by 141 publications (84 citation statements)
References 16 publications
“…In a given round, new ORFs that participated in high-scoring pairs with e-values of <10⁻⁵ became the query set for the next round, still against the same database. This process continued until no additional proteins with e-values below the specified cutoff were obtained (22,62). This method identified a total of 171 proteins, including the original 103 queries as S. cerevisiae cell wall components or their paralogs.…”
Section: Methods (mentioning)
confidence: 99%
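
The round-by-round search described in this statement can be sketched as follows. This is a minimal illustration, not the cited authors' pipeline: the function name `iterative_search`, the `search` callback, and the toy hit table `TOY_HITS` are all hypothetical, and a real run would call a sequence-search tool in place of the dictionary lookup. Only the e-value cutoff of 10⁻⁵ and the stopping rule (no new proteins below the cutoff) come from the quoted text.

```python
from typing import Callable, Iterable, Set, Tuple

Hit = Tuple[str, float]  # (protein identifier, e-value)


def iterative_search(
    seeds: Iterable[str],
    search: Callable[[str], Iterable[Hit]],
    cutoff: float = 1e-5,
) -> Set[str]:
    """Expand a seed set by repeated searches against one fixed database.

    `search` is a hypothetical callback returning (hit, e-value) pairs for
    a query. Each round uses only the proteins newly found in the previous
    round as queries, and the loop stops when a round finds nothing new.
    """
    found = set(seeds)
    queries = set(seeds)
    while queries:
        new_hits = set()
        for query in queries:
            for hit, evalue in search(query):
                # Keep only hits strictly below the cutoff that are not yet known.
                if evalue < cutoff and hit not in found:
                    new_hits.add(hit)
        found |= new_hits
        queries = new_hits  # the next round queries only the new proteins
    return found


# Toy stand-in for a database search (made-up identifiers and e-values).
TOY_HITS = {
    "A": [("B", 1e-10), ("C", 1e-2)],
    "B": [("D", 1e-7)],
    "C": [("E", 1e-20)],
}

result = iterative_search({"A"}, lambda q: TOY_HITS.get(q, []))
```

In the toy table, `C` is never accepted because its only link to the seed is above the cutoff, so `E` is never reached even though the `C`-to-`E` hit is strong. Tracking `found` separately from `queries` ensures each protein is queried at most once, which guarantees termination on a finite database.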
“…For each method i and each sequence s in the database, we report the sequence, provided its E value E i (s) is below or equal to a cutoff value E C of 1,000. Then, one method is chosen to be used as a reference method, on the basis of which the E values of the other methods are rescaled (25). In CHASE, we use HMMSEARCH as our reference method.…”
Section: For Each Sequence S To Produce E Values Of Comparable Size (mentioning)
confidence: 99%
“…[78,88]). This provides a broader road map that includes all three SNARE prototypes as described in (Fig.…”
Section: A Road Map To All Snare Proteins (mentioning)
confidence: 99%
“…The helical SNARE domains of syntaxin in both genomes share similar global features. This includes [78]). The number of proteins from each cluster that contribute to an edge is marked next to the cluster.…”
Section: Diversity Of Snares In Human and Plant Genomes (mentioning)
confidence: 99%