2014
DOI: 10.1073/pnas.1403395111
|View full text |Cite
|
Sign up to set email alerts
|

Global view of the protein universe

Abstract: To explore protein space from a global perspective, we consider 9,710 SCOP (Structural Classification of Proteins) domains with up to 70% sequence identity and present all similarities among them as networks: In the "domain network," nodes represent domains, and edges connect domains that share "motifs," i.e., significantly sized segments of similar sequence and structure. We explore the dependence of the network on the thresholds that define the evolutionary relatedness of the domains. At excessively strict t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
76
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 72 publications
(80 citation statements)
references
References 41 publications
(59 reference statements)
4
76
0
Order By: Relevance
“…Specifically, given a protein of solved (or predicted) structure but unknown function, the efficient identification of structurally similar proteins in the Protein Data Bank (PDB) is critical to function prediction. Finding structural neighbors can also give insight into the evolutionary origins of proteins of interest (Yona et al, 1999; Nepomnyachiy et al, 2014). …”
Section: Resultsmentioning
confidence: 99%
“…Specifically, given a protein of solved (or predicted) structure but unknown function, the efficient identification of structurally similar proteins in the Protein Data Bank (PDB) is critical to function prediction. Finding structural neighbors can also give insight into the evolutionary origins of proteins of interest (Yona et al, 1999; Nepomnyachiy et al, 2014). …”
Section: Resultsmentioning
confidence: 99%
“…1,2 A central pillar in this area is the concept of a protein "fold." The space of all folds (known and unknown) can be conceptually organized in at least three distinct ways: (a) using discrete, hierarchical classification schemes, with greater levels of similarity between entities (folds or individual three-dimensional [3D] structures within a given fold class) that occupy lower (more finely detailed) classification levels 6 ; (b) as acyclic graphs, with vertices denoting folds and edges representing structural similarity between two folds 7 ; and (c) as dendrograms, wherein proteins with similar SSEs are neighboring leaves in these taxonomic trees. 4 Here, we follow Orengo and colleagues 5 in considering a fold to be the "global arrangement of the main secondary structural elements (SSEs), in terms of their relative orientations (architecture) and patterns of connectivity (topology)."…”
Section: Introductionmentioning
confidence: 99%
“…Proteins can have local structural similarity without an obvious global sequence or structural relationship, a phenomenon that suggests that protein fold space is continuous, at least in part [24] [25]. Local similarities are often indicative of functional similarity, a fact that is exploited by a number of template-based methods for function prediction.…”
Section: Exploiting Local Structural Similaritymentioning
confidence: 99%