2015
DOI: 10.1073/pnas.1508380112
|View full text |Cite
|
Sign up to set email alerts
|

Unexpected features of the dark proteome

Abstract: We surveyed the "dark" proteome-that is, regions of proteins never observed by experimental structure determination and inaccessible to homology modeling. For 546,000 Swiss-Prot proteins, we found that 44-54% of the proteome in eukaryotes and viruses was dark, compared with only ∼14% in archaea and bacteria. Surprisingly, most of the dark proteome could not be accounted for by conventional explanations, such as intrinsic disorder or transmembrane regions. Nearly half of the dark proteome comprised dark protein… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

10
214
2

Year Published

2016
2016
2018
2018

Publication Types

Select...
9

Relationship

1
8

Authors

Journals

citations
Cited by 176 publications
(226 citation statements)
references
References 50 publications
10
214
2
Order By: Relevance
“…15 Traditional ordered proteins have a relatively stable 3-D structure possess Ramachandran angles that vary only slightly around their equilibrium positions with occasional cooperative conformational switches. On the other hand, IDPs/ IDPRs, despite being biologically active, fail to form specific 3D structures and exist as highly dynamic structural ensembles, either at the secondary or at the tertiary level.…”
Section: Introductionmentioning
confidence: 99%
“…15 Traditional ordered proteins have a relatively stable 3-D structure possess Ramachandran angles that vary only slightly around their equilibrium positions with occasional cooperative conformational switches. On the other hand, IDPs/ IDPRs, despite being biologically active, fail to form specific 3D structures and exist as highly dynamic structural ensembles, either at the secondary or at the tertiary level.…”
Section: Introductionmentioning
confidence: 99%
“…On average, each protein sequence matches to ∼200 PDB structures (63) but contains several dark regions, with no detectable similarity to any known 3D structure (66). (e) A stacked bar chart showing the total fraction of protein residues that map to any PDB structure (either directly or via homology modelling); the remaining fraction (dark proteome) is divided into dark regions (d ) and dark proteins (where a single dark region spans an entire sequence) (66). Stacked bar charts have been used to achieve moderate data density, and axes replaced by two shaded regions (indicating 25%, 50%, and 75%).…”
Section: Protein Structuresmentioning
confidence: 99%
“…Interestingly, much of the remaining dark proteome currently cannot be explained (e.g., Figure 6f ). Exploring this dark protein structure universe is an important data science challenge (65) in which visualization is playing a key role (66). High-throughput approaches are also being applied to molecular dynamics (67), generating increasingly large, complex trajectories; these data can give insights into key events (e.g., binding with ligands or other proteins).…”
Section: Protein Structuresmentioning
confidence: 99%
“…These sequences include so-called ORFans [59], domains of unknown function (DUFs) [60,61], and protein 'dark matter' [62,63 ]. Many apparent ORFan proteins are likely highly divergent homologs of known structural families [60,64].…”
Section: Protein Dark Mattermentioning
confidence: 99%