The platform will undergo maintenance on Sep 14 at about 9:30 AM EST and will be unavailable for approximately 1 hour.
2018
DOI: 10.1073/pnas.1711913115
|View full text |Cite
|
Sign up to set email alerts
|

Power law tails in phylogenetic systems

Abstract: Covariance analysis of protein sequence alignments uses coevolving pairs of sequence positions to predict features of protein structure and function. However, current methods ignore the phylogenetic relationships between sequences, potentially corrupting the identification of covarying positions. Here, we use random matrix theory to demonstrate the existence of a power law tail that distinguishes the spectrum of covariance caused by phylogeny from that caused by structural interactions. The power law is essent… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

4
110
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 66 publications
(114 citation statements)
references
References 47 publications
(61 reference statements)
4
110
0
Order By: Relevance
“…This is because our aim here is to quantitatively understand the bases of prediction of interaction partners thanks to contacts. Correlations arising in protein sequences due to their common evolutionary history [34][35][36][37][38][39][40] can further contribute to the success of DCA-based approaches at predicting protein-protein interactions from natural protein sequences [41,42], while they obscure the identification of contacts [6,10,36]. Besides, generalizing the present model to Potts spins and including local fields to tune residue conservation would make it more similar to the models inferred by DCA from natural protein sequences.…”
Section: Model and Data Generationmentioning
confidence: 99%
“…This is because our aim here is to quantitatively understand the bases of prediction of interaction partners thanks to contacts. Correlations arising in protein sequences due to their common evolutionary history [34][35][36][37][38][39][40] can further contribute to the success of DCA-based approaches at predicting protein-protein interactions from natural protein sequences [41,42], while they obscure the identification of contacts [6,10,36]. Besides, generalizing the present model to Potts spins and including local fields to tune residue conservation would make it more similar to the models inferred by DCA from natural protein sequences.…”
Section: Model and Data Generationmentioning
confidence: 99%
“…However, distributed patterns may also be related to phylogenetic correlations, which are present in the data, cf. [46]. As has been shown recently in a heuristic way [47], the decomposition of sequence-data covariance matrices or couplings matrices into a sum of a sparse and a low-rank matrix can substantially improve contact prediction, if only the sparse matrix is used.…”
Section: Discussionmentioning
confidence: 94%
“…However, 23 additional correlations arise in protein sequences due to their common evolutionary 24 history, i.e. phylogeny [23][24][25], even in the absence of structural or functional constraints. 25 These historical correlations could also be exploited to predict interacting partners, since 26 a pair of interacting proteins can have a more strongly shared phylogenetic history than 27 non-interacting proteins [26].…”
mentioning
confidence: 99%
“…phylogeny [23][24][25], even in the absence of structural or functional constraints. 25 These historical correlations could also be exploited to predict interacting partners, since 26 a pair of interacting proteins can have a more strongly shared phylogenetic history than 27 non-interacting proteins [26]. For instance, interacting proteins tend to have similar 28 evolutionary rates [27,28], which helps to identify them [26].…”
mentioning
confidence: 99%
See 1 more Smart Citation