2019
DOI: 10.1101/2019.12.23.887307
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Statistical physics of interacting proteins: impact of dataset size and quality assessed in synthetic sequences

Abstract: Identifying protein-protein interactions is crucial for a systems-level understanding of the cell. Recently, algorithms based on inverse statistical physics, e.g. Direct Coupling Analysis (DCA), have allowed to use evolutionarily related sequences to address two conceptually related inference tasks: finding pairs of interacting proteins, and identifying pairs of residues which form contacts between interacting proteins. Here we address two underlying questions: How are the performances of both inference tasks … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
6
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(7 citation statements)
references
References 46 publications
(108 reference statements)
1
6
0
Order By: Relevance
“…Figure 3 shows the performance of contact prediction versus T for equilibrium sequences (no phylogeny), and for sequences generated with μ = 15 and μ = 5. At equilibrium, our model possesses a ferromagnetic–paramagnetic phase transition, whose approximate temperature T C = 4.2 was found by inspecting magnetization histograms [38]. For equilibrium sequences at low T , inference performance without APC is poor (figure 3, left panel), and DCA methods perform substantially better than local methods (which are overlapping).…”
Section: Resultsmentioning
confidence: 99%
“…Figure 3 shows the performance of contact prediction versus T for equilibrium sequences (no phylogeny), and for sequences generated with μ = 15 and μ = 5. At equilibrium, our model possesses a ferromagnetic–paramagnetic phase transition, whose approximate temperature T C = 4.2 was found by inspecting magnetization histograms [38]. For equilibrium sequences at low T , inference performance without APC is poor (figure 3, left panel), and DCA methods perform substantially better than local methods (which are overlapping).…”
Section: Resultsmentioning
confidence: 99%
“…Figure 3 shows the performance of contact prediction versus T for equilibrium sequences (no phylogeny), and for sequences generated with µ = 15 and µ = 5. At equilibrium, our model possesses a ferromagnetic-paramagnetic phase transition, whose approximate temperature T C = 4.2 was found by inspecting magnetisation histograms [31]. For equilibrium sequences at low T , inference performance without APC is poor (Figure 3, left panel), and DCA methods perform substantially better than local methods (which are overlapping).…”
Section: Resultsmentioning
confidence: 99%
“…These trends are consistent with those previously described in Refs. [1, 31]. Inference is difficult either if sequences are all frozen and redundant (very small T ) or if sequences are too noisy (very large T ), yielding better performance for intermediate values of T .…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In literature, the problem of hybrid PPI inference approaches has thus far been presented in two different concepts [21,22]. Some of those approaches explore the interaction between specific protein partners inside a paired MSA [23][24][25][26][27][28][29][30] whereas a handful of others assess the possibility of interaction between protein families [31][32][33].…”
Section: Introductionmentioning
confidence: 99%