2014
DOI: 10.7554/elife.03430
|View full text |Cite
|
Sign up to set email alerts
|

Sequence co-evolution gives 3D contacts and structures of protein complexes

Abstract: Protein–protein interactions are fundamental to many biological processes. Experimental screens have identified tens of thousands of interactions, and structural biology has provided detailed functional insight for select 3D protein complexes. An alternative rich source of information about protein interactions is the evolutionary sequence record. Building on earlier work, we show that analysis of correlated evolutionary sequence changes across proteins identifies residues that are close in space with sufficie… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

2
236
1

Year Published

2018
2018
2020
2020

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 470 publications
(258 citation statements)
references
References 69 publications
(115 reference statements)
2
236
1
Order By: Relevance
“…An MSA was generated for 9 different bitscores, a sequence inclusion threshold normalized to length and expressed as the number of bits per residue, with values ranging from the most-inclusive 0.1 to the least-inclusive 0.9. The alignment depth was chosen to optimize the number of non-redundant sequences with the fewest gaps in the alignment as described elsewhere 9,10 , although that the alignment choice was robust with respect to consistency of predicted ECs over a wide range of alignment depths. Blindly optimizing the an alignment choice resulted in two alignments with the smaller one at 31,505 non-redundant sequences of length 346 residues (10 – 355) with an “effective number” of 8,729 sequences after down-weighting sequences with more than 80% identity and no more than 30% gaps in any columns used for the EC model computation.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…An MSA was generated for 9 different bitscores, a sequence inclusion threshold normalized to length and expressed as the number of bits per residue, with values ranging from the most-inclusive 0.1 to the least-inclusive 0.9. The alignment depth was chosen to optimize the number of non-redundant sequences with the fewest gaps in the alignment as described elsewhere 9,10 , although that the alignment choice was robust with respect to consistency of predicted ECs over a wide range of alignment depths. Blindly optimizing the an alignment choice resulted in two alignments with the smaller one at 31,505 non-redundant sequences of length 346 residues (10 – 355) with an “effective number” of 8,729 sequences after down-weighting sequences with more than 80% identity and no more than 30% gaps in any columns used for the EC model computation.…”
Section: Methodsmentioning
confidence: 99%
“…EVComplex 10 was used to predict inter-protein contacts between T. thermophilus full-length RodA and full-length PBP2 (Uniprot ID Q5SJ23). We constructed alignments for RodA (33670 sequences) and PBP2 (40764 sequences) as described above for RodA alone, using the April 2017 Uniprot release 42 for clarity on species identifiers.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations