2012
DOI: 10.1371/journal.pone.0037645
|View full text |Cite
|
Sign up to set email alerts
|

Protein Sequence Alignment Analysis by Local Covariation: Coevolution Statistics Detect Benchmark Alignment Errors

Abstract: The use of sequence alignments to understand protein families is ubiquitous in molecular biology. High quality alignments are difficult to build and protein alignment remains one of the largest open problems in computational biology. Misalignments can lead to inferential errors about protein structure, folding, function, phylogeny, and residue importance. Identifying alignment errors is difficult because alignments are built and validated on the same primary criteria: sequence conservation. Local covariation i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
20
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 18 publications
(20 citation statements)
references
References 37 publications
0
20
0
Order By: Relevance
“…The evolutionary distribution of these residues was examined with an alignment of 178 singlechain LHE protein sequences generated by using Cn3D (22) to develop a structure-based guide alignment followed by LoCo (19) to identify and correct systematically misaligned segments. The resulting alignment was hand-curated to remove all sequences lacking acidic residues at the catalytic positions (19) (Fig. S1).…”
Section: Identification Of a Coevolving Network Of Catalytic And Noncmentioning
confidence: 99%
See 2 more Smart Citations
“…The evolutionary distribution of these residues was examined with an alignment of 178 singlechain LHE protein sequences generated by using Cn3D (22) to develop a structure-based guide alignment followed by LoCo (19) to identify and correct systematically misaligned segments. The resulting alignment was hand-curated to remove all sequences lacking acidic residues at the catalytic positions (19) (Fig. S1).…”
Section: Identification Of a Coevolving Network Of Catalytic And Noncmentioning
confidence: 99%
“…Moreover, the monomeric and dimeric LHEs likely evolved under different functional constraints, and the phylogenetic signal and functional information for either form is diluted in alignments that include both the monomeric and dimeric LHEs. Because LHEs are currently under investigation for use as genome-editing agents (16-18), a greater understanding of their functional constraints would aid in engineering studies.Here, we take advantage of high-quality structure-guided multiple sequence alignments of single-chain LHEs to predict coevolving networks using methods based on mutual information (5,19,20). Strikingly, the network with the strongest predictive scores included the metal-binding catalytic residues and adjacent noncatalytic residues that lie on opposite LAGLIDADG α-helices.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…A number of studies have highlighted the impact of the choice of alignment on subsequent phylogenetic inference [ 23 - 31 ]; in many cases different alignment methods, or different guide trees, can give rise to very different phylogenies [ 23 , 32 - 36 ]. Sensitivity to the alignment is also observed in the context of many other types of downstream analysis, including homology modelling of protein structures [ 37 - 39 ], detection of correlated evolution [ 40 , 41 ], prediction of RNA secondary structure [ 42 ], and inference of positive selection [ 36 , 43 - 45 ].…”
Section: Introductionmentioning
confidence: 99%
“…Many groups have observed high co-evolution between residues close in primary structure. Two or more residues can only be identified as co-evolving if they have minimum distance apart in primary structure [37,38]. Set of sequential residues in one protein chain that are close in primary structure is called subsequence.…”
Section: Introductionmentioning
confidence: 99%