Consensus sequence design offers a promising strategy for designing proteins of high stability while retaining biological activity since it draws upon an evolutionary history in which residues important for both stability and function are likely to be conserved. Although there have been several reports of successful consensus design of individual targets, it is unclear from these anecdotal studies how often this approach succeeds and how often it fails. Here, we attempt to assess generality by designing consensus sequences for a set of six protein families with a range of chain lengths, structures, and activities. We characterize the resulting consensus proteins for stability, structure, and biological activities in an unbiased way. We find that all six consensus proteins adopt cooperatively folded structures in solution. Strikingly, four of six of these consensus proteins show increased thermodynamic stability over naturally occurring homologs. Each consensus protein tested for function maintained at least partial biological activity. Although peptide binding affinity by a consensus-designed SH3 is rather low, K m values for consensus enzymes are similar to values from extant homologs. Although consensus enzymes are slower than extant homologs at low temperature, they are faster than some thermophilic enzymes at high temperature. An analysis of sequence properties shows consensus proteins to be enriched in charged residues, and rarified in uncharged polar residues. Sequence differences between consensus and extant homologs are predominantly located at weakly conserved surface residues, highlighting the importance of these residues in the success of the consensus strategy.protein stability | protein design | consensus sequence
Repeat proteins are constructed from a linear array of modular units, giving rise to an overall topology lacking long-range interactions. This suggests that stabilizing repeat modules based on consensus information might be added to a repeat protein domain, allowing it to be extended without altering its overall topology. Here we add consensus modules the ankyrin repeat domain from the Drosophila Notch receptor to investigate the structural tolerance to these modules, the relative thermodynamic stability of these hybrid proteins, and how alterations in the energy landscape influence folding kinetics. Insertions of consensus modules between repeats five and six of the Notch ankyrin domain have little effect on the far-and near-UV CD spectra, indicating that neither secondary nor tertiary structure is dramatically altered. Furthermore, stable structure is maintained at increased denaturant concentrations in the polypeptides containing the consensus repeats, indicating that the consensus modules are capable of stabilizing much of the domain. However, insertion of the consensus repeats appears to disrupt cooperativity, producing a two-stage (three-state) unfolding transition in which the C-terminal repeats unfold at moderate urea concentrations. Removing the C-terminal repeats (Notch ankyrin repeats six and seven) restores equilibrium two-state folding and demonstrates that the high stability of the consensus repeats is propagated into the N-terminal, naturally-occurring Notch ankyrin repeats. This stability increase greatly increases the folding rate, and suggests that the transition state ensemble may be repositioned in the chimeric consensus-stabilized proteins in response to local stability.
The modular nature of repeat proteins has made them a successful target for protein design. Ankyrin repeat, TPR, and leucine rich repeat domains that have been designed solely on consensus information have been shown to have higher thermostability than their biological counterparts. We have previously shown that we can reshape the energy landscape of a repeat protein by adding multiple C-terminal consensus ankyrin repeats to the five N-terminal repeats of the Notch ankyrin domain. Here we explore how the folding mechanism responds to reshaping of the energy landscape. We have used analogous substitutions of a conserved alanine with glycine in each repeat to determine the distribution of structure in the transitions state ensembles of constructs containing one (Nank1-5C 1 ) and two consensus (Nank1-5C 2 ) ankyrin repeats. Whereas folding of the wild-type Notch ankyrin domain is slowed by substitutions in its central repeats, 1 folding of Nank1-5C 1 and Nank1-5C 2 is slowed by substitutions in the C-terminal repeats. Thus, the addition of C-terminal stabilizing repeats shifts the transition state ensemble towards the C-terminal repeats, rerouting the folding pathway of the ankyrin repeat domain. These findings indicate that for the Notch ankyrin domain, folding pathways are selected based on local energetics.
There is considerable interest in generating proteins with both high stability and high activity for biomedical and industrial purposes. One approach that has been used successfully to increase the stability of linear repeat proteins is consensus design. It is unclear the extent over which the consensus design approach can be used to produce folded and hyperstable proteins, and importantly, whether such stabilized proteins would retain function. Here we extend the consensus strategy to design a globular protein. We show that a consensus-designed homeodomain (HD) sequence adopts a cooperatively folded homeodomain structure. The unfolding free energy of the consensus-HD is 5 kcal·mol−1 higher than that of the naturally-occurring engrailed-HD from Drosophila melanogaster. Remarkably, the consensus-HD binds the engrailed-HD cognate DNA in a similar mode as the engrailed-HD with approximately 100-fold higher affinity. 15N relaxation studies show a decrease in psec-nsec backbone dynamics in the free state of consensus-HD, suggesting that increased affinity is not a result of increased plasticity. In addition to demonstrating the potential for consensus design of globular proteins with increased stability, these results demonstrate that greatly stabilized proteins can bind cognate substrates with increased affinities, showing that high stability is compatible with function.
The traditional view of protein-ligand binding treats a protein as comprising distinct binding epitopes on the surface of a degenerate structural scaffold, largely ignoring the impact of a protein’s energy landscape. To determine the robustness of this simplification, we compared two small helix-turn-helix transcription factors with different energy landscapes. λ-repressor is stable and well folded, while MarA appears to be marginally stable with multiple native conformations (molten). While λ-repressor is known to tolerate any hydrophobic mutation in the core, we find MarA drastically less tolerant to core mutation. Moreover, core mutations in MarA (distant from the DNA-binding interface) change the relative affinities of its binding partners, altering ligand specificity. These results can be explained by taking into account the effects of mutations on the entire energy landscape and not just the native state. Thus, for proteins with multiple conformations that are close in energy, such as many intrinsically disordered proteins, residues distant from the active site can alter both binding affinity and specificity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.