Mutations in the hydrophobic interior of proteins are generally thought to weaken the interactions only in their immediate neighborhood. This forms the basis of protein engineering-based studies of folding mechanism and function. However, mutational work on diverse proteins has shown that distant residues are thermodynamically coupled, with the network of interactions within the protein acting as signal conduits, thus raising an intriguing paradox. Are mutational effects localized, and if not, is there a general rule for the extent of percolation and the functional form of this propagation? We explore these questions from multiple perspectives in this work. Perturbation analysis of interaction networks within proteins and microsecond long molecular dynamics simulations of several aliphatic mutants of ubiquitin reveal strong evidence of the distinct alteration of distal residue-residue communication networks. We find that mutational effects consistently propagate into the second shell of the altered site (even up to 15-20 Å) in proportion to the perturbation magnitude and dissipate exponentially with a decay distance constant of ∼4-5 Å. We also report evidence for this phenomenon from published experimental nuclear magnetic resonance data that strikingly resemble predictions from network theory and molecular dynamics simulations. Reformulating these observations onto a statistical mechanical model, we reproduce the stability changes of 375 mutations from 19 single-domain proteins. Our work thus reveals a robust energy dissipation-cum-signaling mechanism in the interaction network within proteins, quantifies the partitioning of destabilization energetics around the mutation neighborhood, and presents a simple theoretical framework for modeling the allosteric effects of point mutations.
The amplitude of thermodynamic fluctuations in biological macromolecules determines their conformational behavior, dimensions, nature of phase transitions and effectively their specificity and affinity, thus contributing to fine-tuned molecular recognition. Unique among large-scale conformational changes in proteins are temperature-induced collapse transitions in intrinsically disordered proteins (IDPs). Here, we show that CytR DNA-binding domain, an IDP that folds on binding DNA, undergoes a coil-to-globule transition with temperature in the absence of DNA while exhibiting energetically decoupled local and global structural rearrangements, and maximal thermodynamic fluctuations at the optimal bacterial growth temperature. The collapse is shown to be a continuous transition through a combination of statistical-mechanical modeling and all-atom implicit solvent simulations. Surprisingly, CytR binds single-site cognate DNA with negative cooperativity, described by Hill coefficients less than one, resulting in a graded binding response. We show that heterogeneity arising from varying binding-competent CytR conformations or orientations at the single-molecular level contributes to negative binding cooperativity at the level of bulk measurements due to the conflicting requirements of collapse transition, large fluctuations and folding-upon-binding. Our work reports strong evidence for functionally driven thermodynamic fluctuations in determining the extent of collapse and disorder with implications in protein search efficiency of target DNA sites and regulation.
DNA-binding protein domains (DBDs) sample diverse conformations in equilibrium facilitating the search and recognition of specific sites on DNA over millions of energetically degenerate competing sites. We hypothesize that DBDs have co-evolved to sense and exploit the strong electric potential from the array of negatively charged phosphate groups on DNA. We test our hypothesis by employing the intrinsically disordered DBD of cytidine repressor (CytR) as a model system. CytR displays a graded increase in structure, stability and folding rate on increasing the osmolarity of the solution that mimics the non-specific screening by DNA phosphates. Electrostatic calculations and an Ising-like statistical mechanical model predict that CytR exhibits features of an electric potential sensor modulating its dimensions and landscape in a unique distance-dependent manner, while DNA plays the role of a non-specific macromolecular chaperone. Accordingly, CytR binds its natural half-site faster than the diffusion-controlled limit and even random DNA conforming to an electrostatic-steering binding mechanism. Our work unravels for the first time the synergistic features of a natural electrostatic potential sensor, a novel binding mechanism driven by electrostatic frustration and disorder, and the role of DNA in promoting distance-dependent protein structural transitions critical for switching between specific and non-specific DNA-binding modes.
Intrinsically disordered proteins (IDPs) and proteins with a large degree of disorder are abundant in the proteomes of eukaryotes and viruses, and play a vital role in cellular homeostasis and disease. One fundamental question that has been raised on IDPs is the process by which they offset the entropic penalty involved in transitioning from a heterogeneous ensemble of conformations to a much smaller collection of binding-competent states. However, this has been a difficult problem to address, as the effective entropic cost of fixing residues in a folded-like conformation from disordered amino acid neighborhoods is itself not known. Moreover, there are several examples where the sequence complexity of disordered regions is as high as well-folded regions. Disorder in such cases therefore arises from excess conformational entropy determined entirely by correlated sequence effects, an entropic code that is yet to be identified. Here, we explore these issues by exploiting the order-disorder transitions of a helix in Pbx-Homeodomain together with a dual entropy statistical mechanical model to estimate the magnitude and sign of the excess conformational entropy of residues in disordered regions. We find that a mere 2.1-fold increase in the number of allowed conformations per residue (∼0.7kBT favoring the unfolded state) relative to a well-folded sequence, or ∼2(N) additional conformations for a N-residue sequence, is sufficient to promote disorder under physiological conditions. We show that this estimate is quite robust and helps in rationalizing the thermodynamic signatures of disordered regions in important regulatory proteins, modeling the conformational folding-binding landscapes of IDPs, quantifying the stability effects characteristic of disordered protein loops and their subtle roles in determining the partitioning of folding flux in ordered domains. In effect, the dual entropy model we propose provides a statistical thermodynamic basis for the relative conformational propensities of amino acids in folded and disordered environments in proteins. Our work thus lays the foundation for understanding and quantifying protein disorder through measures of excess conformational entropy.
Statistical mechanical models that afford an intermediate resolution between macroscopic chemical models and all-atom simulations have been successful in capturing folding behaviors of many small single-domain proteins. However, the applicability of one such successful approach, the Wako-Saitô-Muñoz-Eaton (WSME) model, is limited by the size of the protein as the number of conformations grows exponentially with protein length. In this work, we surmount this size limitation by introducing a novel approximation that treats stretches of 3 or 4 residues as blocks, thus reducing the phase space by nearly three orders of magnitude. The performance of the ‘bWSME’ model is validated by comparing the predictions for a globular enzyme (RNase H) and a repeat protein (IκBα), against experimental observables and the model without block approximation. Finally, as a proof of concept, we predict the free-energy surface of the 370-residue, multi-domain maltose binding protein and identify an intermediate in good agreement with single-molecule force-spectroscopy measurements. The bWSME model can thus be employed as a quantitative predictive tool to explore the conformational landscapes of large proteins, extract the structural features of putative intermediates, identify parallel folding paths, and thus aid in the interpretation of both ensemble and single-molecule experiments.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.