Protein collapse during folding is often assumed to be driven by a hydrophobic solvation energy (ΔGvdw) that scales linearly with solvent-accessible surface area (A). In a previous study, we argued that ΔGvdw, as well as its attractive (ΔGatt) and repulsive (ΔGrep) components, was not simply a linear function of A. We found that the surface tensions, γrep, γatt, and γvdw, gotten from ΔGrep, ΔGatt, and ΔGvdw against A for four configurations of deca-alanine differed from those obtained for a set of alkanes. In the present study, we extend our analysis to fifty decaglycine structures and atomic decompositions. We find that different configurations of decaglycine generate different estimates of γrep. Additionally, we considered the reconstruction of the solvation free energy from scaling the free energy of solvation of each atom type, free in solution. The free energy of the isolated atoms, scaled by the inverse surface area the atom would expose in the molecule does not reproduce the γrep for the intact decaglycines. Finally, γatt for the decaglycine conformations is much larger in magnitude than those for deca-alanine or the alkanes, leading to large negative values of γvdw (-74 and -56 cal/mol/Å(2) for CHARMM27 and AMBER ff12sb force fields, respectively). These findings imply that ΔGvdw favors extended rather than compact structures for decaglycine. We find that ΔGrep and ΔGvdw have complicated dependencies on multibody correlations between solute atoms, on the geometry of the molecular surface, and on the chemical identities of the atoms.
Molecular simulations can be used to study disordered polypeptide systems and to generate hypotheses on the underlying structural and thermodynamic mechanisms that govern their function. As the number of disordered protein systems investigated with simulations increase, it is important to understand how particular force fields affect the structural properties of disordered polypeptides in solution. To this end, we performed a comparative structural analysis of Gly3 and Gly10 in aqueous solution from all-atom, microsecond MD simulations using the CHARMM 27 (C27), CHARMM 36 (C36), and Amber ff12SB force fields. For each force field, Gly3 and Gly10 were simulated for at least 300 ns and 1 μs, respectively. Simulating oligoglycines of two different lengths allows us to evaluate how force field effects depend on polypeptide length. Using a variety of structural metrics (e.g. end-to-end distance, radius of gyration, dihedral angle distributions), we characterize the distribution of oligoglycine conformers for each force field and show that each sample conformation space differently, yielding considerably different structural tendencies of the same oligoglycine model in solution. Notably, we find that C36 samples more extended oligoglycine structures than both C27 and ff12SB.
Oligoglycine is a backbone mimic for all proteins and is prevalent in the sequences of intrinsically disordered proteins. We have computed the absolute chemical potential of glycine oligomers at infinite dilution by simulation with the CHARMM36 and Amber ff12SB force fields. We performed a thermodynamic decomposition of the solvation free energy (ΔG(sol)) of Gly2-5 into enthalpic (ΔH(sol)) and entropic (ΔS(sol)) components as well as their van der Waals and electrostatic contributions. Gly2-5 was either constrained to a rigid/extended conformation or allowed to be completely flexible during simulations to assess the effects of flexibility on these thermodynamic quantities. For both rigid and flexible oligoglycine models, the decrease in ΔG(sol) with chain length is enthalpically driven with only weak entropic compensation. However, the apparent rates of decrease of ΔG(sol), ΔH(sol), ΔS(sol), and their elec and vdw components differ for the rigid and flexible models. Thus, we find solvation entropy does not drive aggregation for this system and may not explain the collapse of long oligoglycines. Additionally, both force fields yield very similar thermodynamic scaling relationships with respect to chain length despite both force fields generating different conformational ensembles of various oligoglycine chains.
Conformational entropy is expected to contribute significantly to the thermodynamics of structural transitions in intrinsically disordered proteins or regions in response to protein/ligand binding, posttranslational modifications, and environmental changes. We calculated the backbone (dihedral) conformational entropy of oligoglycine (Gly), a protein backbone mimic and model intrinsically disordered region, as a function of chain length (N=3, 4, 5, 10, and 15) from simulations using three different approaches. The backbone conformational entropy scales linearly with chain length with a slope consistent with the entropy of folding of well-structured proteins. The entropic contributions of second-order dihedral correlations are predominantly through intraresidue ϕ-ψ pairs, suggesting that oligoglycine may be thermodynamically modeled as a system of independent glycine residues. We find the backbone conformational entropy to be largely independent of global structural parameters, like the end-to-end distance and radius of gyration. We introduce a framework referred to herein as "ensemble confinement" to estimate the loss (gain) of conformational free energy and its entropic component when individual residues are constrained to (released from) particular regions of the ϕ-ψ map. Quantitatively, we show that our protein backbone model resists ordering/folding with a significant, unfavorable ensemble confinement free energy because of the loss of a substantial portion of the absolute backbone entropy. Proteins can couple this free-energy reservoir to distal binding events as a regulatory mechanism to promote or suppress binding.
To search for evolutionary forces that might act upon transcript length, we use the singular value decomposition (SVD) to identify the length distribution functions of sets and subsets of human and yeast transcripts from profiles of mRNA abundance levels across gel electrophoresis migration distances that were previously measured by DNA microarrays. We show that the SVD identifies the transcript length distribution functions as “asymmetric generalized coherent states” from the DNA microarray data and with no a-priori assumptions. Comparing subsets of human and yeast transcripts of the same gene ontology annotations, we find that in both disparate eukaryotes, transcripts involved in protein synthesis or mitochondrial metabolism are significantly shorter than typical, and in particular, significantly shorter than those involved in glucose metabolism. Comparing the subsets of human transcripts that are overexpressed in glioblastoma multiforme (GBM) or normal brain tissue samples from The Cancer Genome Atlas, we find that GBM maintains normal brain overexpression of significantly short transcripts, enriched in transcripts that are involved in protein synthesis or mitochondrial metabolism, but suppresses normal overexpression of significantly longer transcripts, enriched in transcripts that are involved in glucose metabolism and brain activity. These global relations among transcript length, cellular metabolism and tumor development suggest a previously unrecognized physical mode for tumor and normal cells to differentially regulate metabolism in a transcript length-dependent manner. The identified distribution functions support a previous hypothesis from mathematical modeling of evolutionary forces that act upon transcript length in the manner of the restoring force of the harmonic oscillator.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.