A major bottleneck in protein structure prediction is the selection of correct models from a pool of decoys. Relative activities of ∼1,200 individual single-site mutants in a saturation library of the bacterial toxin CcdB were estimated by determining their relative populations using deep sequencing. This phenotypic information was used to define an empirical score for each residue (RankScore), which correlated with the residue depth, and identify active-site residues. Using these correlations, ∼98% of correct models of CcdB (RMSD ≤ 4Å) were identified from a large set of decoys. The model-discrimination methodology was further validated on eleven different monomeric proteins using simulated RankScore values. The methodology is also a rapid, accurate way to obtain relative activities of each mutant in a large pool and derive sequence-structure-function relationships without protein isolation or characterization. It can be applied to any system in which mutational effects can be monitored by a phenotypic readout.
Considerable mechanistic insight has been gained into amyloid aggregation; however, a large number of non-amyloid protein aggregates are considered "amorphous," and in most cases, little is known about their mechanisms. Amorphous aggregation of ␥-crystallins in the eye lens causes cataract, a widespread disease of aging. We combined simulations and experiments to study the mechanism of aggregation of two ␥D-crystallin mutants, W42R and W42Q: the former a congenital cataract mutation, and the latter a mimic of age-related oxidative damage. We found that formation of an internal disulfide was necessary and sufficient for aggregation under physiological conditions. Two-chain all-atom simulations predicted that one non-native disulfide in particular, between Cys 32 and Cys 41 , was likely to stabilize an unfolding intermediate prone to intermolecular interactions. Mass spectrometry and mutagenesis experiments confirmed the presence of this bond in the aggregates and its necessity for oxidative aggregation under physiological conditions in vitro. Mining the simulation data linked formation of this disulfide to extrusion of the N-terminal -hairpin and rearrangement of the native -sheet topology. Specific binding between the extruded hairpin and a distal -sheet, in an intermolecular chain reaction similar to domain swapping, is the most probable mechanism of aggregate propagation.
Considerable mechanistic insight has been gained into amyloid aggregation; however, a large class of non-amyloid protein aggregates are considered "amorphous," and in most cases little is known about their mechanisms. Amorphous aggregation of γ-crystallins in the eye lens causes a widespread disease of aging, cataract. We combined simulations and experiments to study the mechanism of aggregation of two γD-crystallin mutants, W42R and W42Q -the former a congenital cataract mutation, and the latter a mimic of age-related oxidative damage. We found that formation of an internal disulfide was necessary and sufficient for aggregation under physiological conditions. Twochain all-atom simulations predicted that one nonnative disulfide in particular, between Cys32 and Cys41, was likely to stabilize an unfolding intermediate prone to intermolecular interactions. Mass spectrometry and mutagenesis experiments confirmed the presence of this bond in the aggregates and its necessity for oxidative aggregation under physiological conditions in vitro.Mining the simulation data linked formation of this disulfide to extrusion of the N-terminal β-hairpin and rearrangement of the native β-sheet topology. Specific binding between the extruded hairpin and a distal β-sheet, in an intermolecular chain reaction similar to domain swapping, is the most probable mechanism of aggregate propagation. γD-crystallin aggregation precursor
Despite considerable efforts, no physical mechanism has been shown to explain N-terminal codon bias in prokaryotic genomes. Using a systematic study of synonymous substitutions in two endogenous E. coli genes, we show that interactions between the coding region and the upstream Shine-Dalgarno (SD) sequence modulate the efficiency of translation initiation, affecting both intracellular mRNA and protein levels due to the inherent coupling of transcription and translation in E. coli. We further demonstrate that far-downstream mutations can also modulate mRNA levels by occluding the SD sequence through the formation of non-equilibrium secondary structures. By contrast, a non-endogenous RNA polymerase that decouples transcription and translation largely alleviates the effects of synonymous substitutions on mRNA levels. Finally, a complementary statistical analysis of the E. coli genome specifically implicates avoidance of intra-molecular base pairing with the SD sequence. Our results provide general physical insights into the coding-level features that optimize protein expression in prokaryotes.
The catalytic conversion of adenosine triphosphate (ATP) and adenosine monophosphate (AMP) to adenosine diphosphate (ADP) by adenylate kinase (ADK) involves large amplitude, ligand induced domain motions, involving the opening and the closing of ATP binding domain (LID) and AMP binding domain (NMP) domains, during the repeated catalytic cycle. We discover and analyze an interesting dynamical coupling between the motion of the two domains during the opening, using large scale atomistic molecular dynamics trajectory analysis, covariance analysis, and multidimensional free energy calculations with explicit water. Initially, the LID domain must open by a certain amount before the NMP domain can begin to open. Dynamical correlation map shows interesting cross-peak between LID and NMP domain which suggests the presence of correlated motion between them. This is also reflected in our calculated two-dimensional free energy surface contour diagram which has an interesting elliptic shape, revealing a strong correlation between the opening of the LID domain and that of the NMP domain. Our free energy surface of the LID domain motion is rugged due to interaction with water and the signature of ruggedness is evident in the observed root mean square deviation variation and its fluctuation time correlation functions. We develop a correlated dynamical disorder-type theoretical model to explain the observed dynamic coupling between the motion of the two domains in ADK. Our model correctly reproduces several features of the cross-correlation observed in simulations.
Mutations in a microbial population can increase the frequency of a genotype not only by increasing its exponential growth rate, but also by decreasing its lag time or adjusting the yield (resource efficiency). The contribution of multiple life-history traits to selection is a critical question for evolutionary biology as we seek to predict the evolutionary fates of mutations. Here we use a model of microbial growth to show there are two distinct components of selection corresponding to the growth and lag phases, while the yield modulates their relative importance. The model predicts rich population dynamics when there are tradeoffs between phases: multiple strains can coexist or exhibit bistability due to frequency-dependent selection, and strains can engage in rock-paperscissors interactions due to non-transitive selection. We characterize the environmental conditions and patterns of traits necessary to realize these phenomena, which we show to be readily accessible to experiments. Our results provide a theoretical framework for analyzing high-throughput measurements of microbial growth traits, especially interpreting the pleiotropy and correlations between traits across mutants. This work also highlights the need for more comprehensive measurements of selection in simple microbial systems, where the concept of an ordinary fitness landscape breaks down.
Mutations provide the variation that drives evolution, yet their effects on fitness remain poorly understood. Here we explore how mutations in the essential enzyme Adenylate Kinase (Adk) of E. coli affect multiple phases of population growth. We introduce a biophysical fitness landscape for these phases, showing how they depend on molecular and cellular properties of Adk. We find that Adk catalytic capacity in the cell (product of activity and abundance) is the major determinant of mutational fitness effects. We show that bacterial lag times are at a well-defined optimum with respect to Adk’s catalytic capacity, while exponential growth rates are only weakly affected by variation in Adk. Direct pairwise competitions between strains show how environmental conditions modulate the outcome of a competition where growth rates and lag times have a tradeoff, altogether shedding light on the multidimensional nature of fitness and its importance in the evolutionary optimization of enzymes.
The signal peptide plays a key role in targeting and membrane insertion of secretory and membrane proteins in both prokaryotes and eukaryotes. In E. coli, recombinant proteins can be targeted to the periplasmic space by fusing naturally occurring signal sequences to their N-terminus. The model protein thioredoxin was fused at its N-terminus with malE and pelB signal sequences. While WT and the pelB fusion are soluble when expressed, the malE fusion was targeted to inclusion bodies and was refolded in vitro to yield a monomeric product with identical secondary structure to WT thioredoxin. The purified recombinant proteins were studied with respect to their thermodynamic stability, aggregation propensity and activity, and compared with wild type thioredoxin, without a signal sequence. The presence of signal sequences leads to thermodynamic destabilization, reduces the activity and increases the aggregation propensity, with malE having much larger effects than pelB. These studies show that besides acting as address labels, signal sequences can modulate protein stability and aggregation in a sequence dependent manner.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.