Engineered orthogonal translation systems have greatly enabled the expansion of the genetic code using noncanonical amino acids (NCAAs). However, the impact of NCAAs on organismal evolution remains unclear, in part because it is difficult to force the adoption of new genetic codes in organisms. By reengineering TEM-1 β-lactamase to be dependent on a NCAA, we maintained bacterial NCAA dependence for hundreds of generations without escape.
Allostery is a fundamental biophysical mechanism that underlies cellular sensing, signaling, and metabolism. Yet a quantitative understanding of allosteric genotype‐phenotype relationships remains elusive. Here, we report the large‐scale measurement of the genotype‐phenotype landscape for an allosteric protein: the lac repressor from Escherichia coli , LacI. Using a method that combines long‐read and short‐read DNA sequencing, we quantitatively measure the dose‐response curves for nearly 10 5 variants of the LacI genetic sensor. The resulting data provide a quantitative map of the effect of amino acid substitutions on LacI allostery and reveal systematic sequence‐structure‐function relationships. We find that in many cases, allosteric phenotypes can be quantitatively predicted with additive or neural‐network models, but unpredictable changes also occur. For example, we were surprised to discover a new band‐stop phenotype that challenges conventional models of allostery and that emerges from combinations of nearly silent amino acid substitutions.
Allostery is a fundamental biophysical mechanism that underlies cellular sensing, signaling, and metabolism. Yet a quantitative understanding of allosteric genotype-phenotype relationships remains elusive. Here we report the large-scale measurement of the genotype-phenotype landscape for an allosteric protein: the lac repressor from Escherichia coli, LacI. Using a method that combines long-read and short-read DNA sequencing, we quantitatively measure the dose-response curves for nearly 10^5 variants of the LacI genetic sensor. The resulting data provide a quantitative map of the effect of amino acid substitutions on LacI allostery and reveal systematic sequence-structure-function relationships. We find that in many cases, allosteric phenotypes can be quantitatively predicted with additive or neural-network models, but unpredictable changes also occur. For example, we were surprised to discover a new band-stop phenotype that challenges conventional models of allostery and that emerges from combinations of nearly silent amino acid substitutions.
Measuring information transmission from stimulus to response is useful for evaluating the signaling fidelity of biochemical reaction networks (BRNs) in cells. Quantification of information transmission can reveal the optimal input stimuli environment for a BRN and the rate at which the signaling fidelity decreases for non-optimal input probability distributions. Here we present sparse estimation of mutual information landscapes (SEMIL), a method to quantify information transmission through cellular BRNs using commonly available data for single-cell gene expression output, across a design space of possible input distributions. We validate SEMIL and use it to analyze several engineered cellular sensing systems to demonstrate the impact of reaction pathways and rate constants on mutual information landscapes.
Engineering antibodies to utilize non-canonical amino acids (NCAA) should greatly expand the utility of an already important biological reagent. In particular, introducing crosslinking reagents into antibody complementarity determining regions (CDRs) should provide a means to covalently crosslink residues at the antibody–antigen interface. Unfortunately, finding the optimum position for crosslinking two proteins is often a matter of iterative guessing, even when the interface is known in atomic detail. Computer-aided antibody design can potentially greatly restrict the number of variants that must be explored in order to identify successful crosslinking sites. We have therefore used Rosetta to guide the introduction of an oxidizable crosslinking NCAA, l-3,4-dihydroxyphenylalanine (l-DOPA), into the CDRs of the anti-protective antigen scFv antibody M18, and have measured crosslinking to its cognate antigen, domain 4 of the anthrax protective antigen. Computed crosslinking distance, solvent accessibility, and interface energetics were three factors considered that could impact the efficiency of l-DOPA-mediated crosslinking. In the end, 10 variants were synthesized, and crosslinking efficiencies were generally 10% or higher, with the best variant crosslinking to 52% of the available antigen. The results suggest that computational analysis can be used in a pipeline for engineering crosslinking antibodies. The rules learned from l-DOPA crosslinking of antibodies may also be generalizable to the formation of other crosslinked interfaces and complexes.
Since the fixation of the genetic code, evolution has largely been confined to 20 proteinogenic amino acids. The development of orthogonal translation systems that allow for the codon-specific incorporation of noncanonical amino acids may provide a means to expand the code, but these translation systems cannot be simply superimposed on cells that have spent billions of years optimizing their genomes with the canonical code. We have therefore carried out directed evolution experiments with an orthogonal translation system that inserts 3-nitro-L-tyrosine across from amber codons, creating a 21 amino acid genetic code in which the amber stop codon ambiguously encodes either 3-nitro-L-tyrosine or stop. The 21 amino acid code is enforced through the inclusion of an addicted, essential gene, a beta-lactamase dependent upon 3-nitro-L-tyrosine incorporation. After 2000 generations of directed evolution, the fitness deficit of the original strain was largely repaired through mutations that limited the toxicity of the noncanonical. While the evolved lineages had not resolved the ambiguous coding of the amber codon, the improvements in fitness allowed new amber codons to populate protein coding sequences.
Allostery is a fundamental biophysical mechanism that underlies cellular sensing, signaling, and metabolism. Quantitative methods to characterize the genotype-phenotype relationships for allosteric proteins would provide data needed to improve engineering of biological systems, to uncover the role of allosteric mis-regulation in disease, and to develop allosterically targeted drugs1. Here we report the large-scale measurement of the genotype-phenotype landscape for an allosteric protein: the lac repressor from Escherichia coli, LacI. Using a method that combines long-read and short-read DNA sequencing, we quantitatively determine the dose-response curves for nearly 105 variants of the LacI sensor. With the resulting data, we train a deep neural network (DNN) capable of predicting the dose-response curves for additional LacI genotypes in silico. We then map the impact of amino acid substitutions on the allosteric function of LacI. Additionally, we demonstrate engineering of allosteric function with unprecedented accuracy by identifying LacI variants from the measured landscape with quantitatively specified dose-response curves. Finally, we discover sensors with previously unreported band-stop dose-response curves. Overall, our results provide the first high-coverage, quantitative view of genotype-phenotype relationships for an allosteric protein, revealing a surprising diversity of phenotypes and showing that each phenotype is accessible via multiple distinct genotypes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.