Commercial enzymatic processes require robust catalysts able to withstand elevated temperatures and long incubations, conditions under which most native enzymes perform poorly. Incremental increases in thermostability can be achieved by repeated rounds of mutagenesis and screening, but general strategies are needed for designing highly thermostable enzymes a priori. Here we show that enzymes can be created that can withstand temperatures ~ 30 °C higher and incubations ≥ 100 times longer than extant forms in a single step using ancestral reconstruction. We exemplify the approach with the first ancestral resurrections of two unrelated enzyme families: cytochrome P450 monooxygenases, that stereo-and regioselectively functionalize un-activated C-H bonds in pharmaceutical, flavour, fragrance and other fine chemical syntheses; and ketol acid reductoisomerases, used to make butanol-based biofuels. This shows thermostability can be designed into proteins using sequence data alone, potentially enhancing the economic feasibility of any process or product requiring a highly stable protein.
Ancestral sequence reconstruction is a technique that is gaining widespread use in molecular evolution studies and protein engineering. Accurate reconstruction requires the ability to handle appropriately large numbers of sequences, as well as insertion and deletion (indel) events, but available approaches exhibit limitations. To address these limitations, we developed Graphical Representation of Ancestral Sequence Predictions (GRASP), which efficiently implements maximum likelihood methods to enable the inference of ancestors of families with more than 10,000 members. GRASP implements partial order graphs (POGs) to represent and infer insertion and deletion events across ancestors, enabling the identification of building blocks for protein engineering. To validate the capacity to engineer novel proteins from realistic data, we predicted ancestor sequences across three distinct enzyme families: glucose-methanol-choline (GMC) oxidoreductases, cytochromes P450, and dihydroxy/sugar acid dehydratases (DHAD). All tested ancestors demonstrated enzymatic activity. Our study demonstrates the ability of GRASP (1) to support large data sets over 10,000 sequences and (2) to employ insertions and deletions to identify building blocks for engineering biologically active ancestors, by exploring variation over evolutionary time.
The structure of metabolites of drug candidates must frequently be characterised during drug discovery and development. However, synthesising metabolites with the correct stereoselective modifications can be challenging for chemically complex parent compounds. Biocatalysis using human drug‐metabolising enzymes, such as cytochrome P450 2D6 (CYP2D6) is an alternative to chemical synthesis. However, most natural enzymes are unstable and have poor efficiency, limiting yields in preparative biotransformations. The aim of this study was to develop a library of robust mutant CYP2D enzymes for biocatalysis. The CLADE (combinatorial libraries of ancestors for directed evolution) approach increased the stability of CYP2D mutants obtained by DNA shuffling using three extant CYP2D forms. The resulting mutants showed divergent profiles of activity towards typical CYP2D substrates and included thermostable forms that may be useful for the further evolution of biocatalysts for specific applications.
The cytochrome P450 family 1 enzymes (CYP1s) are a diverse family of hemoprotein monooxygenases which metabolize many xenobiotics including numerous environmental carcinogens. However, their historical function and evolution remain largely unstudied. Here we investigate CYP1 evolution via the reconstruction and characterization of the vertebrate CYP1 ancestors. Younger ancestors and extant forms generally demonstrated higher activity towards typical CYP1 xenobiotic and steroid substrates than older ancestors, suggesting significant diversification away from the original CYP1 function. Caffeine metabolism appears to be a recently evolved trait of the CYP1A subfamily, observed in the mammalian CYP1A lineage, and may parallel the recent evolution of caffeine synthesis in multiple separate plant species. Likewise, the aryl hydrocarbon receptor agonist, 6-formylindolo[3,2-b]carbazole (FICZ) was metabolised to a greater extent by certain younger ancestors and extant forms, suggesting that activity towards FICZ increased in specific CYP1 evolutionary branches, a process that may have occurred in parallel to the exploitation of land where UV-exposure was higher than in aquatic environments. As observed with previous reconstructions of P450 enzymes, thermostability correlated with evolutionary age; the oldest ancestor was up to 35 °C more thermostable than the extant forms, with a 10T50 (temperature at which 50% of the haemoprotein remains intact after 10 min) of 71 °C. This robustness may have facilitated evolutionary diversification of the CYP1s by buffering the destabilizing effects of mutations that conferred novel functions, a phenomenon which may also be useful in exploiting the catalytic versatility of these ancestral enzymes for commercial application as biocatalysts.
We developed Graphical Representation of Ancestral Sequence Predictions (GRASP) to infer and explore ancestral variants of protein families with more than 10,000 members. GRASP uses partial order graphs to represent homology in very large datasets, which are intractable with current inference tools and may, for example, be used to engineer proteins by identifying ancient variants of enzymes. We demonstrate that (1) across three distinct enzyme families, GRASP predicts ancestor sequences, all of which demonstrate enzymatic activity, (2) within-family insertions and deletions can be used as building blocks to support the engineering of biologically active ancestors via a new source of ancestral variation, and (3) generous inclusion of sequence data encompassing great diversity leads to less variance in ancestor sequence.GRASP is the central tool in the GRASP-suite, which is freely available at
Cytochromes P450 are found throughout the biosphere in a wide range of environments, serving a multitude of physiological functions. The ubiquity of the P450 fold suggests that it has been co-opted by evolution many times, and likely presents a useful compromise between structural stability and conformational flexibility. The diversity of substrates metabolized and reactions catalyzed by P450s makes them attractive starting materials for use as biocatalysts of commercially useful reactions. However, process conditions impose different requirements on enzymes to those in which they have evolved naturally. Most natural environments are relatively mild, and therefore most P450s have not been selected in Nature for the ability to withstand temperatures above ~40°C, yet industrial processes frequently require extended incubations at much higher temperatures. Thus, there has been considerable interest and effort invested in finding or engineering thermostable P450 systems. Numerous P450s have now been identified in thermophilic organisms and analysis of their structures provides information as to mechanisms by which the P450 fold can be stabilized. In addition, protein engineering, particularly by directed or artificial evolution, has revealed mutations that serve to stabilize particular mesophilic enzymes of interest. Here we review the current understanding of thermostability as it applies to the P450 fold, gleaned from the analysis of P450s characterized from thermophilic organisms and the parallel engineering of mesophilic forms for greater thermostability. We then present a perspective on how this information might be used to design stable P450 enzymes for industrial application. This article is part of a Special Issue entitled: Cytochrome P450 biodiversity and biotechnology, edited by Erika Plettner, Gianfranco Gilardi, Luet Wong, Vlada Urlacher, Jared Goldstone.
The survival of Salmonella typhimurium was investigated in acidogenic, anaerobically fermented pig wastes and in synthetic media, each containing volatile fatty acids (VFA). Salm. typhimurium survived at pH 6.8, but not at pH 4.0, when incubated at 37 degrees C for 24 h in either fermented or synthetic medium containing VFA. The minimum inhibiting concentration of VFA for Salm. typhimurium after 48 h incubation at 30 degrees C at pH 4.0 was 0.03 mol/l and for Escherichia coli it was 0.09 mol/l. Fermented pig wastes in a digester, maintained at pH 5.9, were inoculated with Salm. typhimurium and then incubated at 37 degrees C for 24 h. The pH was adjusted to either 4.0 or 5.0 and after a further 48 h at 30 degrees C, Salm. typhimurium survived at pH 5.0 but not at pH 4.0. It was concluded that pH is critical in determining the survival of this organism in acidogenic anaerobically fermented pig waste.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.