Commercial enzymatic processes require robust catalysts able to withstand elevated temperatures and long incubations, conditions under which most native enzymes perform poorly. Incremental increases in thermostability can be achieved by repeated rounds of mutagenesis and screening, but general strategies are needed for designing highly thermostable enzymes a priori. Here we show that enzymes can be created that can withstand temperatures ~ 30 °C higher and incubations ≥ 100 times longer than extant forms in a single step using ancestral reconstruction. We exemplify the approach with the first ancestral resurrections of two unrelated enzyme families: cytochrome P450 monooxygenases, that stereo-and regioselectively functionalize un-activated C-H bonds in pharmaceutical, flavour, fragrance and other fine chemical syntheses; and ketol acid reductoisomerases, used to make butanol-based biofuels. This shows thermostability can be designed into proteins using sequence data alone, potentially enhancing the economic feasibility of any process or product requiring a highly stable protein.
Ancestral sequence reconstruction is a technique that is gaining widespread use in molecular evolution studies and protein engineering. Accurate reconstruction requires the ability to handle appropriately large numbers of sequences, as well as insertion and deletion (indel) events, but available approaches exhibit limitations. To address these limitations, we developed Graphical Representation of Ancestral Sequence Predictions (GRASP), which efficiently implements maximum likelihood methods to enable the inference of ancestors of families with more than 10,000 members. GRASP implements partial order graphs (POGs) to represent and infer insertion and deletion events across ancestors, enabling the identification of building blocks for protein engineering.
To validate the capacity to engineer novel proteins from realistic data, we predicted ancestor sequences across three distinct enzyme families: glucose-methanol-choline (GMC) oxidoreductases, cytochromes P450, and dihydroxy/sugar acid dehydratases (DHAD). All tested ancestors demonstrated enzymatic activity. Our study demonstrates the ability of GRASP (1) to support large data sets over 10,000 sequences and (2) to employ insertions and deletions to identify building blocks for engineering biologically active ancestors, by exploring variation over evolutionary time.
The structure of metabolites of drug candidates must frequently be characterised during drug discovery and development. However, synthesising metabolites with the correct stereoselective modifications can be challenging for chemically complex parent compounds. Biocatalysis using human drug‐metabolising enzymes, such as cytochrome P450 2D6 (CYP2D6) is an alternative to chemical synthesis. However, most natural enzymes are unstable and have poor efficiency, limiting yields in preparative biotransformations. The aim of this study was to develop a library of robust mutant CYP2D enzymes for biocatalysis. The CLADE (combinatorial libraries of ancestors for directed evolution) approach increased the stability of CYP2D mutants obtained by DNA shuffling using three extant CYP2D forms. The resulting mutants showed divergent profiles of activity towards typical CYP2D substrates and included thermostable forms that may be useful for the further evolution of biocatalysts for specific applications.
The cytochrome P450 family 1 enzymes (CYP1s) are a diverse family of hemoprotein monooxygenases which metabolize many xenobiotics including numerous environmental carcinogens. However, their historical function and evolution remain largely unstudied. Here we investigate CYP1 evolution via the reconstruction and characterization of the vertebrate CYP1 ancestors. Younger ancestors and extant forms generally demonstrated higher activity towards typical CYP1 xenobiotic and steroid substrates than older ancestors, suggesting significant diversification away from the original CYP1 function. Caffeine metabolism appears to be a recently evolved trait of the CYP1A subfamily, observed in the mammalian CYP1A lineage, and may parallel the recent evolution of caffeine synthesis in multiple separate plant species. Likewise, the aryl hydrocarbon receptor agonist, 6-formylindolo[3,2-b]carbazole (FICZ) was metabolised to a greater extent by certain younger ancestors and extant forms, suggesting that activity towards FICZ increased in specific CYP1 evolutionary branches, a process that may have occurred in parallel to the exploitation of land where UV-exposure was higher than in aquatic environments. As observed with previous reconstructions of P450 enzymes, thermostability correlated with evolutionary age; the oldest ancestor was up to 35 °C more thermostable than the extant forms, with a 10T50 (temperature at which 50% of the haemoprotein remains intact after 10 min) of 71 °C. This robustness may have facilitated evolutionary diversification of the CYP1s by buffering the destabilizing effects of mutations that conferred novel functions, a phenomenon which may also be useful in exploiting the catalytic versatility of these ancestral enzymes for commercial application as biocatalysts.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.