The success of protein evolution campaigns is strongly dependent on the sequence context in which mutations are introduced, stemming from pervasive non-additive interactions between a protein’s amino acids (‘intra-gene epistasis’). Our limited understanding of such epistasis hinders the correct prediction of the functional contributions and adaptive potential of mutations. Here we present a straightforward unique molecular identifier (UMI)-linked consensus sequencing workflow (UMIC-seq) that simplifies mapping of evolutionary trajectories based on full-length sequences. Attaching UMIs to gene variants allows accurate consensus generation for closely related genes with nanopore sequencing. We exemplify the utility of this approach by reconstructing the artificial phylogeny emerging in three rounds of directed evolution of an amine dehydrogenase biocatalyst via ultrahigh throughput droplet screening. Uniquely, we are able to identify lineages and their founding variant, as well as non-additive interactions between mutations within a full gene showing sign epistasis. Access to deep and accurate long reads will facilitate prediction of key beneficial mutations and adaptive potential based on in silico analysis of large sequence datasets.
Successful screening of enzyme libraries in functional metagenomics and directed evolution becomes more likely after uniform cell growth in droplets amplifies genotype and phenotype.
We evolved the thermophilic β-glucosidase CsBglA to 150% improvement of kcat/KM at 55 °C, exerting a specific activity of 504 U mg−1 and a KM-value of 37.1 mM, making it the bacterial β-glucosidase with highest activity known so far.
Enzyme discovery and directed evolution are the two major contemporary approaches for the improvement of industrial processes by biocatalysis in various fields. Customization of catalysts for improvement of single enzyme reactions or de novo reaction development is often complex and tedious. The success of screening campaigns relies on the fraction of sequence space that can be sampled, whether for evolving a particular enzyme or screening metagenomes. Ultrahigh-throughput screening (uHTS) based on in vitro compartmentalization in water-in-oil emulsion of picoliter droplets generated in microfluidic systems allows screening rates >1 kHz (or >10 7 per day). Screening for carbohydrate-active enzymes (CAZymes) catalyzing biotechnologically valuable reactions in this format presents an additional challenge because the released carbohydrates are difficult to monitor in high throughput. Activated substrates with large optically active hydrophobic leaving groups provide a generic optical readout, but the molecular recognition properties of sugars will be altered by the incorporation of such fluoro-or chromophores and their typically higher reactivity, as leaving groups with lowered pK a values compared to native substrates make the observation of promiscuous reactions more likely. To overcome these issues, we designed microdroplet assays in which optically inactive carbohydrate products are made visible by specific cascades: the primary reaction of an unlabeled substrate leads to an optical signal downstream. Successfully implementing such assays at the picoliter droplet scale allowed us to detect glucose, xylose, glucuronic acid, and arabinose as final products of complex oligosaccharide degradation by glycoside hydrolases by absorbance measurements. Enabling the use of uHTS for screening CAZyme reactions that have been thus far elusive will chart a route toward faster and easier development of specific and efficient biocatalysts for biovalorization, directing enzyme discovery by challenging catalysts for reaction with natural rather than model substrates.
The exploration of large DNA libraries of metagenomic or synthetic origin is greatly facilitated by ultrahigh-throughput assays that use monodisperse water-in-oil emulsion droplets as sequestered reaction compartments. Millions of samples can be generated and analysed in microfluidic devices at kHz speeds, requiring only micrograms of reagents. The scope of this powerful platform for the discovery of new sequence space is, however, hampered by the limited availability of assay substrates, restricting the functions and reaction types that can be investigated. Here, we broaden the scope of detectable biochemical transformations in droplet microfluidics by introducing the first fluorogenic assay for alcohol dehydrogenases (ADHs) in this format. We have synthesized substrates that release a pyranine fluorophore (8-hydroxy-1,3,6-pyrenetrisulfonic acid, HPTS) when enzymatic turnover occurs. Pyranine is well retained in droplets for > 6 weeks (i. e. 14-times longer than fluorescein), avoiding product leakage and ensuring excellent assay sensitivity. Product concentrations as low as 100 nM were successfully detected, corresponding to less than one turnover per enzyme molecule on average. The potential of our substrate design was demonstrated by efficient recovery of a bona fide ADH with an > 800-fold enrichment. The repertoire of droplet screening is enlarged by this sensitive and direct fluorogenic assay to identify dehydrogenases for biocatalytic applications.
Here we present a straightforward unique molecular identifier (UMI)-linked nanopore consensus sequencing workflow (UMIC-seq), resulting in cost-effective and accurate long-read sequencing of amplicons. Short random molecular barcodes (i.e. unique molecular identifiers, UMIs) are attached to a pool of gene variants prior to nanopore sequencing to enable reliable clustering and the generation of accurate consensus sequences, even when starting from highly similar gene variants (e.g. a library of point mutants in directed protein evolution) that could not be reliably distinguished in the ordinary nanopore sequencing output.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.