In molecular evolutionary analyses, short DNA sequences are used to infer phylogenetic relationships among species. Here we apply this principle to the study of bacterial biosynthesis, enabling the targeted isolation of previously unidentified natural products directly from complex metagenomes. Our approach uses short natural product sequence tags derived from conserved biosynthetic motifs to profile biosynthetic diversity in the environment and then guide the recovery of gene clusters from metagenomic libraries. The methodology is conceptually simple, requires only a small investment in sequencing, and is not computationally demanding. To demonstrate the power of this approach to natural product discovery we conducted a computational search for epoxyketone proteasome inhibitors within 185 globally distributed soil metagenomes. This led to the identification of 99 unique epoxyketone sequence tags, falling into 6 phylogenetically distinct clades. Complete gene clusters associated with nine unique tags were recovered from four saturating soil metagenomic libraries. Using heterologous expression methodologies, seven potent epoxyketone proteasome inhibitors (clarepoxcins A-E and landepoxcins A and B) were produced from these pathways, including compounds with different warhead structures and a naturally occurring halohydrin prodrug. This study provides a template for the targeted expansion of bacterially derived natural products using the global metagenome.T he advent of cost-effective high-throughput sequencing and an increasingly sophisticated understanding of bacterial secondary metabolite biosynthesis have led to two important revelations with respect to the search for new natural products: first, that the biosynthetic potential of most cultured bacteria, as judged by the number of biosynthetic gene clusters (BGCs) observed in sequenced genomes, is far greater than previously estimated (1, 2); second, that the number of bacterial species in most environments is at least 100× greater than the number of species that is readily cultured (3, 4). These observations suggest that conventional "phenotype-first" natural products isolation approaches have only examined a small fraction of earth's bacterial biosynthetic potential.There are now a number of genomic search engines available that allow researchers to rapidly scan microbial whole genome sequences for BGCs encoding new natural products (5-7). Unfortunately, the large DNA contigs that these search strategies require as input are not readily available from complex metagenomes. In response to the need for a more robust metagenomic search strategy, our group recently developed an informatics platform called eSNaPD (8, 9) (environmental Surveyor of Natural Product Diversity) with the specific aim of facilitating sequence-guided discovery of new bacterial natural products from complex metagenomes (Fig. 1).The eSNaPD software is designed to bioinformatically assess short DNA sequences that have been amplified from environmental metagenomes by degenerate PCR targeting c...
Large-scale sequencing of prokaryotic (meta)genomic DNA suggests that most bacterial natural product gene clusters are not expressed under common laboratory culture conditions. Silent gene clusters represent a promising resource for natural product discovery and the development of a new generation of therapeutics. Unfortunately, the characterization of molecules encoded by these clusters is hampered owing to our inability to express these gene clusters in the laboratory. To address this bottleneck, we have developed a promoter-engineering platform to transcriptionally activate silent gene clusters in a model heterologous host. Our approach uses yeast homologous recombination, an auxotrophy complementation-based yeast selection system and sequence orthogonal promoter cassettes to exchange all native promoters in silent gene clusters with constitutively active promoters. As part of this platform, we constructed and validated a set of bidirectional promoter cassettes consisting of orthogonal promoter sequences, Streptomyces ribosome binding sites, and yeast selectable marker genes. Using these tools we demonstrate the ability to simultaneously insert multiple promoter cassettes into a gene cluster, thereby expediting the reengineering process. We apply this method to model active and silent gene clusters (rebeccamycin and tetarimycin) and to the silent, cryptic pseudogene-containing, environmental DNA-derived Lzr gene cluster. Complete promoter refactoring and targeted gene exchange in this "dead" cluster led to the discovery of potent indolotryptoline antiproliferative agents, lazarimides A and B. This potentially scalable and cost-effective promoter reengineering platform should streamline the discovery of natural products from silent natural product biosynthetic gene clusters.promoter engineering | indolotryptoline | environmental DNA
Rifamycin antibiotics (Rifs) target bacterial RNA polymerases (RNAPs) and are widely used to treat infections including tuberculosis. The utility of these compounds is threatened by the increasing incidence of resistance (RifR). As resistance mechanisms found in clinical settings may also occur in natural environments, here we postulated that bacteria could have evolved to produce rifamycin congeners active against clinically relevant resistance phenotypes. We survey soil metagenomes and identify a tailoring enzyme-rich family of gene clusters encoding biosynthesis of rifamycin congeners (kanglemycins, Kangs) with potent in vivo and in vitro activity against the most common clinically relevant RifR mutations. Our structural and mechanistic analyses reveal the basis for Kang inhibition of RifR RNAP. Unlike Rifs, Kangs function through a mechanism that includes interfering with 5′-initiating substrate binding. Our results suggest that examining soil microbiomes for new analogues of clinically used antibiotics may uncover metabolites capable of circumventing clinically important resistance mechanisms.
Sequencing of DNA extracted from environmental samples can provide key insights into the biosynthetic potential of uncultured bacteria. However, the high complexity of soil metagenomes, which can contain thousands of bacterial species per gram of soil, imposes significant challenges to explore secondary metabolites potentially produced by rare members of the soil microbiome. Here, we develop a targeted sequencing workflow termed CONKAT-seq (co-occurrence network analysis of targeted sequences) that detects physically clustered biosynthetic domains, a hallmark of bacterial secondary metabolism. Following targeted amplification of conserved biosynthetic domains in a highly partitioned metagenomic library, CONKAT-seq evaluates amplicon co-occurrence patterns across library subpools to identify chromosomally clustered domains. We show that a single soil sample can contain more than a thousand uncharacterized biosynthetic gene clusters, most of which originate from low frequency genomes which are practically inaccessible through untargeted sequencing. CONKAT-seq allows scalable exploration of largely untapped biosynthetic diversity across multiple soils, and can guide the discovery of novel secondary metabolites from rare members of the soil microbiome.
Most natural product biosynthetic gene clusters identified in bacterial genomic and metagenomic sequencing efforts are silent under laboratory growth conditions. Here, we describe a scalable biosynthetic gene cluster activation method wherein the gene clusters are disassembled at interoperonic regions in vitro using CRISPR/Cas9 and then reassembled with PCR-amplified, short DNAs, carrying synthetic promoters, using transformation assisted recombination (TAR) in yeast. This simple, cost-effective, and scalable method allows for the simultaneous generation of combinatorial libraries of refactored gene clusters, eliminating the need to understand the transcriptional hierarchy of the silent genes. In two test cases, this in vitro disassembly-TAR reassembly method was used to create collections of promoter-replaced gene clusters that were tested in parallel to identify versions that enabled secondary metabolite production. Activation of the atolypene (ato) gene cluster led to the characterization of two unprecedented, bacterial cyclic sesterterpenes, atolypene A (1) and B (2), which are moderately cytotoxic to human cancer cell lines. This streamlined in vitro disassembly-in vivo reassembly method offers a simplified approach for silent gene cluster refactoring that should facilitate the discovery of natural products from silent gene clusters cloned from either metagenomes or cultured bacteria.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.