Genome mining has become a key technology to exploit natural product diversity. While initially performed on a single-genome basis, the process is now being scaled up to mine entire genera, strain collections and microbiomes. However, no bioinformatic framework is currently available for effectively analyzing datasets of this size and complexity. Here, we provide a streamlined computational workflow consisting of two new software tools: The 'Biosynthetic Gene Similarity Clustering And Prospecting Engine' (BiG-SCAPE) facilitates fast and interactive sequence similarity network analysis of biosynthetic gene clusters and gene cluster families. 'CORe Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:
Genome mining has become a key technology to explore and exploit natural product diversity through the identification and analysis of biosynthetic gene clusters (BGCs). Initially, this was performed on a single-genome basis; currently, the process is being scaled up to large-scale mining of pan-genomes of entire genera, complete strain collections and metagenomic datasets from which thousands of bacterial genomes can be extracted at once. However, no bioinformatic framework is currently available for the effective analysis of datasets of this size and complexity. Here, we provide a streamlined computational workflow, tightly integrated with antiSMASH and MIBiG, that consists of two new software tools, BiG-SCAPE and CORASON. BiG-SCAPE facilitates rapid calculation and interactive visual exploration of BGC sequence similarity networks, grouping gene clusters at multiple hierarchical levels, and includes a 'glocal' alignment mode that accurately groups both complete and fragmented BGCs. CORASON employs a phylogenomic approach to elucidate the detailed evolutionary relationships between gene clusters by computing high-resolution multi-locus phylogenies of all BGCs within and across gene cluster families (GCFs), and allows researchers to comprehensively identify all genomic contexts in which particular biosynthetic gene cassettes are found. We validate BiG-SCAPE by correlating its GCF output to metabolomic data across 403 actinobacterial strains. Furthermore, we demonstrate the discovery potential of the platform by using CORASON to comprehensively map the phylogenetic diversity of the large detoxin/rimosamide gene cluster clan, prioritizing three new detoxin families for subsequent characterization of six new analogs using isotopic labeling and analysis of tandem mass spectrometric data.
Thioester reductase domains catalyze two- and four-electron reductions to release natural products following assembly on nonribosomal peptide synthetases, polyketide synthases, and their hybrid biosynthetic complexes. This reductive off-loading of a natural product yields an aldehyde or alcohol, can initiate the formation of a macrocyclic imine, and contributes to important intermediates in a variety of biosyntheses, including those for polyketide alkaloids and pyrrolobenzodiazepines. Compounds that arise from reductase-terminated biosynthetic gene clusters are often reactive and exhibit biological activity. Biomedically important examples include the cancer therapeutic Yondelis (ecteinascidin 743), peptide aldehydes that inspired the first therapeutic proteasome inhibitor bortezomib, and numerous synthetic derivatives and antibody drug conjugates of the pyrrolobenzodiazepines. Recent advances in microbial genomics, metabolomics, bioinformatics, and reactivity-based labeling have facilitated the detection of these compounds for targeted isolation. Herein, we summarize known natural products arising from this important category, highlighting their occurrence in Nature, biosyntheses, biological activities, and the technologies used for their detection and identification. Additionally, we review publicly available genomic data to highlight the remaining potential for novel reductively tailored compounds and drug leads from microorganisms. This thorough retrospecive highlights various molecularfamilies with especially privileged bioactivity while illuminating challenges and prospects toward accelerating the discovery of new, high value natural products.
Multidrug- and extensively drug-resistant strains of Mycobacterium tuberculosis are resistant to first- and second-line drug regimens and resulted in 210,000 fatalities in 2013. In the current study, we screened a library of aquatic bacterial natural product fractions for their ability to inhibit this pathogen. A fraction from a Lake Michigan bacterium exhibited significant inhibitory activity, from which we characterized novel diazaquinomycins H and J. This antibiotic class displayed an in vitro activity profile similar or superior to clinically used anti-tuberculosis agents and maintained this potency against a panel of drug-resistant M. tuberculosis strains. Importantly, these are among the only freshwater-derived actinomycete bacterial metabolites described to date. Further in vitro profiling against a broad panel of bacteria indicated that this antibiotic class selectively targets M. tuberculosis. Additionally, in the case of this pathogen we present evidence counter to previous reports that claim the diazaquinomycins target thymidylate synthase in Gram-positive bacteria. Thus, we establish freshwater environments as potential sources for novel antibiotic leads and present the diazaquinomycins as potent and selective inhibitors of M. tuberculosis.
Genomics and metabolomics are widely used to explore specialized metabolite diversity. The Paired Omics Data Platform is a community initiative to systematically document links between metabolome and (meta)genome data, aiding identification of natural product biosynthetic origins and metabolite structures.
A screening of our actinomycete fraction library against the NCI-60 SKOV3 human tumor cell line led to the isolation of isopimara-2-one-3-ol-8,15-diene (1), lagumycin B (2), dehydrorabelomycin (3), phenanthroviridone (4), and WS-5995 A (5). These secondary metabolites were produced by a Micromonospora sp. isolated from sediment collected off the Cát Bà peninsula in the East Sea of Vietnam. Compound 1 is a novel Δ8,9-pimarane diterpene, representing one of approximately 20 actinomycete-produced diterpenes reported to date, while compound 2 is an angucycline antibiotic that has yet to receive formal characterization. The structures of 1 and 2 were elucidated by combined NMR and MS analysis and the absolute configuration of 1 was assigned by analysis of NOESY NMR and CD spectroscopic data. Compounds 2–5 exhibited varying degrees of cytotoxicity against a panel of cancerous and non-cancerous cell lines. Overall, this study highlights our collaborative efforts to discover novel biologically active molecules from the large, underexplored, and biodiversity-rich waters of Vietnam’s East Sea.
As part of our program to identify novel secondary metabolites that target drug-resistant ovarian cancers, a screening of our aquatic-derived actinomycete fraction library against a cisplatin-resistant ovarian cancer cell line (OVCAR5) led to the isolation of novel diaza-anthracene antibiotic diazaquinomycin E (DAQE; 1), the isomeric mixture of diazaquinomycin F (DAQF; 2) and diazaquinomycin G (DAQG; 3), and known analog diazaquinomycin A (DAQA; 4). The structures of DAQF and DAQG were solved through deconvolution of X-Ray diffraction data of their corresponding co-crystal. DAQE and DAQA exhibited moderate LC50 values against OVCAR5 of 9.0 and 8.8 μM, respectively. At lethal concentrations of DAQA, evidence of DNA damage was observed via induction of apoptosis through cleaved-PARP. Herein, we will discuss the isolation, structure elucidation, and biological activity of these secondary metabolites.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.