Actinobacteria encode a wealth of natural product biosynthetic gene clusters (NPGCs), whose systematic study is complicated by numerous repetitive motifs. By combining several metrics we developed a method for global classification of these gene clusters into families (GCFs) and analyzed the biosynthetic capacity of Actinobacteria in 830 genome sequences, including 344 obtained for this project. The GCF network, comprised of 11,422 gene clusters grouped into 4,122 GCFs, was validated in hundreds of strains by correlating confident mass spectrometric detection of known small molecules with the presence/absence of their established biosynthetic gene clusters. The method also linked previously unassigned GCFs to known natural products, an approach that will enable de novo, bioassay-free discovery of novel natural products using large data sets. Extrapolation from the 830-genome dataset reveals that Actinobacteria encode hundreds of thousands of future drug leads, while the strong correlation between phylogeny and GCFs frames a roadmap to efficiently access them.
Phosphonates, molecules containing direct carbon-phosphorus bonds, compose a structurally diverse class of natural products with interesting and useful biological properties. Although their synthesis in protozoa was discovered more than 50 y ago, the extent and diversity of phosphonate production in nature remains poorly characterized. The rearrangement of phosphoenolpyruvate (PEP) to phosphonopyruvate, catalyzed by the enzyme PEP mutase (PepM), is shared by the vast majority of known phosphonate biosynthetic pathways. Thus, the pepM gene can be used as a molecular marker to examine the occurrence and abundance of phosphonate-producing organisms. Based on the presence of this gene, phosphonate biosynthesis is common in microbes, with ∼5% of sequenced bacterial genomes and 7% of genome equivalents in metagenomic datasets carrying pepM homologs. Similarly, we detected the pepM gene in ∼5% of random actinomycete isolates. The pepM-containing gene neighborhoods from 25 of these isolates were cloned, sequenced, and compared with those found in sequenced genomes. PEP mutase sequence conservation is strongly correlated with conservation of other nearby genes, suggesting that the diversity of phosphonate biosynthetic pathways can be predicted by examining PEP mutase diversity. We used this approach to estimate the range of phosphonate biosynthetic pathways in nature, revealing dozens of discrete groups in pepM amplicons from local soils, whereas hundreds were observed in metagenomic datasets. Collectively, our analyses show that phosphonate biosynthesis is both diverse and relatively common in nature, suggesting that the role of phosphonate molecules in the biosphere may be more important than is often recognized.
Although natural products have been a particularly rich source of human medicines, activity-based screening results in a very high rate of rediscovery of known molecules. Based on the large number of natural product biosynthetic genes in microbial genomes, many have proposed "genome mining" as an alternative approach for discovery efforts; however, this idea has yet to be performed experimentally on a large scale. Here, we demonstrate the feasibility of large-scale, high-throughput genome mining by screening a collection of over 10,000 actinomycetes for the genetic potential to make phosphonic acids, a class of natural products with diverse and useful bioactivities. Genome sequencing identified a diverse collection of phosphonate biosynthetic gene clusters within 278 strains. These clusters were classified into 64 distinct groups, of which 55 are likely to direct the synthesis of unknown compounds. Characterization of strains within five of these groups resulted in the discovery of a new archetypical pathway for phosphonate biosynthesis, the first (to our knowledge) dedicated pathway for H-phosphinates, and 11 previously undescribed phosphonic acid natural products. Among these compounds are argolaphos, a broad-spectrum antibacterial phosphonopeptide composed of aminomethylphosphonate in peptide linkage to a rare amino acid N 5 -hydroxyarginine; valinophos, an N-acetyl L-Val ester of 2,3-dihydroxypropylphosphonate; and phosphonocystoximate, an unusual thiohydroximate-containing molecule representing a new chemotype of sulfur-containing phosphonate natural products. Analysis of the genome sequences from the remaining strains suggests that the majority of the phosphonate biosynthetic repertoire of Actinobacteria has been captured at the gene level. This dereplicated strain collection now provides a reservoir of numerous, as yet undiscovered, phosphonate natural products.natural products | genome mining | phosphonic acid | antibiotic
BackgroundActinomycetes are a diverse group of medically, industrially and ecologically important bacteria, studied as much for the diseases they cause as for the cures they hold. The genomes of actinomycetes revealed that these bacteria have a large number of natural product gene clusters, although many of these are difficult to tie to products in the laboratory. Large scale comparisons of these clusters are difficult to perform due to the presence of highly similar repeated domains in the most common biosynthetic machinery: polyketide synthases (PKSs) and nonribosomal peptide synthetases (NRPSs).ResultsWe have used comparative genomics to provide an overview of the genomic features of a set of 102 closed genomes from this important group of bacteria with a focus on natural product biosynthetic genes. We have focused on well-represented genera and determine the occurrence of gene cluster families therein. Conservation of natural product gene clusters within Mycobacterium, Streptomyces and Frankia suggest crucial roles for natural products in the biology of each genus. The abundance of natural product classes is also found to vary greatly between genera, revealing underlying patterns that are not yet understood.ConclusionsA large-scale analysis of natural product gene clusters presents a useful foundation for hypothesis formulation that is currently underutilized in the field. Such studies will be increasingly necessary to study the diversity and ecology of natural products as the number of genome sequences available continues to grow.
Horizontal gene transfer (HGT) is widespread in the microbial world, but its impact on the origin and persistence of microbial species remains poorly defined. HGT can result in either acquisition of new genetic material or homologous replacement of existing genes. The evolutionary significance of homologous recombination in a population can be quantified by examining the relative rates at which polymorphisms are introduced from recombination (q) and mutation (h w ). We used multilocus sequence analysis (MLSA) to quantify both intraspecies and interspecies homologous recombination among streptomycetes, multicellular Gram-positive bacteria ubiquitous in soil, which are an important source of antibiotics and bioactive compounds. Intraspecies recombination was examined using strains of Streptomyces flavogriseus isolated from soils at five locations spanning 1000 km. The strains had 499.8% nucleotide identity across the loci examined. We found remarkable levels of gene exchange within S. flavogriseus (q/h w ¼ 27.9), and found that the population was in linkage equilibrium (standardized index of association ¼ 0.0018), providing evidence for a freely recombining sexual population structure. We also examined interspecies homologous recombination among different Streptomyces species in an MLSA data set and found that 40% of the species had housekeeping genes acquired through HGT. The recombination rate between these named species (q/h w ¼ 0.21) exceeds that observed within many species of bacteria. Despite widespread gene exchange in the genus, the intraspecies recombination rate exceeded the interspecies rate by two orders of magnitude suggesting that patterns of gene exchange and recombination may shape the evolution of streptomycetes.
We show that Streptomyces biogeography in soils across North America is influenced by the regional diversification of microorganisms due to dispersal limitation and genetic drift. Streptomyces spp. form desiccation-resistant spores, which can be dispersed on the wind, allowing for a strong test of whether dispersal limitation governs patterns of terrestrial microbial diversity. We employed an approach that has high sensitivity for determining the effects of genetic drift. Specifically, we examined the genetic diversity and phylogeography of physiologically similar Streptomyces strains isolated from geographically distributed yet ecologically similar habitats. We found that Streptomyces beta diversity scales with geographic distance and both beta diversity and phylogenetic diversity manifest in a latitudinal diversity gradient. This pattern of Streptomyces biogeography resembles patterns seen for diverse species of plants and animals, and we therefore evaluated these data in the context of ecological and evolutionary hypotheses proposed to explain latitudinal diversity gradients. The data are consistent with the hypothesis that niche conservatism limits dispersal, and historical patterns of glaciation have limited the time for speciation in higher-latitude sites. Most notably, higher-latitude sites have lower phylogenetic diversity, higher phylogenetic clustering, and evidence of range expansion from lower latitudes. In addition, patterns of beta diversity partition with respect to the glacial history of sites. Hence, the data support the hypothesis that extant patterns of Streptomyces biogeography have been driven by historical patterns of glaciation and are the result of demographic range expansion, dispersal limitation, and regional diversification due to drift.
For more than half a century the pharmaceutical industry has sifted through natural products produced by microbes, uncovering new scaffolds and fashioning them into a broad range of vital drugs. We sought a strategy to reinvigorate the discovery of natural products with distinctive structures using bacterial genome sequencing combined with metabolomics. By correlating genetic content from 178 actinomycete genomes with mass spectrometry-enabled analyses of their exported metabolomes, we paired new secondary metabolites with their biosynthetic gene clusters. We report the use of this new approach to isolate and characterize tambromycin, a new chlorinated natural product, composed of several nonstandard amino acid monomeric units, including a unique pyrrolidine-containing amino acid we name tambroline. Tambromycin shows antiproliferative activity against cancerous human B- and T-cell lines. The discovery of tambromycin via large-scale correlation of gene clusters with metabolites (a.k.a. metabologenomics) illuminates a path for structure-based discovery of natural products at a sharply increased rate.
The family Streptomycetaceae, notably species in the genus Streptomyces, have long been the subject of investigation due to their well-known ability to produce secondary metabolites. The emergence of drug resistant pathogens and the relative ease of producing genome sequences has renewed the importance of Streptomyces as producers of new natural products and resulted in revived efforts in isolating and describing strains from novel environments. A previous large study of the phylogeny in the Streptomycetaceae based on 16S rRNA gene sequences provided a useful framework for the relationships among species, but did not always have sufficient resolution to provide definitive identification. Multi-locus sequence analysis of 5 house-keeping genes has been shown to provide improved taxonomic resolution of Streptomyces species in a number of previous reports so a comprehensive study was undertaken to evaluate evolutionary relationships among species within the family Streptomycetaceae where type strains are available in the ARS Culture Collection or genome sequences are available in GenBank. The results of the analysis supported the distinctiveness of Kitasatospora and Streptacidiphilus as validly named genera since they cluster outside of the phylogenetic radiation of the genus Streptomyces. There is also support for the transfer of a number of Streptomyces species to the genus Kitasatospora as well for reducing at least 31 species clusters to a single taxon. The multi-locus sequence database resulting from the study is a useful tool for identification of new isolates and the phylogenetic analysis presented also provides a road map for planning future genome sequencing efforts in the Streptomycetaceae.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.