Gardnerella vaginalis is associated with a spectrum of clinical conditions, suggesting high degrees of genetic heterogeneity among stains. Seventeen G. vaginalis isolates were subjected to a battery of comparative genomic analyses to determine their level of relatedness. For each measure, the degree of difference among the G. vaginalis strains was the highest observed among 23 pathogenic bacterial species for which at least eight genomes are available. Genome sizes ranged from 1.491 to 1.716 Mb; GC contents ranged from 41.18% to 43.40%; and the core genome, consisting of only 746 genes, makes up only 51.6% of each strain's genome on average and accounts for only 27% of the species supragenome. Neighbor-grouping analyses, using both distributed gene possession data and core gene allelic data, each identified two major sets of strains, each of which is composed of two groups. Each of the four groups has its own characteristic genome size, GC ratio, and greatly expanded core gene content, making the genomic diversity of each group within the range for other bacterial species. To test whether these 4 groups corresponded to genetically isolated clades, we inferred the phylogeny of each distributed gene that was present in at least two strains and absent in at least two strains; this analysis identified frequent homologous recombination within groups but not between groups or sets. G. vaginalis appears to include four nonrecombining groups/clades of organisms with distinct gene pools and genomic properties, which may confer distinct ecological properties. Consequently, it may be appropriate to treat these four groups as separate species.
In many macroorganisms, the ultimate source of potent biologically active natural products has remained elusive due to an inability to identify and culture the producing symbiotic microorganisms. As a model system for developing a meta-omic approach to identify and characterize natural product pathways from invertebrate-derived microbial consortia we chose to investigate the ET-743 (Yondelis®) biosynthetic pathway. This molecule is an approved anti-cancer agent obtained in low abundance (10−4–10−5% w/w) from the tunicate Ecteinascidia turbinata, and is generated in suitable quantities for clinical use by a lengthy semi-synthetic process. Based on structural similarities to three bacterial secondary metabolites, we hypothesized that ET-743 is the product of a marine bacterial symbiont. Using metagenomic sequencing of total DNA from the tunicate/microbial consortium we targeted and assembled a 35 kb contig containing 25 genes that comprise the core of the NRPS biosynthetic pathway for this valuable anti-cancer agent. Rigorous sequence analysis based on codon usage of two large unlinked contigs suggests that Candidatus Endoecteinascidia frumentensis produces the ET-743 metabolite. Subsequent metaproteomic analysis confirmed expression of three key biosynthetic proteins. Moreover, the predicted activity of an enzyme for assembly of the tetrahydroisoquinoline core of ET-743 was verified in vitro. This work provides a foundation for direct production of the drug and new analogs through metabolic engineering. We expect that the interdisciplinary approach described is applicable to diverse host-symbiont systems that generate valuable natural products for drug discovery and development.
Although there is tremendous interest in understanding the evolutionary roles of horizontal gene transfer (HGT) processes that occur during chronic polyclonal infections, to date there have been few studies that directly address this topic. We have characterized multiple HGT events that most likely occurred during polyclonal infection among nasopharyngeal strains of Streptococcus pneumoniae recovered from a child suffering from chronic upper respiratory and middle-ear infections. Whole genome sequencing and comparative genomics were performed on six isolates collected during symptomatic episodes over a period of seven months. From these comparisons we determined that five of the isolates were genetically highly similar and likely represented a dominant lineage. We analyzed all genic and allelic differences among all six isolates and found that all differences tended to occur within contiguous genomic blocks, suggestive of strain evolution by homologous recombination. From these analyses we identified three strains (two of which were recovered on two different occasions) that appear to have been derived sequentially, one from the next, each by multiple recombination events. We also identified a fourth strain that contains many of the genomic segments that differentiate the three highly related strains from one another, and have hypothesized that this fourth strain may have served as a donor multiple times in the evolution of the dominant strain line. The variations among the parent, daughter, and grand-daughter recombinant strains collectively cover greater than seven percent of the genome and are grouped into 23 chromosomal clusters. While capturing in vivo HGT, these data support the distributed genome hypothesis and suggest that a single competence event in pneumococci can result in the replacement of DNA at multiple non-adjacent loci.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.