Sequencing of bacterial and archaeal genomes has revolutionized our understanding of the many roles played by microorganisms1. There are now nearly 1,000 completed bacterial and archaeal genomes available2, most of which were chosen for sequencing on the basis of their physiology. As a result, the perspective provided by the currently available genomes is limited by a highly biased phylogenetic distribution3–5. To explore the value added by choosing microbial genomes for sequencing on the basis of their evolutionary relationships, we have sequenced and analysed the genomes of 56 culturable species of Bacteria and Archaea selected to maximize phylogenetic coverage. Analysis of these genomes demonstrated pronounced benefits (compared to an equivalent set of genomes randomly selected from the existing database) in diverse areas including the reconstruction of phylogenetic history, the discovery of new protein families and biological properties, and the prediction of functions for known genes from other organisms. Our results strongly support the need for systematic ‘phylogenomic’ efforts to compile a phylogeny-driven ‘Genomic Encyclopedia of Bacteria and Archaea’ in order to derive maximum knowledge from existing microbial genome data as well as from genome sequences to come.
Culturing microorganisms is a critical step in understanding and utilizing microbial life. Here we map the landscape of existing culture media by extracting natural-language media recipes into a Known Media Database (KOMODO), which includes >18,000 strain–media combinations, >3300 media variants and compound concentrations (the entire collection of the Leibniz Institute DSMZ repository). Using KOMODO, we show that although media are usually tuned for individual strains using biologically common salts, trace metals and vitamins/cofactors are the most differentiating components between defined media of strains within a genus. We leverage KOMODO to predict new organism–media pairings using a transitivity property (74% growth in new in vitro experiments) and a phylogeny-based collaborative filtering tool (83% growth in new in vitro experiments and stronger growth on predicted well-scored versus poorly scored media). These resources are integrated into a web-based platform that predicts media given an organism's 16S rDNA sequence, facilitating future cultivation efforts.
The phylogenetic relationships of the type strains of 38 species from 15 genera of the family Enterobacteriaceae were investigated by comparative 16s rDNA analysis. Several sequences of strains f rom the genera Citrobacter, Erwinia, Pantoea, Proteus, Rahnella and Serratia, analysed in this study, have been analysed previously. However, as the sequences of this study differ slightly from the published ones, they were included in the analysis. Of the 23 enterobacterial genera included in an overview dendrogram of relatedness, members of the genera Xenorhabdus, Photorhabdus, Proteus and Plesiomonas were used as a root. The other genera formed two groups which could be separated, although not exclusively, by signature nucleotides at positions 590-649 and 600-638. Group A contains species of Brenneria, Buttiauxella, Citrobacter, Escherichia, Erwinia, Klebsiella, Pantoea, Pectobacterium and Salmonella. All seven type strains of Buttiauxella share 16s rDNA similarities greater than 99 O/ O. Group B embraces two phylogenetically separate Serratia clusters, a lineage containing Yersinia species, Rahnella aquatica, Ewingella americana, and also the highly related pair Hafnia alvei and Obesumbacterium pro teus.
Genome comparisons based on average nucleotide identity (ANI) values of four strains currently classified as Polynucleobacter necessarius subsp. asymbioticus resulted in ANI values of 75.7–78.4 %, suggesting that each of those strains represents a separate species. The species P. necessarius was proposed by Heckmann and Schmidt in 1987 to accommodate obligate endosymbionts of ciliates affiliated with the genus Euplotes. The required revision of this species is, however, hampered by the fact, that this species is based only on a description and lacks a type strain available as pure culture. Furthermore, the ciliate culture Euplotes aediculatus ATCC 30859, on which the description of the species was based, is no longer available. We found another Euplotes aediculatus culture (Ammermann) sharing the same origin with ATCC 30859 and proved the identity of the endosymbionts contained in the two cultures. A multilocus sequence comparison approach was used to estimate if the four strains currently classified as Polynucleobacter
necessarius subsp. asymbioticus share ANI values with the endosymbiont in the Ammermann culture above or below the threshold for species demarcation. A significant correlation (R2 0.98, P<0.0001) between multilocus sequence similarity and ANI values of genome-sequenced strains enabled the prediction that it is highly unlikely that these four strains belong to the species P. necessarius. We propose reclassification of strains QLW-P1DMWA-1T (=DSM 18221T=CIP 109841T), MWH-MoK4T (=DSM 21495T=CIP 110977T), MWH-JaK3T (=DSM 21493T=CIP 110976T) and MWH-HuW1T (=DSM 21492T=CIP 110978T) as Polynucleobacter asymbioticus comb. nov., Polynucleobacter duraquae sp. nov., Polynucleobacter yangtzensis sp. nov. and Polynucleobacter sinensis sp. nov., respectively.
Summary 26Bacterial strains affiliated to the phylogenetically shallow subcluster C (PnecC) of the Polynucleobacter cluster, which is characterized by a minimal 16S rRNA gene sequence 28 similarity of approx. 98.5%, have been reported to occur as obligate endosymbionts of ciliates (Euplotes spp.), as well as to occur as free-living cells in the pelagic zone of 30 freshwater habitats. We investigated if these two groups of closely related bacteria represent
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.