Microbial hydrolysis of polysaccharides is critical to ecosystem functioning and is of great interest in diverse biotechnological applications, such as biofuel production and bioremediation. Here we demonstrate the use of a new, efficient approach to recover genomes of active polysaccharide degraders from natural, complex microbial assemblages, using a combination of fluorescently labeled substrates, fluorescence-activated cell sorting, and single cell genomics. We employed this approach to analyze freshwater and coastal bacterioplankton for degraders of laminarin and xylan, two of the most abundant storage and structural polysaccharides in nature. Our results suggest that a few phylotypes of Verrucomicrobia make a considerable contribution to polysaccharide degradation, although they constituted only a minor fraction of the total microbial community. Genomic sequencing of five cells, representing the most predominant, polysaccharide-active Verrucomicrobia phylotype, revealed significant enrichment in genes encoding a wide spectrum of glycoside hydrolases, sulfatases, peptidases, carbohydrate lyases and esterases, confirming that these organisms were well equipped for the hydrolysis of diverse polysaccharides. Remarkably, this enrichment was on average higher than in the sequenced representatives of Bacteroidetes, which are frequently regarded as highly efficient biopolymer degraders. These findings shed light on the ecological roles of uncultured Verrucomicrobia and suggest specific taxa as promising bioprospecting targets. The employed method offers a powerful tool to rapidly identify and recover discrete genomes of active players in polysaccharide degradation, without the need for cultivation.
In May of 2011, an enteroaggregative Escherichia coli O104:H4 strain that had acquired a Shiga toxin 2-converting phage caused a large outbreak of bloody diarrhea in Europe which was notable for its high prevalence of hemolytic uremic syndrome cases. Several studies have described the genomic inventory and phylogenies of strains associated with the outbreak and a collection of historical E. coli O104:H4 isolates using draft genome assemblies. We present the complete, closed genome sequences of an isolate from the 2011 outbreak (2011C–3493) and two isolates from cases of bloody diarrhea that occurred in the Republic of Georgia in 2009 (2009EL–2050 and 2009EL–2071). Comparative genome analysis indicates that, while the Georgian strains are the nearest neighbors to the 2011 outbreak isolates sequenced to date, structural and nucleotide-level differences are evident in the Stx2 phage genomes, the mer/tet antibiotic resistance island, and in the prophage and plasmid profiles of the strains, including a previously undescribed plasmid with homology to the pMT virulence plasmid of Yersinia pestis. In addition, multiphenotype analysis showed that 2009EL–2071 possessed higher resistance to polymyxin and membrane-disrupting agents. Finally, we show evidence by electron microscopy of the presence of a common phage morphotype among the European and Georgian strains and a second phage morphotype among the Georgian strains. The presence of at least two stx2 phage genotypes in host genetic backgrounds that may derive from a recent common ancestor of the 2011 outbreak isolates indicates that the emergence of stx2 phage-containing E. coli O104:H4 strains probably occurred more than once, or that the current outbreak isolates may be the result of a recent transfer of a new stx2 phage element into a pre-existing stx2-positive genetic background.
Continued advancements in sequencing technologies have fueled the development of new sequencing applications and promise to flood current databases with raw data. A number of factors prevent the seamless and easy use of these data, including the breadth of project goals, the wide array of tools that individually perform fractions of any given analysis, the large number of associated software/hardware dependencies, and the detailed expertise required to perform these analyses. To address these issues, we have developed an intuitive web-based environment with a wide assortment of integrated and cutting-edge bioinformatics tools in pre-configured workflows. These workflows, coupled with the ease of use of the environment, provide even novice next-generation sequencing users with the ability to perform many complex analyses with only a few mouse clicks and, within the context of the same environment, to visualize and further interrogate their results. This bioinformatics platform is an initial attempt at Empowering the Development of Genomics Expertise (EDGE) in a wide range of applications for microbial research.
BackgroundAlthough serotype O157:H7 is the predominant enterohemorrhagic Escherichia coli (EHEC), outbreaks of non-O157 EHEC that cause severe foodborne illness, including hemolytic uremic syndrome have increased worldwide. In fact, non-O157 serotypes are now estimated to cause over half of all the Shiga toxin-producing Escherichia coli (STEC) cases, and outbreaks of non-O157 EHEC infections are frequently associated with serotypes O26, O45, O103, O111, O121, and O145. Currently, there are no complete genomes for O145 in public databases.ResultsWe determined the complete genome sequences of two O145 strains (EcO145), one linked to a US lettuce-associated outbreak (RM13514) and one to a Belgium ice-cream-associated outbreak (RM13516). Both strains contain one chromosome and two large plasmids, with genome sizes of 5,737,294 bp for RM13514 and 5,559,008 bp for RM13516. Comparative analysis of the two EcO145 genomes revealed a large core (5,173 genes) and a considerable amount of strain-specific genes. Additionally, the two EcO145 genomes display distinct chromosomal architecture, virulence gene profile, phylogenetic origin of Stx2a prophage, and methylation profile (methylome). Comparative analysis of EcO145 genomes to other completely sequenced STEC and other E. coli and Shigella genomes revealed that, unlike any other known non-O157 EHEC strain, EcO145 ascended from a common lineage with EcO157/EcO55. This evolutionary relationship was further supported by the pangenome analysis of the 10 EHEC str ains. Of the 4,192 EHEC core genes, EcO145 shares more genes with EcO157 than with the any other non-O157 EHEC strains.ConclusionsOur data provide evidence that EcO145 and EcO157 evolved from a common lineage, but ultimately each serotype evolves via a lineage-independent nature to EHEC by acquisition of the core set of EHEC virulence factors, including the genes encoding Shiga toxin and the large virulence plasmid. The large variation between the two EcO145 genomes suggests a distinctive evolutionary path between the two outbreak strains. The distinct methylome between the two EcO145 strains is likely due to the presence of a BsuBI/PstI methyltransferase gene cassette in the Stx2a prophage of the strain RM13514, suggesting a role of horizontal gene transfer-mediated epigenetic alteration in the evolution of individual EHEC strains.
chien-chi Lo & patrick S. G. chain * there is growing interest in reconstructing phylogenies from the copious amounts of genome sequencing projects that target related viral, bacterial or eukaryotic organisms. to facilitate the construction of standardized and robust phylogenies for disparate types of projects, we have developed a complete bioinformatic workflow, with a web-based component to perform phylogenetic and molecular evolutionary (phaMe) analysis from sequencing reads, draft assemblies or completed genomes of closely related organisms. furthermore, the ability to incorporate raw data, including some metagenomic samples containing a target organism (e.g. from clinical samples with suspected infectious agents), shows promise for the rapid phylogenetic characterization of organisms within complex samples without the need for prior assembly. The reconstruction of organismal evolutionary history using phylogenetics is a fundamental method applied to many areas of biology. Single nucleotide polymorphisms (SNPs), one of the dominant forms of evolutionary change, have become an indispensable tool for phylogenetic analyses 1-4. Phylogenies in the pre-genomic era relied on SNPs and conserved sites within a single locus, and was later extended to multiple loci, such as in multiple locus sequence typing (MLST). Although still valuable, these methods only consider evolutionary signals originating within a small fraction of the genome, are unable to capture the complete variation within species, and generally provide a weak phylogenetic signal, particularly within a species, and do not always reflect the true evolutionary history of species 5. While phylogenetic analyses that use many conserved genes (orthologs) are a great improvement, these methods require annotated coding regions, whose predictions are not always accurate or available 6. Furthermore, they are impacted by horizontal gene transfer (HGT) 7 , recombination 8 , rate heterogeneity 9 , and incomplete lineage sorting. Genome-wide SNPs are one of the best measures of phylogenetic diversity as they can discriminate among closely related organisms and help resolve both short and long branches in a tree 10,11. Since selectively neutral SNPs accumulate at a uniform rate, they can be used to measure divergence between species as well as strains 12,13. Furthermore, due to the large number of SNPs found along the length of entire genomes, the use of whole-genome SNPs minimizes the impact of random sequencing and assembly errors that can impact individual loci, as well as biases due to individual genes under strong selective pressure. Some inherent biases remain with whole genome SNP approaches that are similar to loci-based phylogenies such as HGT, recombination, and rate heterogeneity. Although genome-wide sequencing now allows examination of the full complement of genomic variation, the number of completed and finished genomes are increasingly falling behind the generation of new draft genomes, due to the lack of computational or other resources. For example, o...
Endocytosis and vesicle trafficking are required for optimal neural transmission. Yet, little is currently known about the evolution of neuronal proteins regulating these processes. Here, we report the first phylogenetic study of NEEP21, calcyon, and P19, a family of neuronal proteins implicated in synaptic receptor endocytosis and recycling, as well as in membrane protein trafficking in the somatodendritic and axonal compartments of differentiated neurons. Database searches identified orthologs for P19 and NEEP21 in bony fish, but not urochordate or invertebrate phyla. Calcyon orthologs were only retrieved from mammalian databases and distant relatives from teleost fish. In situ localization of the P19 zebrafish ortholog, and extant progenitor of the gene family, revealed a CNS specific expression pattern. Based on non-synonymous nucleotide substitution rates, the calcyon genes appear to be under less intense negative selective pressure. Indeed, a functional group II WW domain binding motif was detected in primate and human calcyon, but not in non-primate orthologs. Sequencing of the calcyon gene from 80 human subjects revealed a non-synonymous single nucleotide polymorphism that abrogated group II WW domain protein binding. Altogether, our data indicate the NEEP21/calcyon/P19 gene family emerged, and underwent two rounds of gene duplication relatively late in metazoan evolution (but early in vertebrate evolution at the latest). As functional studies suggest NEEP21 and calcyon play related, but distinct roles in regulating vesicle trafficking at synapses, and in neurons in general, we propose the family arose in chordates to support a more diverse range of synaptic and behavioral responses.
Escherichia coli O145:H28 strain RM12581 was isolated from bagged romaine lettuce during a 2010 U.S. lettuce-associated outbreak. E. coli O145:H28 strain RM12761 was isolated from ice cream during a 2007 ice cream-associated outbreak in Belgium. Here we report the complete genome sequences and annotation of both strains.
No abstract
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.