Bacteriophages typically have small genomes 1 and depend on their bacterial hosts for replication 2 . Here we sequenced DNA from diverse ecosystems and found hundreds of phage genomes with lengths of more than 200 kilobases (kb), including a genome of 735 kb, which is-to our knowledge-the largest phage genome to be described to date. Thirty-five genomes were manually curated to completion (circular and no gaps). Expanded genetic repertoires include diverse and previously undescribed CRISPR-Cas systems, transfer RNAs (tRNAs), tRNA synthetases, tRNA-modification enzymes, translation-initiation and elongation factors, and ribosomal proteins. The CRISPR-Cas systems of phages have the capacity to silence host transcription factors and translational genes, potentially as part of a larger interaction network that intercepts translation to redirect biosynthesis to phage-encoded functions. In addition, some phages may repurpose bacterial CRISPR-Cas systems to eliminate competing phages. We phylogenetically define the major clades of huge phages from human and other animal microbiomes, as well as from oceans, lakes, sediments, soils and the built environment. We conclude that the large gene inventories of huge phages reflect a conserved biological strategy, and that the phages are distributed across a broad bacterial host range and across Earth's ecosystems.Phages-viruses that infect bacteria-are considered distinct from cellular life owing to their inability to carry out most biological processes required for reproduction. They are agents of ecosystem change because they prey on specific bacterial populations, mediate lateral gene transfer, alter host metabolism and redistribute bacterially derived compounds through cell lysis 2-4 . They spread antibiotic resistance 5 and disperse pathogenicity factors that cause disease in humans and animals 6,7 . Most knowledge about phages is based on laboratorystudied examples, the vast majority of which have genomes that are a few tens of kb in length. Widely used isolation-based methods select against large phage particles, and they can be excluded from phage concentrates obtained by passage through 100-nm or 200-nm filters 1 . In 2017, only 93 isolated phages with genomes that were more than 200 kb in length were published 1 . Sequencing of whole-community DNA can uncover phage-derived fragments; however, large genomes can still escape detection owing to fragmentation 8 . A new clade of human-and animal-associated megaphages was recently described on the basis of genomes that were manually curated to completion from metagenomic datasets 9 . This finding prompted us to carry out a more-comprehensive analysis of microbial communities to evaluate the prevalence, diversity and ecosystem distribution of phages with large genomes. Previously, phages with genomes of more than 200 kb have been referred to as 'jumbophages' 1 or, in the case of phages with genomes of more than 500 kb, as megaphages 9 . As the set reconstructed here span both size ranges we refer to them simply as 'huge phage...
Methanogenesis is an ancient metabolism of key ecological relevance, with direct impact on the evolution of Earth’s climate. Recent results suggest that the diversity of methane metabolisms and their derivations have probably been vastly underestimated. Here, by probing thousands of publicly available metagenomes for homologues of methyl-coenzyme M reductase complex (MCR), we have obtained ten metagenome-assembled genomes (MAGs) belonging to potential methanogenic, anaerobic methanotrophic and short-chain alkane oxidizing archaea. Five of these MAGs represent under-sampled (e.g., Verstraetearchaeota, Methanonatronarchaeia, ANME-1) or previously genomically undescribed (ANME-2c) archaeal lineages. The remaining five MAGs correspond to lineages that are only distantly related to previously known methanogens and span the entire archaeal phylogeny. Comprehensive comparative annotation significantly expands the metabolic diversity and energy conservation systems of MCR-bearing archaea. It also suggests the potential existence of a yet uncharacterized type of methanogenesis linked to short-chain alkane/fatty acid oxidation in a previously undescribed class of archaea (‘ Ca . Methanoliparia’). We redefine a common core of marker genes specific to methanogenic, anaerobic methanotrophic and short-chain alkane-oxidizing archaea, and propose a possible scenario for the evolutionary and functional transitions that led to the emergence of such metabolic diversity.
A wide array of microorganisms survive and thrive in extreme environments. However, we know little about the patterns of, and controls over, their large-scale ecological distribution. To this end, we have applied a bar-coded 16S rRNA pyrosequencing technology to explore the phylogenetic differentiation among 59 microbial communities from physically and geochemically diverse acid mine drainage (AMD) sites across Southeast China, revealing for the first time environmental variation as the major factor explaining community differences in these harsh environments. Our data showed that overall microbial diversity estimates, including phylogenetic diversity, phylotype richness and pairwise UniFrac distance, were largely correlated with pH conditions. Furthermore, multivariate regression tree analysis also identified solution pH as a strong predictor of relative lineage abundance. Betaproteobacteria, mostly affiliated with the ‘Ferrovum' genus, were explicitly predominant in assemblages under moderate pH conditions, whereas Alphaproteobacteria, Euryarchaeota, Gammaproteobacteria and Nitrospira exhibited a strong adaptation to more acidic environments. Strikingly, such pH-dependent patterns could also be observed in a subsequent comprehensive analysis of the environmental distribution of acidophilic microorganisms based on 16S rRNA gene sequences previously retrieved from globally distributed AMD and associated environments, regardless of the long-distance isolation and the distinct substrate types. Collectively, our results suggest that microbial diversity patterns are better predicted by contemporary environmental variation rather than geographical distance in extreme AMD systems.
Genomes are an integral component of the biological information about an organism; thus, the more complete the genome, the more informative it is. Historically, bacterial and archaeal genomes were reconstructed from pure (monoclonal) cultures, and the first reported sequences were manually curated to completion. However, the bottleneck imposed by the requirement for isolates precluded genomic insights for the vast majority of microbial life. Shotgun sequencing of microbial communities, referred to initially as community genomics and subsequently as genome-resolved metagenomics, can circumvent this limitation by obtaining metagenome-assembled genomes (MAGs); but gaps, local assembly errors, chimeras, and contamination by fragments from other genomes limit the value of these genomes. Here, we discuss genome curation to improve and, in some cases, achieve complete (circularized, no gaps) MAGs (CMAGs). To date, few CMAGs have been generated, although notably some are from very complex systems such as soil and sediment. Through analysis of about 7000 published complete bacterial isolate genomes, we verify the value of cumulative GC skew in combination with other metrics to establish bacterial genome sequence accuracy. The analysis of cumulative GC skew identified potential misassemblies in some reference genomes of isolated bacteria and the repeat sequences that likely gave rise to them. We discuss methods that could be implemented in bioinformatic approaches for curation to ensure that metabolic and evolutionary analyses can be based on very high-quality genomes.
The microbial communities in acid mine drainage have been extensively studied to reveal their roles in acid generation and adaption to this environment. Lacking, however, are integrated community- and organism-wide comparative gene transcriptional analyses that could reveal the response and adaptation mechanisms of these extraordinary microorganisms to different environmental conditions. In this study, comparative metagenomics and metatranscriptomics were performed on microbial assemblages collected from four geochemically distinct acid mine drainage (AMD) sites. Taxonomic analysis uncovered unexpectedly high microbial biodiversity of these extremely acidophilic communities, and the abundant taxa of Acidithiobacillus, Leptospirillum and Acidiphilium exhibited high transcriptional activities. Community-wide comparative analyses clearly showed that the AMD microorganisms adapted to the different environmental conditions via regulating the expression of genes involved in multiple in situ functional activities, including low-pH adaptation, carbon, nitrogen and phosphate assimilation, energy generation, environmental stress resistance, and other functions. Organism-wide comparative analyses of the active taxa revealed environment-dependent gene transcriptional profiles, especially the distinct strategies used by Acidithiobacillus ferrivorans and Leptospirillum ferrodiazotrophum in nutrients assimilation and energy generation for survival under different conditions. Overall, these findings demonstrate that the gene transcriptional profiles of AMD microorganisms are closely related to the site physiochemical characteristics, providing clues into the microbial response and adaptation mechanisms in the oligotrophic, extremely acidic environments.
High-throughput sequencing is expanding our knowledge of microbial diversity in the environment. Still, understanding the metabolic potentials and ecological roles of rare and uncultured microbes in natural communities remains a major challenge. To this end, we applied a ‘divide and conquer' strategy that partitioned a massive metagenomic data set (>100 Gbp) into subsets based on K-mer frequency in sequence assembly to a low-diversity acid mine drainage (AMD) microbial community and, by integrating with an additional metatranscriptomic assembly, successfully obtained 11 draft genomes most of which represent yet uncultured and/or rare taxa (relative abundance <1%). We report the first genome of a naturally occurring Ferrovum population (relative abundance >90%) and its metabolic potentials and gene expression profile, providing initial molecular insights into the ecological role of these lesser known, but potentially important, microorganisms in the AMD environment. Gene transcriptional analysis of the active taxa revealed major metabolic capabilities executed in situ, including carbon- and nitrogen-related metabolisms associated with syntrophic interactions, iron and sulfur oxidation, which are key in energy conservation and AMD generation, and the mechanisms of adaptation and response to the environmental stresses (heavy metals, low pH and oxidative stress). Remarkably, nitrogen fixation and sulfur oxidation were performed by the rare taxa, indicating their critical roles in the overall functioning and assembly of the AMD community. Our study demonstrates the potential of the ‘divide and conquer' strategy in high-throughput sequencing data assembly for genome reconstruction and functional partitioning analysis of both dominant and rare species in natural microbial assemblages.
51Genomes are an integral component of the biological information about an organism and, logically, the more 52 complete the genome, the more informative it is. Historically, bacterial and archaeal genomes were 53 reconstructed from pure (monoclonal) cultures and the first reported sequences were manually curated to 54 completion. However, the bottleneck imposed by the requirement for isolates precluded genomic insights 55 for the vast majority of microbial life. Shotgun sequencing of microbial communities, referred to initially as 56 community genomics and subsequently as genome-resolved metagenomics, can circumvent this limitation 57 by obtaining metagenome-assembled genomes (MAGs), but gaps, local assembly errors, chimeras and 58 contamination by fragments from other genomes limit the value of these genomes. Here, we discuss genome 59 curation to improve and in some cases achieve complete (circularized, no gaps) MAGs (CMAGs). To date, few 60 CMAGs have been generated, although notably some are from very complex systems such as soil and 61 sediment. Through analysis of ~7000 published complete bacterial isolate genomes, we verify the value of 62 cumulative GC skew in combination with other metrics to establish bacterial genome sequence accuracy. 63Interestingly, analysis of cumulative GC skew identified potential mis-assemblies in some reference genomes 64 of isolated bacteria and the repeat sequences that likely gave rise to them. We discuss methods that could 65 be implemented in bioinformatic approaches for curation to ensure that metabolic and evolutionary 66 analyses can be based on very high-quality genomes. 67 68 69 70 2017), and even human blood (Moustafa et al. 2017) amenable to shotgun metagenomic surveys and recovery 87 of MAGs. Although incomplete, draft MAGs represent a major advance over knowing nothing about the genes 88 and pathways present in an organism, and led to the discovery of new metabolisms. For example, the complete 89 oxidation of ammonia to nitrate via nitrite (i.e., comammox) was determined by the detection of necessary 90 genes in a single MAG (Daims et al. 2015; van Kessel et al. 2015). MAGs are often derived from uncultivated 91 organisms that can be quite distantly related to any isolated species, which is a clear advantage of MAGs 92 (Becraft et al. 2017; Garg et al. 2019). For this reason, genome-resolved metagenomics has been critical for 93 more comprehensive descriptions of bacterial and archaeal diversity and the overall topology of the Tree of 94
Introduction Microbial residents of the human oral cavity have long been a major focus of microbiology due to their influence on host health and intriguing patterns of site specificity amidst the lack of dispersal limitation. However, the determinants of niche partitioning in this habitat are yet to be fully understood, especially among taxa that belong to recently discovered branches of microbial life. Results Here, we assemble metagenomes from tongue and dental plaque samples from multiple individuals and reconstruct 790 non-redundant genomes, 43 of which resolve to TM7, a member of the Candidate Phyla Radiation, forming six monophyletic clades that distinctly associate with either plaque or tongue. Both pangenomic and phylogenomic analyses group tongue-specific clades with other host-associated TM7 genomes. In contrast, plaque-specific TM7 group with environmental TM7 genomes. Besides offering deeper insights into the ecology, evolution, and mobilome of cryptic members of the oral microbiome, our study reveals an intriguing resemblance between dental plaque and non-host environments indicated by the TM7 evolution, suggesting that plaque may have served as a stepping stone for environmental microbes to adapt to host environments for some clades of microbes. Additionally, we report that prophages are widespread among oral-associated TM7, while absent from environmental TM7, suggesting that prophages may have played a role in adaptation of TM7 to the host environment. Conclusions Our data illuminate niche partitioning of enigmatic members of the oral cavity, including TM7, SR1, and GN02, and provide genomes for poorly characterized yet prevalent members of this biome, such as uncultivated Flavobacteriaceae.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.