Bacteriophages typically have small genomes 1 and depend on their bacterial hosts for replication 2 . Here we sequenced DNA from diverse ecosystems and found hundreds of phage genomes with lengths of more than 200 kilobases (kb), including a genome of 735 kb, which is-to our knowledge-the largest phage genome to be described to date. Thirty-five genomes were manually curated to completion (circular and no gaps). Expanded genetic repertoires include diverse and previously undescribed CRISPR-Cas systems, transfer RNAs (tRNAs), tRNA synthetases, tRNA-modification enzymes, translation-initiation and elongation factors, and ribosomal proteins. The CRISPR-Cas systems of phages have the capacity to silence host transcription factors and translational genes, potentially as part of a larger interaction network that intercepts translation to redirect biosynthesis to phage-encoded functions. In addition, some phages may repurpose bacterial CRISPR-Cas systems to eliminate competing phages. We phylogenetically define the major clades of huge phages from human and other animal microbiomes, as well as from oceans, lakes, sediments, soils and the built environment. We conclude that the large gene inventories of huge phages reflect a conserved biological strategy, and that the phages are distributed across a broad bacterial host range and across Earth's ecosystems.Phages-viruses that infect bacteria-are considered distinct from cellular life owing to their inability to carry out most biological processes required for reproduction. They are agents of ecosystem change because they prey on specific bacterial populations, mediate lateral gene transfer, alter host metabolism and redistribute bacterially derived compounds through cell lysis 2-4 . They spread antibiotic resistance 5 and disperse pathogenicity factors that cause disease in humans and animals 6,7 . Most knowledge about phages is based on laboratorystudied examples, the vast majority of which have genomes that are a few tens of kb in length. Widely used isolation-based methods select against large phage particles, and they can be excluded from phage concentrates obtained by passage through 100-nm or 200-nm filters 1 . In 2017, only 93 isolated phages with genomes that were more than 200 kb in length were published 1 . Sequencing of whole-community DNA can uncover phage-derived fragments; however, large genomes can still escape detection owing to fragmentation 8 . A new clade of human-and animal-associated megaphages was recently described on the basis of genomes that were manually curated to completion from metagenomic datasets 9 . This finding prompted us to carry out a more-comprehensive analysis of microbial communities to evaluate the prevalence, diversity and ecosystem distribution of phages with large genomes. Previously, phages with genomes of more than 200 kb have been referred to as 'jumbophages' 1 or, in the case of phages with genomes of more than 500 kb, as megaphages 9 . As the set reconstructed here span both size ranges we refer to them simply as 'huge phage...
Candidate phyla radiation (CPR) bacteria and DPANN archaea are unisolated, small-celled symbionts that are often detected in groundwater. The effects of groundwater geochemistry on the abundance, distribution, taxonomic diversity and host association of CPR bacteria and DPANN archaea has not been studied. Here, we performed genome-resolved metagenomic analysis of one agricultural and seven pristine groundwater microbial communities and recovered 746 CPR and DPANN genomes in total. The pristine sites, which serve as local sources of drinking water, contained up to 31% CPR bacteria and 4% DPANN archaea. We observed little species-level overlap of metagenome-assembled genomes (MAGs) across the groundwater sites, indicating that CPR and DPANN communities may be differentiated according to physicochemical conditions and host populations. Cryogenic transmission electron microscopy imaging and genomic analyses enabled us to identify CPR and DPANN lineages that reproducibly attach to host cells and showed that the growth of CPR bacteria seems to be stimulated by attachment to host-cell surfaces. Our analysis reveals site-specific diversity of CPR bacteria and DPANN archaea that coexist with diverse hosts in groundwater aquifers. Given that CPR and DPANN organisms have been identified in human microbiomes and their presence is correlated with diseases such as periodontitis, our findings are relevant to considerations of drinking water quality and human health.
Survival and growth of the anaerobic gut fungi (AGF; Neocallimastigomycota) in the herbivorous gut necessitate the possession of multiple abilities absent in other fungal lineages. We hypothesized that horizontal gene transfer (HGT) was instrumental in forging the evolution of AGF into a phylogenetically distinct gut-dwelling fungal lineage. The patterns of HGT were evaluated in the transcriptomes of 27 AGF strains, 22 of which were isolated and sequenced in this study, and 4 AGF genomes broadly covering the breadth of AGF diversity. We identified 277 distinct incidents of HGT in AGF transcriptomes, with subsequent gene duplication resulting in an HGT frequency of 2 to 3.5% in AGF genomes. The majority of HGT events were AGF specific (91.7%) and wide (70.8%), indicating their occurrence at early stages of AGF evolution. The acquired genes allowed AGF to expand their substrate utilization range, provided new venues for electron disposal, augmented their biosynthetic capabilities, and facilitated their adaptation to anaerobiosis. The majority of donors were anaerobic fermentative bacteria prevalent in the herbivorous gut. This study strongly indicates that HGT indispensably forged the evolution of AGF as a distinct fungal phylum and provides a unique example of the role of HGT in shaping the evolution of a high-rank taxonomic eukaryotic lineage. IMPORTANCE The anaerobic gut fungi (AGF) represent a distinct basal phylum lineage (Neocallimastigomycota) commonly encountered in the rumen and alimentary tracts of herbivores. Survival and growth of anaerobic gut fungi in these anaerobic, eutrophic, and prokaryote-dominated habitats necessitates the acquisition of several traits absent in other fungal lineages. We assess here the role of horizontal gene transfer as a relatively fast mechanism for trait acquisition by the Neocallimastigomycota postsequestration in the herbivorous gut. Analysis of 27 transcriptomes that represent the broad diversity of Neocallimastigomycota identified 277 distinct HGT events, with subsequent gene duplication resulting in an HGT frequency of 2 to 3.5% in AGF genomes. These HGT events have allowed AGF to survive in the herbivorous gut by expanding their substrate utilization range, augmenting their biosynthetic pathway, providing new routes for electron disposal by expanding fermentative capacities, and facilitating their adaptation to anaerobiosis. HGT in the AGF is also shown to be mainly a cross-kingdom affair, with the majority of donors belonging to the bacteria. This study represents a unique example of the role of HGT in shaping the evolution of a high-rank taxonomic eukaryotic lineage.
The “Latescibacteria” (formerly WS3), member of the Fibrobacteres–Chlorobi–Bacteroidetes (FCB) superphylum, represents a ubiquitous candidate phylum found in terrestrial, aquatic, and marine ecosystems. Recently, single-cell amplified genomes (SAGs) representing the “Latescibacteria” were obtained from the anoxic monimolimnion layers of Sakinaw Lake (British Columbia, Canada), and anoxic sediments of a coastal lagoon (Etoliko lagoon, Western Greece). Here, we present a detailed in-silico analysis of the four SAGs to gain some insights on their metabolic potential and apparent ecological roles. Metabolic reconstruction suggests an anaerobic fermentative mode of metabolism, as well as the capability to degrade multiple polysaccharides and glycoproteins that represent integral components of green (Charophyta and Chlorophyta) and brown (Phaeophycaea) algae cell walls (pectin, alginate, ulvan, fucan, hydroxyproline-rich glycoproteins), storage molecules (starch and trehalose), and extracellular polymeric substances (EPSs). The analyzed SAGs also encode dedicated transporters for the uptake of produced sugars and amino acids/oligopeptides, as well as an extensive machinery for the catabolism of all transported sugars, including the production of a bacterial microcompartment (BMC) to sequester propionaldehyde, a toxic intermediate produced during fucose and rhamnose metabolism. Finally, genes for the formation of gas vesicles, flagella, type IV pili, and oxidative stress response were found, features that could aid in cellular association with algal detritus. Collectively, these results indicate that the analyzed “Latescibacteria” mediate the turnover of multiple complex organic polymers of algal origin that reach deeper anoxic/microoxic habitats in lakes and lagoons. The implications of such process on our understanding of niche specialization in microbial communities mediating organic carbon turnover in stratified water bodies are discussed.
We investigated the global patterns of abundance, diversity, and community structure of members of the Aminicenantes (candidate phylum OP8). Our aim was to identify the putative ecological role(s) played by members of this poorly characterized bacterial lineages in various ecosystems. Analysis of near full-length 16S rRNA genes identified four classes and eight orders within the Aminicenantes. Within 3,134 datasets comprising ∼1.8 billion high throughput-generated partial 16S rRNA genes, 47,351 Aminicenantes-affiliated sequences were identified in 913 datasets. The Aminicenantes exhibited the highest relative abundance in hydrocarbon-impacted environments, followed by marine habitats (especially hydrothermal vents and coral-associated microbiome samples), and aquatic, non-marine habitats (especially in terrestrial springs and groundwater samples). While the overall abundance of the Aminicenantes was higher in low oxygen tension as well as non-saline and low salinity habitats, it was encountered in a wide range of oxygen tension, salinities, and temperatures. Analysis of the community structure of the Aminicenantes showed distinct patterns across various datasets that appear to be, mostly, driven by habitat variations rather than prevalent environmental parameters. We argue that the detection of the Aminicenantes across environmental extremes and the observed distinct community structure patterns reflect a high level of intraphylum metabolic diversity and adaptive capabilities that enable its survival and growth in a wide range of habitats and environmental conditions.
The archaeal phylum ‘Diapherotrites’ was recently proposed based on phylogenomic analysis of genomes recovered from an underground water seep in an abandoned gold mine (Homestake mine in Lead, SD, USA). Here we present a detailed analysis of the metabolic capabilities and genomic features of three single amplified genomes (SAGs) belonging to the ‘Diapherotrites’. The most complete of the SAGs, Candidatus ‘Iainarchaeum andersonii’ (Cand. IA), had a small genome (∼1.24 Mb), short average gene length (822 bp), one ribosomal RNA operon, high coding density (∼90.4%), high percentage of overlapping genes (27.6%) and low incidence of gene duplication (2.16%). Cand. IA genome possesses limited catabolic capacities that, nevertheless, could theoretically support a free-living lifestyle by channeling a narrow range of substrates such as ribose, polyhydroxybutyrate and several amino acids to acetyl-coenzyme A. On the other hand, Cand. IA possesses relatively well-developed anabolic capabilities, although it remains auxotrophic for several amino acids and cofactors. Phylogenetic analysis suggests that the majority of Cand. IA anabolic genes were acquired from bacterial donors via horizontal gene transfer. We thus propose that members of the ‘Diapherotrites’ have evolved from an obligate symbiotic ancestor by acquiring anabolic genes from bacteria that enabled independent biosynthesis of biological molecules previously acquired from symbiotic hosts. ‘Diapherotrites’ 16S rRNA genes exhibit multiple mismatches with the majority of archaeal 16S rRNA primers, a fact that could be responsible for their observed rarity in amplicon-generated data sets. The limited substrate range, complex growth requirements and slow growth rate predicted could be responsible for its refraction to isolation.
We investigated the global distribution patterns and pangenomic diversity of the candidate phylum "" (WS3) in 16S rRNA gene as well as metagenomic data sets. We document distinct distribution patterns for various "" orders in 16S rRNA gene data sets, with prevalence of orders sediment_1 in terrestrial, PBSIII_9 in groundwater and temperate freshwater, and GN03 in pelagic marine, saline-hypersaline, and wastewater habitats. Using a fragment recruitment approach, we identified 68.9 Mb of ""-affiliated contigs in publicly available metagenomic data sets comprising 73,079 proteins. Metabolic reconstruction suggests a prevalent saprophytic lifestyle in all "" orders, with marked capacities for the degradation of proteins, lipids, and polysaccharides predominant in plant, bacterial, fungal/crustacean, and eukaryotic algal cell walls. As well, extensive transport and central metabolic pathways for the metabolism of imported monomers were identified. Interestingly, genes and domains suggestive of the production of a cellulosome-e.g., protein-coding genes harboring dockerin I domains attached to a glycosyl hydrolase and scaffoldin-encoding genes harboring cohesin I and CBM37 domains-were identified in order PBSIII_9, GN03, and MSB-4E2 fragments recovered from four anoxic aquatic habitats; hence extending the cellulosomal production capabilities in beyond the Gram-positive In addition to fermentative pathways, a complete electron transport chain with terminal cytochrome oxidases Caa3 (for operation under high oxygen tension) and Cbb3 (for operation under low oxygen tension) were identified in PBSIII_9 and GN03 fragments recovered from oxygenated and partially/seasonally oxygenated aquatic habitats. Our metagenomic recruitment effort hence represents a comprehensive pangenomic view of this yet-uncultured phylum and provides insights broader than and complementary to those gained from genome recovery initiatives focusing on a single or few sampled environments. Our understanding of the phylogenetic diversity, metabolic capabilities, and ecological roles of yet-uncultured microorganisms is rapidly expanding. However, recent efforts mainly have been focused on recovering genomes of novel microbial lineages from a specific sampling site, rather than from a wide range of environmental habitats. To comprehensively evaluate the genomic landscape, putative metabolic capabilities, and ecological roles of yet-uncultured candidate phyla, efforts that focus on the recovery of genomic fragments from a wide range of habitats and that adequately sample the intraphylum diversity within a specific target lineage are needed. Here, we investigated the global distribution patterns and pangenomic diversity of the candidate phylum "" Our results document the preference of specific "" orders to specific habitats, the prevalence of plant polysaccharide degradation abilities within all "" orders, the occurrence of all genes/domains necessary for the production of cellulosomes within three "" orders (GN03, PBSIII_9, and MSB-4E2) in data sets recover...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.