Bacteriophages typically have small genomes 1 and depend on their bacterial hosts for replication 2 . Here we sequenced DNA from diverse ecosystems and found hundreds of phage genomes with lengths of more than 200 kilobases (kb), including a genome of 735 kb, which is-to our knowledge-the largest phage genome to be described to date. Thirty-five genomes were manually curated to completion (circular and no gaps). Expanded genetic repertoires include diverse and previously undescribed CRISPR-Cas systems, transfer RNAs (tRNAs), tRNA synthetases, tRNA-modification enzymes, translation-initiation and elongation factors, and ribosomal proteins. The CRISPR-Cas systems of phages have the capacity to silence host transcription factors and translational genes, potentially as part of a larger interaction network that intercepts translation to redirect biosynthesis to phage-encoded functions. In addition, some phages may repurpose bacterial CRISPR-Cas systems to eliminate competing phages. We phylogenetically define the major clades of huge phages from human and other animal microbiomes, as well as from oceans, lakes, sediments, soils and the built environment. We conclude that the large gene inventories of huge phages reflect a conserved biological strategy, and that the phages are distributed across a broad bacterial host range and across Earth's ecosystems.Phages-viruses that infect bacteria-are considered distinct from cellular life owing to their inability to carry out most biological processes required for reproduction. They are agents of ecosystem change because they prey on specific bacterial populations, mediate lateral gene transfer, alter host metabolism and redistribute bacterially derived compounds through cell lysis 2-4 . They spread antibiotic resistance 5 and disperse pathogenicity factors that cause disease in humans and animals 6,7 . Most knowledge about phages is based on laboratorystudied examples, the vast majority of which have genomes that are a few tens of kb in length. Widely used isolation-based methods select against large phage particles, and they can be excluded from phage concentrates obtained by passage through 100-nm or 200-nm filters 1 . In 2017, only 93 isolated phages with genomes that were more than 200 kb in length were published 1 . Sequencing of whole-community DNA can uncover phage-derived fragments; however, large genomes can still escape detection owing to fragmentation 8 . A new clade of human-and animal-associated megaphages was recently described on the basis of genomes that were manually curated to completion from metagenomic datasets 9 . This finding prompted us to carry out a more-comprehensive analysis of microbial communities to evaluate the prevalence, diversity and ecosystem distribution of phages with large genomes. Previously, phages with genomes of more than 200 kb have been referred to as 'jumbophages' 1 or, in the case of phages with genomes of more than 500 kb, as megaphages 9 . As the set reconstructed here span both size ranges we refer to them simply as 'huge phage...
Candidate phyla radiation (CPR) bacteria and DPANN archaea are unisolated, small-celled symbionts that are often detected in groundwater. The effects of groundwater geochemistry on the abundance, distribution, taxonomic diversity and host association of CPR bacteria and DPANN archaea has not been studied. Here, we performed genome-resolved metagenomic analysis of one agricultural and seven pristine groundwater microbial communities and recovered 746 CPR and DPANN genomes in total. The pristine sites, which serve as local sources of drinking water, contained up to 31% CPR bacteria and 4% DPANN archaea. We observed little species-level overlap of metagenome-assembled genomes (MAGs) across the groundwater sites, indicating that CPR and DPANN communities may be differentiated according to physicochemical conditions and host populations. Cryogenic transmission electron microscopy imaging and genomic analyses enabled us to identify CPR and DPANN lineages that reproducibly attach to host cells and showed that the growth of CPR bacteria seems to be stimulated by attachment to host-cell surfaces. Our analysis reveals site-specific diversity of CPR bacteria and DPANN archaea that coexist with diverse hosts in groundwater aquifers. Given that CPR and DPANN organisms have been identified in human microbiomes and their presence is correlated with diseases such as periodontitis, our findings are relevant to considerations of drinking water quality and human health.
Rubisco sustains the biosphere through the fixation of CO 2 into biomass. In plants and cyanobacteria, Form I Rubisco is structurally comprised of large and small subunits, whereas all other Rubisco Forms lack small subunits. Thus, the rise of the Form I complex through the innovation of small subunits represents a key, yet poorly understood, transition in Rubisco's evolution. Through metagenomic analyses, we discovered a previously uncharacterized clade sister to Form I Rubisco that evolved without small subunits. This clade diverged prior to the evolution of cyanobacteria and the origin of the small subunit; thus, it provides a unique reference point to advance our understanding of Form I Rubisco evolution. Structural and kinetic data presented here reveal how a proto-Form I Rubisco assembled and functioned without the structural stability imparted from small subunits. Our findings provide insight into a key evolutionary transition of the most abundant enzyme on Earth and the predominant entry point for nearly all global organic carbon.
Studying the genetic differences between related microorganisms from different environment types can indicate factors associated with their movement among habitats. This is particularly interesting for bacteria from the Candidate Phyla Radiation because their minimal metabolic capabilities require associations with microbial hosts.
Currently described members of Elusimicrobia, a relatively recently defined phylum, are animal-associated and rely on fermentation. However, free-living Elusimicrobia have been detected in sediments, soils and groundwater, raising questions regarding their metabolic capacities and evolutionary relationship to animal-associated species. Here, we analyzed 94 draftquality, non-redundant genomes, including 30 newly reconstructed genomes, from diverse animal-associated and natural environments. Genomes group into 12 clades, 10 of which previously lacked reference genomes. Groundwater-associated Elusimicrobia are predicted to be capable of heterotrophic or autotrophic lifestyles, reliant on oxygen or nitrate/nitrite-dependent respiration, or a variety of organic compounds and Rhodobacter nitrogen fixation (Rnf) complex-dependent acetogenesis with hydrogen and carbon dioxide as the substrates. Genomes from two clades of groundwater-associated Elusimicrobia often encode a new group of nitrogenase paralogs that co-occur with an extensive suite of radical S-Adenosylmethionine (SAM) proteins. We identified similar genomic loci in genomes of bacteria from the Gracilibacteria phylum and the Myxococcales order and predict that the gene clusters reduce a tetrapyrrole, possibly to form a novel cofactor. The animal-associated Elusimicrobia clades nest phylogenetically within two free-living-associated clades. Thus, we propose an evolutionary trajectory in which some Elusimicrobia adapted to animal-associated lifestyles from free-living species via genome reduction.
Knowledge of microbial gene functions comes from manipulating the DNA of individual species in isolation from their natural communities. While this approach to microbial genetics has been foundational, its requirement for culturable microorganisms has left the majority of microbes and their interactions genetically unexplored. Here we describe a generalizable methodology for editing the genomes of specific organisms within a complex microbial community. First, we identified genetically tractable bacteria within a community using a new approach, Environmental Transformation Sequencing (ET-Seq), in which non-targeted transposon integrations were mapped and quantified following community delivery. ET-Seq was repeated with multiple delivery strategies for both a nine-member synthetic bacterial community and a ~200-member microbial bioremediation community. We achieved insertions in 10 species not previously isolated and identified natural competence for foreign DNA integration that depends on the presence of the community. Second, we developed and used DNA-editing All-in-one RNA-guided CRISPR-Cas Transposase (DART) systems for targeted DNA insertion into organisms identified as tractable by ET-Seq, enabling organism- and locus-specific genetic manipulation within the community context. These results demonstrate a strategy for targeted genome editing of specific organisms within microbial communities, establishing a new paradigm for microbial manipulation relevant to research and applications in human, environmental, and industrial microbiomes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.