Current-day metagenomics analyses increasingly involve de novo taxonomic classification of long DNA sequences and metagenome-assembled genomes. Here, we show that the conventional best-hit approach often leads to classifications that are too specific, especially when the sequences represent novel deep lineages. We present a classification method that integrates multiple signals to classify sequences (Contig Annotation Tool, CAT) and metagenome-assembled genomes (Bin Annotation Tool, BAT). Classifications are automatically made at low taxonomic ranks if closely related organisms are present in the reference database and at higher ranks otherwise. The result is a high classification precision even for sequences from considerably unknown organisms.
Current-day metagenomics increasingly requires taxonomic classification of long DNA sequences and metagenome-assembled genomes (MAGs) of unknown microorganisms. We show that the standard best-hit approach often leads to classifications that are too specific. We present tools to classify highquality metagenomic contigs (Contig Annotation Tool, CAT) and MAGs (Bin Annotation Tool, BAT) and thoroughly benchmark them with simulated metagenomic sequences that are classified against a reference database where related sequences are increasingly removed, thereby simulating increasingly unknown queries. We find that the query sequences are correctly classified at low taxonomic ranks if closely related organisms are present in the reference database, while classifications are made higher in the taxonomy when closely related organisms are absent, thus avoiding spurious classification specificity. In a real-world challenge, we apply BAT to over 900 MAGs from a recent rumen metagenomics study and classified 97% consistently with prior phylogeny-based classifications, but in a fully automated fashion.
Dysoxic marine waters (DMW, < 1 μM oxygen) are currently expanding in volume in the oceans, which has biogeochemical, ecological and societal consequences on a global scale. In these environments, distinct bacteria drive an active sulfur cycle, which has only recently been recognized for open-ocean DMW. This review summarizes the current knowledge on these sulfur-cycling bacteria. Critical bottlenecks and questions for future research are specifically addressed. Sulfate-reducing bacteria (SRB) are core members of DMW. However, their roles are not entirely clear, and they remain largely uncultured. We found support for their remarkable diversity and taxonomic novelty by mining metagenome-assembled genomes from the Black Sea as model ecosystem. We highlight recent insights into the metabolism of key sulfur-oxidizing SUP05 and Sulfurimonas bacteria, and discuss the probable involvement of uncultivated SAR324 and BS-GSO2 bacteria in sulfur oxidation. Uncultivated Marinimicrobia bacteria with a presumed organoheterotrophic metabolism are abundant in DMW. Like SRB, they may use specific molybdoenzymes to conserve energy from the oxidation, reduction or disproportionation of sulfur cycle intermediates such as S 0 and thiosulfate, produced from the oxidation of sulfide. We expect that tailored sampling methods and a renewed focus on cultivation will yield deeper insight into sulfur-cycling bacteria in DMW.
Archaea synthesize membranes of isoprenoid lipids that are ether-linked to glycerol-1-phosphate (G1P), while Bacteria/Eukarya produce membranes consisting of fatty acids ester-bound to glycerol-3-phosphate (G3P). This dichotomy in membrane lipid composition (i.e., the ‘lipid divide’) is believed to have arisen after the Last Universal Common Ancestor (LUCA). A leading hypothesis is that LUCA possessed a heterochiral ‘mixed archaeal/bacterial membrane’. However, no natural microbial representatives supporting this scenario have been shown to exist today. Here, we demonstrate that bacteria of the Fibrobacteres–Chlorobi–Bacteroidetes (FCB) group superphylum encode a putative archaeal pathway for ether-bound isoprenoid membrane lipids in addition to the bacterial fatty acid membrane pathway. Key genes were expressed in the environment and their recombinant expression in Escherichia coli resulted in the formation of a ‘mixed archaeal/bacterial membrane’. Genomic evidence and biochemical assays suggest that the archaeal-like lipids of members of the FCB group could possess either a G1P or G3P stereochemistry. Our results support the existence of ‘mixed membranes’ in natural environments and their stability over a long period in evolutionary history, thereby bridging a once-thought fundamental divide in biology.
In the past few decades, population genetics and phylogeographic studies have improved our knowledge of connectivity and population demography in marine environments. Studies of deep‐sea hydrothermal vent populations have identified barriers to gene flow, hybrid zones, and demographic events, such as historical population expansions and contractions. These deep‐sea studies, however, used few loci, which limit the amount of information they provided for coalescent analysis and thus our ability to confidently test complex population dynamics scenarios. In this study, we investigated population structure, demographic history, and gene flow directionality among four Western Pacific hydrothermal vent populations of the vent limpet Lepetodrilus aff. schrolli. These vent sites are located in the Manus and Lau back‐arc basins, currently of great interest for deep‐sea mineral extraction. A total of 42 loci were sequenced from each individual using high‐throughput amplicon sequencing. Amplicon sequences were analyzed using both genetic variant clustering methods and evolutionary coalescent approaches. Like most previously investigated vent species in the South Pacific, L. aff. schrolli showed no genetic structure within basins but significant differentiation between basins. We inferred significant directional gene flow from Manus Basin to Lau Basin, with low to no gene flow in the opposite direction. This study is one of the very few marine population studies using >10 loci for coalescent analysis and serves as a guide for future marine population studies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.