Synteny conservation analysis is a well-established methodology to investigate the potential functional role of unknown prokaryotic genes. However, bioinformatic tools to reconstruct and visualise genomic contexts usually depend on slow computations, are restricted to narrow taxonomic ranges, and/or do not allow for the functional and interactive exploration of neighbouring genes across different species. Here, we present GeCoViz, an online resource built upon 12 221 reference prokaryotic genomes that provides fast and interactive visualisation of custom genomic regions anchored by any target gene, which can be sought by either name, orthologous group (KEGGs, eggNOGs), protein domain (PFAM) or sequence. To facilitate functional and evolutionary interpretation, GeCoViz allows to customise the taxonomic scope of each analysis and provides comprehensive annotations of the neighbouring genes. Interactive visualisation options include, among others, the scaled representations of gene lengths and genomic distances, and on the fly calculation of synteny conservation of neighbouring genes, which can be highlighted based on custom thresholds. The resulting plots can be downloaded as high-quality images for publishing purposes. Overall, GeCoViz offers an easy-to-use, comprehensive, fast and interactive web-based tool for investigating the genomic context of prokaryotic genes, and is freely available at https://gecoviz.cgmlab.org
Several archaeal lineages thrive in high, saturating salt concentrations. These extremely halophilic archaea, including Halobacteria, Nanohaloarchaeota, Methanonatronarchaeia, and Haloplasmatales, must maintain osmotic equilibrium with their environment. For this, they use a 'salt-in' strategy, which involves pumping molar concentrations of potassium into the cells, which, in turn, has led to extensive proteome-wide modifications to prevent protein aggregation. However, the evolutionary history underlying these adaptations remains poorly understood. In particular, the number of times that these dramatic proteome-sweeping changes occurred is unclear due to the conflicting phylogenetic positions found for several of these lineages. Here, we present a resolved phylogeny of extremely halophilic archaea obtained using improved taxon sampling and state-of-the-art phylogenetic approaches designed to cope with the strong compositional biases of their proteomes. We describe two new uncultured lineages, Afararchaeaceae and Asboarchaeaceae, which break the long branches at the base of Haloarchaea and Nanohaloarchaeota, respectively. Our extensive phylogenomic analyses show that at least four independent adaptations to extreme halophily occurred during archaeal evolution. Finally, gene-tree/species-tree reconciliation suggests that gene duplication and horizontal gene transfer played an important role in this process, for example, by spreading key genes (such as those encoding potassium transporters) across the various extremely halophilic lineages.
We present draft genome assemblies of Beta patula, a critically endangered wild beet endemic to the Madeira archipelago, and of the closely related Beta vulgaris ssp. maritima (sea beet). Evidence-based reference gene sets for B. patula and sea beet were generated, consisting of 25 127 and 27 662 genes, respectively. The genomes and gene sets of the two wild beets were compared with their cultivated sister taxon B. vulgaris ssp. vulgaris (sugar beet). Large syntenic regions were identified, and a display tool for automatic genome-wide synteny image generation was developed. Phylogenetic analysis based on 9861 genes showing 1:1:1 orthology supported the close relationship of B. patula to sea beet and sugar beet. A comparative analysis of the Rz2 locus, responsible for rhizomania resistance, suggested that the sequenced B. patula accession was rhizomania susceptible. Reference karyotypes for the two wild beets were established, and genomic rearrangements were detected. We consider our data as highly valuable and comprehensive resources for wild beet studies, B. patula conservation management, and sugar beet breeding research.Proteins of < 10 aa were removed. The average length of genes that had at least one transcript showing 1-100% mRNA-seq evidence was 5695.4 bp (B. patula) and 5457.3 bp (B.v. maritima), respectively. aa, amino acids; avg., average; bp, base pairs; evid., evidence; tr., transcripts.
Most microbes on our planet remain uncultured and poorly studied. Recent efforts to catalog their genetic diversity have revealed that a significant fraction of the observed microbial genes are functional and evolutionary untraceable, lacking homologs in reference databases. Despite their potential biological value, these apparently unrelated orphan genes from uncultivated taxa have been routinely discarded in metagenomics surveys. Here, we analyzed a global multi-habitat dataset covering 151,697 medium and high-quality metagenome assembled genomes (MAGs), 5,969 single-amplified genomes (SAGs), and 19,642 reference genomes, and identified 413,335 highly curated novel protein families under strong purifying selection out of previously considered orphan genes. These new protein families, representing a three-fold increase over the total number of prokaryotic orthologous groups described to date, spread out across the prokaryote phylogeny, can span multiple habitats, and are notably overrepresented in recently discovered taxa. By genomic context analysis, we pinpointed thousands of unknown protein families to phylogenetically conserved operons linked to energy production, xenobiotic metabolism and microbial resistance. Most remarkably, we found 980 previously neglected protein families that can accurately distinguish entire uncultivated phyla, classes, and orders, likely representing synapomorphic traits that fostered their divergence. The systematic curation and evolutionary analysis of the unique genetic repertoire of uncultivated taxa opens new avenues for understanding the biology and ecological roles of poorly explored lineages at a global scale.
Microbes in marine sediments play crucial roles in global carbon and nutrient cycling. However, our understanding of microbial diversity and physiology on the ocean floor is limited. Here, we use phylogenomic analyses of thousands of metagenome-assembled genomes (MAGs) from coastal and deep-sea sediments to identify 55 MAGs that are phylogenetically distinct from previously described bacterial phyla. We propose that these MAGs belong to 4 novel bacterial phyla (Blakebacterota, Orphanbacterota, Arandabacterota, and Joyebacterota) and a previously proposed phylum (AABM5-125-24), all of them within the FCB superphylum. Comparison of their rRNA genes with public databases reveals that these phyla are globally distributed in different habitats, including marine, freshwater, and terrestrial environments. Genomic analyses suggest these organisms are capable of mediating key steps in sedimentary biogeochemistry, including anaerobic degradation of polysaccharides and proteins, and respiration of sulfur and nitrogen. Interestingly, these genomes code for an unusually high proportion (~9% on average, up to 20% per genome) of protein families lacking representatives in public databases. Genes encoding hundreds of these protein families colocalize with genes predicted to be involved in sulfur reduction, nitrogen cycling, energy conservation, and degradation of organic compounds. Our findings advance our understanding of bacterial diversity, the ecological roles of these bacteria, and potential links between novel gene families and metabolic processes in the oceans.
Microbes are the most abundant form of life on Earth and play crucial roles in carbon and nutrient cycling. Despite their crucial role, our understanding of microbial diversity and physiology on the ocean floor is limited. To address this gap in knowledge, we obtained 55 novel bacterial metagenome-assembled genomes (MAGs) from coastal and deep sea sediments. Phylogenomic analyses revealed they belong to four new and one poorly described bacterial phyla. Comparison of their rRNA genes with public databases revealed they are all globally distributed. These novel bacteria are capable of the anaerobic degradation of polysaccharides and proteins, and the respiration of sulfur and nitrogen. These genomes code for an unusually high proportion (~ 9, and up to 20% per genome) of protein families lacking representatives in public databases. Hundreds of these protein families are predicted to be co-localized with genes for sulfur reduction, nitrogen cycling, energy conservation, and the degradation of organic compounds. These findings expand our understanding of microbial diversity and link previously overlooked gene families with key metabolic processes in the oceans.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.