Nucleotide sequence and taxonomy reference databases are critical resources for widespread applications including marker-gene and metagenome sequencing for microbiome analysis, diet metabarcoding, and environmental DNA (eDNA) surveys. Reproducibly generating, managing, using, and evaluating nucleotide sequence and taxonomy reference databases creates a significant bottleneck for researchers aiming to generate custom sequence databases. Furthermore, database composition drastically influences results, and lack of standardization limits cross-study comparisons. To address these challenges, we developed RESCRIPt, a Python 3 software package and QIIME 2 plugin for reproducible generation and management of reference sequence taxonomy databases, including dedicated functions that streamline creating databases from popular sources, and functions for evaluating, comparing, and interactively exploring qualitative and quantitative characteristics across reference databases. To highlight the breadth and capabilities of RESCRIPt, we provide several examples for working with popular databases for microbiome profiling (SILVA, Greengenes, NCBI-RefSeq, GTDB), eDNA and diet metabarcoding surveys (BOLD, GenBank), as well as for genome comparison. We show that bigger is not always better, and reference databases with standardized taxonomies and those that focus on type strains have quantitative advantages, though may not be appropriate for all use cases. Most databases appear to benefit from some curation (quality filtering), though sequence clustering appears detrimental to database quality. Finally, we demonstrate the breadth and extensibility of RESCRIPt for reproducible workflows with a comparison of global hepatitis genomes. RESCRIPt provides tools to democratize the process of reference database acquisition and management, enabling researchers to reproducibly and transparently create reference materials for diverse research applications. RESCRIPt is released under a permissive BSD-3 license at https://github.com/bokulich-lab/RESCRIPt.
Itaconate (methylenesuccinate) was recently identified as a mammalian metabolite whose production is substantially induced during macrophage activation. This compound is a potent inhibitor of isocitrate lyase, a key enzyme of the glyoxylate cycle, which is a pathway required for the survival of many pathogens inside the eukaryotic host. Here we show that numerous bacteria, notably many pathogens such as Yersinia pestis and Pseudomonas aeruginosa, have three genes for itaconate degradation. They encode itaconate coenzyme A (CoA) transferase, itaconyl-CoA hydratase and (S)-citramalyl-CoA lyase, formerly referred to as CitE-like protein. These genes are known to be crucial for survival of some pathogens in macrophages. The corresponding enzymes convert itaconate into the cellular building blocks pyruvate and acetyl-CoA, thus enabling the bacteria to metabolize itaconate and survive in macrophages. The itaconate degradation and detoxification pathways of Yersinia and Pseudomonas are the result of convergent evolution. This work revealed a common persistence factor operating in many pathogenic bacteria.
The occurrence of the proteasome in bacteria is limited to the phylum of actinobacteria, where it is maintained in parallel to the usual bacterial compartmentalizing proteases. The role it plays in these organisms is still not fully understood, but in the human pathogen Mycobacterium tuberculosis (Mtb) the proteasome supports persistence in the host. In complex with the ring-shaped ATPase Mpa (called ARC in other actinobacteria), the proteasome can degrade proteins that have been post-translationally modified with the prokaryotic ubiquitin-like protein Pup. Unlike for the eukaryotic proteasome core particle, no other bacterial proteasome interactors have been identified to date. Here we describe and characterize a novel bacterial proteasome activator of Mycobacterium tuberculosis we termed Bpa (Rv3780), using a combination of biochemical and biophysical methods. Bpa features a canonical C-terminal proteasome interaction motif referred to as the HbYX motif, and its orthologs are only found in those actinobacteria encoding the proteasomal subunits. Bpa can inhibit degradation of Pup-tagged substrates in vitro by competing with Mpa for association with the proteasome. Using negative-stain electron microscopy, we show that Bpa forms a ring-shaped homooligomer that can bind coaxially to the face of the proteasome cylinder. Interestingly, Bpa can stimulate the proteasomal degradation of the model substrate β-casein, which suggests it could play a role in the removal of non-native or damaged proteins.
BackgroundNucleotide sequence and taxonomy reference databases are critical resources for widespread applications including marker-gene and metagenome sequencing for microbiome analysis, diet metabarcoding, and environmental DNA (eDNA) surveys. Reproducibly generating, managing, using, and evaluating nucleotide sequence and taxonomy reference databases creates a significant bottleneck for researchers aiming to generate custom sequence databases. Furthermore, database composition drastically influences results, and lack of standardizations limits cross-study comparisons. To address these challenges, we developed RESCRIPt, a software package for reproducible generation and management of reference sequence taxonomy databases, including dedicated functions that streamline creating databases from popular sources, and functions for evaluating, comparing, and interactively exploring qualitative and quantitative characteristics across reference databases.ResultsTo highlight the breadth and capabilities of RESCRIPt, we provide several examples for working with popular databases for microbiome profiling (SILVA, Greengenes, NCBI-RefSeq, GTDB), eDNA, and diet metabarcoding surveys (BOLD, GenBank), as well as for genome comparison. We show that bigger is not always better, and reference databases with standardized taxonomies and those that focus on type strains have quantitative advantages, though may not be appropriate for all use cases. Most databases appear to benefit from some curation (quality filtering), though sequence clustering appears detrimental to database quality. Finally, we demonstrate the breadth and extensibility of RESCRIPt for reproducible workflows with a comparison of global hepatitis genomes.ConclusionsRESCRIPt provides tools to democratize the process of reference database acquisition and management, enabling researchers to reproducibly and transparently create reference materials for diverse research applications. RESCRIPt is released under a permissive BSD-3 license at https://github.com/bokulich-lab/RESCRIPt.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.