H37Rv is the most widely used Mycobacterium tuberculosis strain, and its genome is globally used as the M. tuberculosis reference sequence. Here, we present Bact-Builder, a pipeline that uses consensus building to generate complete and accurate bacterial genome sequences and apply it to three independently cultured and sequenced H37Rv aliquots of a single laboratory stock. Two of the 4,417,942 base-pair long H37Rv assemblies are 100% identical, with the third differing by a single nucleotide. Compared to the existing H37Rv reference, the new sequence contains ~6.4 kb additional base pairs, encoding ten new regions that include insertions in PE/PPE genes and new paralogs of esxN and esxJ, which are differentially expressed compared to the reference genes. New sequencing and de novo assemblies with Bact-Builder confirm that all 10 regions, plus small additional polymorphisms, are also present in the commonly used H37Rv strains NR123, TMC102, and H37Rv1998. Thus, Bact-Builder shows promise as an improved method to perform accurate and reproducible de novo assemblies of bacterial genomes, and our work provides important updates to the primary M. tuberculosis reference genome.
Most experiments studying bacterial microbiomes rely on the PCR amplification of all or part of the gene for the 16S rRNA subunit, which serves as a biomarker for identifying and quantifying the various taxa present in a microbiome sample. Several computational methods exist for analyzing 16S amplicon-based metagenomics. However, the most-used bioinformatics tools cannot produce quality genus-level or species-level taxonomic calls and may underestimate the degree to which these calls are possible. We used 16S sequencing data from mock bacterial communities to evaluate the sensitivity and specificity of several bioinformatics pipelines and genomic reference libraries used for microbiome analyses, concentrating on measuring the accuracy of species-level taxonomic assignments of 16S amplicon reads. We evaluated the tools Qiime 2, Mothur, PathoScope 2, and Kraken 2 in conjunction with reference libraries from Greengenes, Silva, Kraken, and RefSeq. Profiling tools were compared using publicly available mock community data from several sources, comprising 136 samples with varied species richness and evenness, several different amplified regions within the 16S gene, and both DNA spike-ins and cDNA from collections of plated cells. PathoScope 2 and Kraken 2, both tools designed for whole-genome metagenomics, outperformed Qiime 2 using the DADA2 plugin and Mothur, both of which are theoretically specialized for 16S analyses.
Background: Previous studies of infants born to HIV-positive mothers have linked HIV exposure to poor outcomes from gastrointestinal and respiratory illnesses, and to overall increased mortality rates. The mechanism behind this is unknown, but it is possible that differences in the nasopharyngeal (NP) microbiome between HIV-unexposed and HIV-exposed infants could play a role in perpetuating some outcomes. Methods: We conducted a longitudinal analysis of 170 NP swabs of healthy HIV-exposed, uninfected (HEU; n=10) infants and their HIV(+) mothers and HIV-unexposed, uninfected (HUU; n=10) infants and their HIV(-) mothers. These swabs were identified from a sample library collected in Lusaka, Zambia between 2015 and 2016. Using 16S rRNA gene sequencing, we characterized the maturation of the microbiome over the first 14 weeks of life to determine what quantifiable differences exist between HEU and HUU infants, and what patterns are reflected in the mothers' NP microbiomes. Results: In both HEU and HUU infants, Staphylococcus and Corynebacterium began as primary colonizers of the NP microbiome but were in time replaced by Dolosigranulum, Streptococcus, Moraxella and Haemophilus. When studying differences between infants, the microbe Staphylococcus haemolyticus indicated a distinctive high association with HIV exposure at birth, even when accounting for the interaction between HIV exposure status and time of sampling. When comparing infants to their mothers with paired analyses, HEU infants’ NP microbiome composition was only slightly different from their HIV(+) mothers at birth or 14 weeks, including in their carriage of S. pneumoniae, H. influenzae, and S. haemolyticus. Conclusions: Our analyses indicate that the HEU infants in our study exhibit subtle differences in the NP microbial composition throughout the sampling interval. Given our results and the sampling limitations of our study, we believe that further research must be conducted in order to confidently understand the relationship between HIV exposure and infants’ NP microbiomes.
Rationale: Many blood-based transcriptional gene signatures for tuberculosis (TB) have been developed with potential use to diagnose disease, predict risk of progression from infection to disease, and monitor TB treatment outcomes. However, an unresolved issue is whether gene set enrichment analysis (GSEA) of the signature transcripts alone is sufficient for prediction and differentiation, or whether it is necessary to use the original statistical model created when the signature was derived. Intra-method comparison is complicated by the unavailability of original training data, missing details about the original trained model, and inadequate publicly-available software tools or source code implementing models. To facilitate these signatures′ replicability and appropriate utilization in TB research, comprehensive comparisons between gene set scoring methods with cross-data validation of original model implementations are needed. Objectives: We compared the performance of 19 TB gene signatures across 24 transcriptomic datasets using both re-rebuilt original models and gene set scoring methods to evaluate whether gene set scoring is a reasonable proxy to the performance of the original trained model. We have provided an open-access software implementation of the original models for all 19 signatures for future use. Methods: We considered existing gene set scoring and machine learning methods, including ssGSEA, GSVA, PLAGE, Singscore, and Zscore, as alternative approaches to profile gene signature performance. The sample-size-weighted mean area under the curve (AUC) value was computed to measure each signature′s performance across datasets. Correlation analysis and Wilcoxon paired tests were used to analyze the performance of enrichment methods with the original models. Measurement and Main Results: For many signatures, the gene set scoring method predictions were highly correlated and statistically equivalent to the results given by the original diagnostic models. PLAGE outperformed all other gene scoring methods. In some cases, PLAGE outperformed the original models when considering signatures' weighted mean AUC values and the AUC results within individual studies. Conclusion: Gene set enrichment scoring of existing blood-based biomarker gene sets can distinguish patients with active TB disease from latent TB infection and other clinical conditions with equivalent or improved accuracy compared to the original methods and models. These data justify using gene set scoring methods of published TB gene signatures for predicting TB risk and treatment outcomes, especially when original models are difficult to apply or implement.
Background:Respiratory Syncytial Virus (RSV) is the most common cause of bronchiolitis and lower respiratory tract infections in children in their first year of life, disproportionately affecting infants in developing countries. Previous studies have found that the nasopharyngeal microbiome of infants with RSV infection has specific characteristics that correlate with disease severity, including lower biodiversity, perturbations of the microbiota and differences in relative abundance. These studies have focused on infants seen in clinical or hospital settings, predominantly in developed countries.Methods:We conducted a nested case control study within a random sample of 50 deceased RSV+ infants with age at death ranging from 4 days to 6 months and 50 matched deceased RSV- infants who were all previously enrolled in the Zambia Pertussis and RSV Infant Mortality Estimation (ZPRIME) study. All infants died within the community or within 48 hours of facility admittance. As part of the ZPRIME study procedures, all decedents underwent one-time, post-mortem nasopharyngeal sampling. The current analysis explored the differences between the nasopharyngeal microbiome profiles of RSV+ and RSV- decedents using 16S ribosomal DNA sequencing.Results:We found thatMoraxellawas more abundant in the nasopharyngeal microbiome of RSV+ decedents than in RSV- decedents. Additionally,GemellaandStaphylococcuswere less abundant in RSV+ decedents than in RSV- decedents.Conclusion:These results support previously reported findings of the association between the nasopharyngeal microbiome and RSV and suggest that changes in the abundance of these microbes are likely specific to RSV and may correlate with mortality associated with the disease.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.