William Johnson scite author profile

BackgroundThe use of sequencing technologies to investigate the microbiome of a sample can positively impact patient healthcare by providing therapeutic targets for personalized disease treatment. However, these samples contain genomic sequences from various sources that complicate the identification of pathogens.ResultsHere we present Clinical PathoScope, a pipeline to rapidly and accurately remove host contamination, isolate microbial reads, and identify potential disease-causing pathogens. We have accomplished three essential tasks in the development of Clinical PathoScope. First, we developed an optimized framework for pathogen identification using a computational subtraction methodology in concordance with read trimming and ambiguous read reassignment. Second, we have demonstrated the ability of our approach to identify multiple pathogens in a single clinical sample, accurately identify pathogens at the subspecies level, and determine the nearest phylogenetic neighbor of novel or highly mutated pathogens using real clinical sequencing data. Finally, we have shown that Clinical PathoScope outperforms previously published pathogen identification methods with regard to computational speed, sensitivity, and specificity.ConclusionsClinical PathoScope is the only pathogen identification method currently available that can identify multiple pathogens from mixed samples and distinguish between very closely related species and strains in samples with very few reads per pathogen. Furthermore, Clinical PathoScope does not rely on genome assembly and thus can more rapidly complete the analysis of a clinical sample when compared with current assembly-based methods. Clinical PathoScope is freely available at: http://sourceforge.net/projects/pathoscope/.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2105-15-262) contains supplementary material, which is available to authorized users.

show abstract

Decontamination of ambient RNA in single-cell RNA-seq with DecontX

Yang

Corbett

Koga

et al. 2019

Preprint

View full text Add to dashboard Cite

Droplet-based microfluidic devices have become widely used to perform single-cell RNA sequencing (scRNAseq) and discover novel cellular heterogeneity in complex biological systems. However, ambient RNA present in the cell suspension can be incorporated into these droplets and aberrantly counted along with a cell's native mRNA. This results in cross-contamination of transcripts between di↵erent cell populations and can potentially decrease the precision of downstream analyses. We developed a novel hierarchical Bayesian method called DecontX to estimate and remove contamination in individual cells from scRNAseq data. DecontX accurately predicted the proportion of contaminated counts in a mixture of mouse and human cells. Decontamination of PBMC datasets removed aberrant expression of cell type specific marker genes from other cell types and improved overall separation of cell clusters. In general, DecontX can be incorporated into scRNA-seq workflows to assess quality of dissociation protocols and improve downstream analyses.

show abstract

ComBat-Seq: batch effect adjustment for RNA-Seq count data

Zhang

Parmigiani

Johnson

2020

Preprint

View full text Add to dashboard Cite

The benefit of integrating batches of genomic data to increase statistical power in differential expression is often hindered by batch effects, or unwanted variation in data caused by differences in technical factors across batches. It is therefore critical to effectively address batch effects in genomic data. Many existing methods for batch effect adjustment assume continuous, bellshaped Gaussian distributions for data. However in RNA-Seq studies where data are skewed, over-dispersed counts, this assumption is not appropriate and may lead to erroneous results. Negative binomial regression models have been used to better capture the properties of counts. We developed a batch correction method, ComBat-Seq, using negative binomial regression. ComBat-Seq retains the integer nature of count data in RNA-Seq studies, making the batch adjusted data 1 compatible with common differential expression software packages that require integer counts. We show in realistic simulations that the ComBat-Seq adjusted data result in better statistical power and control of false positives in differential expression, compared to data adjusted by the other available methods. We further demonstrated in a real data example where ComBat-Seq successfully removes batch effects and recovers the biological signal in the data.

show abstract

Rationale, Background, and Development of Experimental Lake Studies in Northwestern Ontario

Johnson¹,

Vallentyne²

1971

J. Fish. Res. Bd. Can.

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

William Johnson

Using VAAST to Identify an X-linked Disorder Resulting in Lethality in Male Infants Due to N-Terminal Acetyltransferase Deficiency

Clinical PathoScope: rapid alignment and filtration for accurate pathogen identification in clinical samples using unassembled sequencing data

Decontamination of ambient RNA in single-cell RNA-seq with DecontX

ComBat-Seq: batch effect adjustment for RNA-Seq count data

Rationale, Background, and Development of Experimental Lake Studies in Northwestern Ontario

Contact Info

Product

Resources

About