2020
DOI: 10.1093/bioinformatics/btaa1009
|View full text |Cite
|
Sign up to set email alerts
|

glmGamPoi: fitting Gamma-Poisson generalized linear models on single cell count data

Abstract: Motivation The Gamma-Poisson distribution is a theoretically and empirically motivated model for the sampling variability of single cell RNA-sequencing counts (Grün et al., 2014; Svensson, 2020; Silverman et al., 2018; Hafemeister and Satija, 2019) and an essential building block for analysis approaches including differential expression analysis (Robinson et al., 2010; McCarthy et al., 2012; Anders and Huber, 2010; Love et al., 2014), principal component analysis (Townes et al., 2019) and fac… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
148
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 178 publications
(165 citation statements)
references
References 18 publications
0
148
0
Order By: Relevance
“…To identify genes selectively expressed in OB projection neurons, we used the R-package glmGamPoi ( Ahlmann-Eltze and Huber, 2021 ). We combined the average expression levels of the top differentially expressed genes for each cell type and found it to be highly specific for each cluster ( Figure 3A,C ).…”
Section: Resultsmentioning
confidence: 99%
“…To identify genes selectively expressed in OB projection neurons, we used the R-package glmGamPoi ( Ahlmann-Eltze and Huber, 2021 ). We combined the average expression levels of the top differentially expressed genes for each cell type and found it to be highly specific for each cluster ( Figure 3A,C ).…”
Section: Resultsmentioning
confidence: 99%
“…Two subsets of all naïve CD4 and CD8 T cells from healthy paediatric donors were reclustered after hypervariable gene selection within each subset to generate the UMAP in Figure 3d. Differential gene expression across age within these subsets was tested by fitting a gamma-poisson generalized linear model on log 2 transformed age and by creating pseudo-bulks of each donor with the glmGamPoi package 91, 92 [https://doi.org/10.1093/bioinformatics/btaa1009].…”
Section: Methodsmentioning
confidence: 99%
“…In Figure 2A-B, we fit NB GLM to each gene in each dataset, in order to estimate the inverse overdispersion parameter θ. We model the observed counts for each gene using the following model gene_umi ∼ 1, and estimate parameters using glmGamPoi::glm_gp(gene_umi, model, offset=log(total_umi), size_factors=FALSE) using the glmGamPoi package (49). We perform this procedure for all genes where the variance of the observed counts exceeds the mean.…”
Section: Comparing Levels Of Overdispersion Across Datasetsmentioning
confidence: 99%
“…As outlier values can originate from multiple sources including the presence of cell doublets, errors in UMI collapsing, or ambient RNA, we have empirically improved performance when using the geometric mean instead of the arithmetic mean. Although sctransform supports multiple estimators for θ, we recommend the use of glmGamPoi (49), an alternate estimator that is more robust and faster.…”
Section: Modeling Scrna-seq Datasets With Sctransformmentioning
confidence: 99%