“…More specifically, moving beyond existing algorithms that model over-dispersed counts with the NB distribution, our Bayesian nonparametric (BNP) algorithms model the gene counts using the gamma-negative binomial process (GNBP) [Zhou et al, 2016], which mixes the NB shape parameter for each gene with the distribution of the weight of an atom of a gamma process [Ferguson, 1973], or beta-negative binomial process (BNBP) [Zhou et al, 2012, Zhou and Carin, 2015, Broderick et al, 2015, which mixes the NB probability parameter of each gene with the distribution of the weight of an atom of a beta process [Hjort, 1990]. In addition to the GNBP and BNBP, for comparison, we have extended the negative binomial process (NBP) of Zhou et al [2016] by multiplying the gene-specific Poisson rates with gamma distributed sample-specific scaling parameters, and refer to it as the scaled NBP. While the NBP of Zhou et al [2016] is not expected to work well since it does not explicitly model the variation of a sample's total count, the scaled NBP, even with a scaling parameter for each sample to capture that variation, is found to provide poor performance, indicating a clear limitation of the Poisson distribution assumption.…”