2011
DOI: 10.1198/jasa.2011.tm10552
|View full text |Cite
|
Sign up to set email alerts
|

Bayesian Kernel Mixtures for Counts

Abstract: Although Bayesian nonparametric mixture models for continuous data are well developed, there is a limited literature on related approaches for count data. A common strategy is to use a mixture of Poissons, which unfortunately is quite restrictive in not accounting for distributions having variance less than the mean. Other approaches include mixing multinomials, which requires finite support, and using a Dirichlet process prior with a Poisson base measure, which does not allow smooth deviations from the Poisso… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
87
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
8
1

Relationship

2
7

Authors

Journals

citations
Cited by 74 publications
(88 citation statements)
references
References 39 publications
0
87
0
Order By: Relevance
“…Kernels are a widely used and flexible method to deal with complex data of various types: they can be obtained from β-diversity measures (Bray and Curtis, 1957;Lozupone et al, 2007) to explore microbiome datasets. They can also account for datasets obtained as read counts by the discrete Poisson kernel (Canale and Dunson, 2011) and are also commonly adopted to quantifies genetic similarities by the state kernel (Kwee et al, 2008;Wu et al, 2010). Our contribution is to propose three alternative approaches able to combine several kernels into one meta-kernel in an unsupervised framework.…”
Section: Introductionmentioning
confidence: 99%
“…Kernels are a widely used and flexible method to deal with complex data of various types: they can be obtained from β-diversity measures (Bray and Curtis, 1957;Lozupone et al, 2007) to explore microbiome datasets. They can also account for datasets obtained as read counts by the discrete Poisson kernel (Canale and Dunson, 2011) and are also commonly adopted to quantifies genetic similarities by the state kernel (Kwee et al, 2008;Wu et al, 2010). Our contribution is to propose three alternative approaches able to combine several kernels into one meta-kernel in an unsupervised framework.…”
Section: Introductionmentioning
confidence: 99%
“…It is common to incorporate latent variables in Poisson factor models (e.g Dunson and Herring, 2005). Including this generalization requires minor modifications of our current procedures, however, as noted in Canale and Dunson (2011), there is a pitfall in such models due to the dual role of the latent variable component in controlling the degree of dependence and the magnitude of over-dispersion in the marginal distributions. Canale and Dunson (2011) address these issues via a rounded kernel method which improves flexibility in modeling count variables.…”
Section: Discussionmentioning
confidence: 99%
“…Including this generalization requires minor modifications of our current procedures, however, as noted in Canale and Dunson (2011), there is a pitfall in such models due to the dual role of the latent variable component in controlling the degree of dependence and the magnitude of over-dispersion in the marginal distributions. Canale and Dunson (2011) address these issues via a rounded kernel method which improves flexibility in modeling count variables. Our current efforts are aimed at adapting these procedures to develop nonparametric approaches for inference on the distribution of weighted networks.…”
Section: Discussionmentioning
confidence: 99%
“…The Gibbs sampler iterates through the following steps. Step 1 : the first step involves data augmentation in the spirit of Canale and Dunson () to simulate the latent variable yi*false(sjfalse) from the corresponding truncated normal distribution, conditionally on the model parameters and the cluster indicators. Assuming that the random effects uifalse(sjfalse) are drawn from N(italicθh,1), we haveyi*false(sjfalse)Nfalse(xibold-italicβ+zjbold-italicγ+θh,1false)1false{(ayifalse(sjfalse),ayifalse(sjfalse)+1)false},where Nfalse(italicμ,σ2false)1false{normalΔfalse} denotes the normal density with mean μ and variance σ2, truncated to the set Δ. Step 2 : next, conditionally on the latent continuous variables, the cluster indicators ξ=(italicξifalse(sjfalse)) and the atoms θ , we sample the regression parameters.…”
Section: Bayesian Inferencementioning
confidence: 99%