Kwang Mo Jeong scite author profile

We frequently encounter count outcomes that exhibit extra variation. This paper considers several alternative models for overdispersed count responses, such as the quasi-Poisson model, the zero-inflated Poisson model, and the negative binomial model, with a special focus on the generalized linear mixed model. We also explain various goodness-of-fit criteria, discussing when each is applicable and cautioning against misuse depending on the pattern of response categories. The overdispersion models for count data are illustrated through two examples with different response patterns.
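As a rough illustration of what "extra variation" means here, the sketch below computes the Pearson dispersion estimate for an intercept-only Poisson fit. The function name `pearson_dispersion` and the data are hypothetical and not taken from the paper; a value well above 1 is the usual informal signal that a plain Poisson model understates the variance.

```python
from statistics import mean

def pearson_dispersion(counts):
    """Pearson dispersion estimate for an intercept-only Poisson model."""
    n = len(counts)
    mu = mean(counts)  # Poisson MLE of the mean when there are no covariates
    # Pearson chi-squared: squared residuals scaled by the Poisson variance (= mu)
    pearson_chi2 = sum((y - mu) ** 2 / mu for y in counts)
    return pearson_chi2 / (n - 1)  # residual degrees of freedom: n - 1

# Hypothetical counts with many zeros and a few large values
counts = [0, 0, 1, 0, 7, 2, 0, 12, 1, 0]
phi = pearson_dispersion(counts)
print(f"estimated dispersion: {phi:.2f}")
```

A quasi-Poisson analysis would, under this kind of estimate, inflate the Poisson standard errors by the square root of the dispersion.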


Tests for homogeneity of proportions in clustered binomial data

Jeong

2016

CSAM


When we observe binary responses within a cluster (such as laboratory rats from the same litter or cage), they are usually correlated with each other. In clustered binomial counts, the independence assumption is violated and we encounter extra variation. In the presence of extra variation, the ordinary statistical analyses of binomial data are inappropriate. In testing the homogeneity of proportions between several treatment groups, the classical Pearson chi-squared test has a severe flaw in controlling the Type I error rate. We focus on modifying the chi-squared statistic by incorporating variance inflation factors. We suggest a method that adjusts the data using a dispersion estimate based on a quasi-likelihood model. We explain the testing procedure through an illustrative example and compare the performance of the modified chi-squared test with competing statistics through a Monte Carlo study.
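The general idea of deflating a chi-squared statistic by a dispersion estimate can be sketched as follows. This is a generic quasi-likelihood-style adjustment under stated assumptions (a single common dispersion factor, estimated from cluster-level Pearson residuals within each group), not necessarily the statistic proposed in the paper; the function name `adjusted_chi2` and the data are hypothetical.

```python
def adjusted_chi2(groups):
    """groups: list of treatment groups, each a list of (successes, trials) clusters.

    Returns (naive Pearson chi2, dispersion estimate, adjusted chi2).
    Assumes every group proportion lies strictly between 0 and 1.
    """
    total_x = sum(x for g in groups for x, _ in g)
    total_n = sum(n for g in groups for _, n in g)
    p = total_x / total_n  # pooled proportion under homogeneity

    chi2 = 0.0     # naive homogeneity statistic, treating all trials as independent
    pearson = 0.0  # cluster-level Pearson statistic around each group proportion
    for g in groups:
        gx = sum(x for x, _ in g)
        gn = sum(n for _, n in g)
        p_i = gx / gn
        chi2 += (gx - gn * p) ** 2 / (gn * p * (1 - p))
        pearson += sum((x - n * p_i) ** 2 / (n * p_i * (1 - p_i)) for x, n in g)

    n_clusters = sum(len(g) for g in groups)
    # Common dispersion factor; truncated at 1 so the test is never made more liberal
    phi = max(1.0, pearson / (n_clusters - len(groups)))
    return chi2, phi, chi2 / phi

# Hypothetical data: two groups of four clusters, ten trials per cluster
control = [(1, 10), (9, 10), (2, 10), (8, 10)]
treated = [(3, 10), (10, 10), (4, 10), (9, 10)]
chi2, phi, chi2_adj = adjusted_chi2([control, treated])
print(f"naive chi2 = {chi2:.3f}, dispersion = {phi:.3f}, adjusted chi2 = {chi2_adj:.3f}")
```

In this sketch the adjusted statistic would be referred to a chi-squared distribution with (number of groups - 1) degrees of freedom, which is an approximation rather than an exact result.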


Effects on Regression Estimates under Misspecified Generalized Linear Mixed Models for Counts Data

Jeong

2012

Korean Journal of Applied Statistics


The generalized linear mixed model (GLMM) is widely used for fitting categorical responses in clustered data. In the numerical approximation of the likelihood function, normality is assumed for the random effects distribution; consequently, commercial statistical packages also routinely fit GLMMs under this normality assumption. We may also encounter departures from the distributional assumption on the response variable. It would be interesting to investigate the impact of distributional misspecification on the parameter estimates; however, there has been limited research on these topics. We study the sensitivity or robustness of the maximum likelihood estimators (MLEs) of the GLMM for count data when the true underlying random effects distribution is normal, gamma, exponential, or a mixture of two normal distributions. We also consider the effects on the MLEs when we fit a Poisson-normal GLMM while the outcomes are generated from an overdispersed negative binomial distribution. Through a small-scale Monte Carlo study, we check the empirical coverage probabilities of the parameters and the biases of the MLEs of the GLMM.


Accuracy Measures of Empirical Bayes Estimator for Mean Rates

Jeong

2010

Communications for Statistical Applications and Methods


Count outcomes commonly occur in disease mapping of mortality rates or disease rates. A Poisson distribution is usually assumed as the model for disease rates, in conjunction with a gamma prior. A small area typically refers to a small geographical area or demographic group for which very little information is available from sample surveys. In this situation, model-based estimation, which uses auxiliary variables from various administrative sources, is very popular. The empirical Bayes estimator under the Poisson-gamma model is considered together with its accuracy measures. An accuracy measure based on bootstrap samples adjusts for the underestimation incurred when the posterior variance is used as an estimator of the true mean squared error. We explain the suggested method through a practical dataset of hitters in baseball games. We also perform a Monte Carlo study to compare the accuracy measures of mean squared error.
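The Poisson-gamma conjugacy behind this kind of estimator can be sketched as below, assuming a count y ~ Poisson(exposure × rate) and a Gamma(shape a, rate b) prior on the rate. The hyperparameters a and b are taken as given here, whereas an empirical Bayes analysis would estimate them from the data; the function name and all numbers are hypothetical.

```python
def poisson_gamma_posterior(y, exposure, a, b):
    """Posterior mean and variance of a rate under a Poisson-gamma model.

    With y ~ Poisson(exposure * rate) and rate ~ Gamma(shape=a, rate=b),
    the posterior is Gamma(shape=a + y, rate=b + exposure).
    """
    shape = a + y
    rate = b + exposure
    return shape / rate, shape / rate ** 2

# Hypothetical small area: 3 events over 100 units of exposure,
# with assumed hyperparameters a = 2, b = 50 (prior mean rate 0.04)
post_mean, post_var = poisson_gamma_posterior(y=3, exposure=100.0, a=2.0, b=50.0)
print(f"posterior mean = {post_mean:.4f}, posterior variance = {post_var:.6f}")
```

The posterior mean shrinks the raw rate y / exposure toward the prior mean a / b. The posterior variance treats a and b as known, which is why it tends to understate the mean squared error once those hyperparameters are themselves estimated.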



Kwang Mo Jeong

A statistical method for structure learning of Bayesian networks from data

Modelling Count Responses with Overdispersion
