Testing variance components in balanced linear growth curve models

Pharmaceutical Statistics

2019

Self Cite

Nonlinear mixed-effects models are being widely used for the analysis of longitudinal data, especially from pharmaceutical research. They use random effects which are latent and unobservable variables so the random-effects distribution is subject to misspecification in practice. In this paper, we first study the consequences of misspecifying the random-effects distribution in nonlinear mixed-effects models. Our study is focused on Gauss-Hermite quadrature, which is now the routine method for calculation of the marginal likelihood in mixed models. We then present a formal diagnostic test to check the appropriateness of the assumed random-effects distribution in nonlinear mixed-effects models, which is very useful for real data analysis. Our findings show that the estimates of fixed-effects parameters in nonlinear mixed-effects models are generally robust to deviations from normality of the random-effects distribution, but the estimates of variance components are very sensitive to the distributional assumption of random effects. Furthermore, a misspecified random-effects distribution will either overestimate or underestimate the predictions of random effects. We illustrate the results using a real data application from an intensive pharmacokinetic study.

Section: Conclusion and Discussionmentioning

confidence: 99%

Nonlinear mixed‐effects models with misspecified random‐effects distribution

Pharmaceutical Statistics

2019

Self Cite

“…Again, care ought to be taken when calculating the caught variance with associated or correlated loadings. Note that assessing and testing a significant variance in correlated models is a nonstandard testing problem [11][12][13][14].…”

Section: Correlation Of Loadingsmentioning

confidence: 99%

Sparse Principal Component Analysis for Natural Language Processing

Lawal

2020

Ann. Data. Sci.

Self Cite

High dimensional data are rapidly growing in many different disciplines, particularly in natural language processing. The analysis of natural language processing requires working with high dimensional matrices of word embeddings obtained from text data. Those matrices are often sparse in the sense that they contain many zero elements. Sparse principal component analysis is an advanced mathematical tool for the analysis of high dimensional data. In this paper, we study and apply the sparse principal component analysis for natural language processing, which can effectively handle large sparse matrices. We study several formulations for sparse principal component analysis, together with algorithms for implementing those formulations. Our work is motivated and illustrated by a real text dataset. We find that the sparse principal component analysis performs as good as the ordinary principal component analysis in terms of accuracy and precision, while it shows two major advantages: faster calculations and easier interpretation of the principal components. These advantages are very helpful especially in big data situations.

“…; see, for example, Crainiceanu and Ruppert () Drikvandi et al. () Drikvandi et al. (); Fitzmaurice, Lipsitz, and Ibrahim (); Giampaoli and Singer (); Lee and Braun (); Miller (); Saville and Herring (); Sinha (); Stram and Lee (); Verbeke and Molenberghs ().…”

Section: Introductionmentioning

confidence: 99%

“…There is a large literature on testing random effects when measurement errors are assumed to be i.i.d. ; see, for example, Crainiceanu and Ruppert (2004) Drikvandi et al (2012) Drikvandi et al (2013); Fitzmaurice, Lipsitz, and Ibrahim (2007); Giampaoli and Singer (2009); Lee and Braun (2012); Miller (1977); Saville and Herring (2009); Sinha (2009); Stram and Lee (1994); Verbeke and Molenberghs (2003). It is well understood that the main challenge with testing random effects is that the null hypothesis puts the true values of variance components on the boundary of parameter space, and hence the asymptotic chi-squared distribution of the classical tests such as likelihood ratio, Wald, and score tests is incorrect.…”

Section: Introductionmentioning

confidence: 99%

“…It is important to test for the need of random effects in linear mixed‐effects models to decide which random effects should be included or excluded from the model. While several practical examples on testing random effects are given in (Drikvandi, Khodadadi, & Verbeke, ; Drikvandi et al., ), there are some theoretical and computational reasons why such a test on random effects is important. For example, if unnecessary random effects are included in the model, the parameter estimates will not be efficient.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Testing random effects in linear mixed‐effects models with serially correlated errors

Noorian

2019

Biometrical J

Self Cite

In linear mixed‐effects models, random effects are used to capture the heterogeneity and variability between individuals due to unmeasured covariates or unknown biological differences. Testing for the need of random effects is a nonstandard problem because it requires testing on the boundary of parameter space where the asymptotic chi‐squared distribution of the classical tests such as likelihood ratio and score tests is incorrect. In the literature several tests have been proposed to overcome this difficulty, however all of these tests rely on the restrictive assumption of i.i.d. measurement errors. The presence of correlated errors, which often happens in practice, makes testing random effects much more difficult. In this paper, we propose a permutation test for random effects in the presence of serially correlated errors. The proposed test not only avoids issues with the boundary of parameter space, but also can be used for testing multiple random effects and any subset of them. Our permutation procedure includes the permutation procedure in Drikvandi, Verbeke, Khodadadi, and Partovi Nia (2013) as a special case when errors are i.i.d., though the test statistics are different. We use simulations and a real data analysis to evaluate the performance of the proposed permutation test. We have found that random slopes for linear and quadratic time effects may not be significant when measurement errors are serially correlated.