Abstract:It is well known that the testing of zero variance components is a non-standard problem since the null hypothesis is on the boundary of the parameter space. The usual asymptotic chi-square distribution of the likelihood ratio and score statistics under the null does not necessarily hold because of this null hypothesis. To circumvent this difficulty in balanced linear growth curve models, we introduce an appropriate test statistic and suggest a permutation procedure to approximate its finite-sample distribution… Show more
“…Classical tests such as the likelihood ratio and score tests cannot be easily applied to this testing problem because it requires testing on the boundary of parameter space. Bootstrap and permutation tests have been suggested for testing random effects in linear and generalised linear mixed models; however, little is known on testing random effects in nonlinear mixed‐effects models. Developing such tests for random effects in nonlinear mixed‐effects models will be very helpful.…”
Nonlinear mixed-effects models are being widely used for the analysis of longitudinal data, especially from pharmaceutical research. They use random effects which are latent and unobservable variables so the random-effects distribution is subject to misspecification in practice. In this paper, we first study the consequences of misspecifying the random-effects distribution in nonlinear mixed-effects models. Our study is focused on Gauss-Hermite quadrature, which is now the routine method for calculation of the marginal likelihood in mixed models. We then present a formal diagnostic test to check the appropriateness of the assumed random-effects distribution in nonlinear mixed-effects models, which is very useful for real data analysis. Our findings show that the estimates of fixed-effects parameters in nonlinear mixed-effects models are generally robust to deviations from normality of the random-effects distribution, but the estimates of variance components are very sensitive to the distributional assumption of random effects. Furthermore, a misspecified random-effects distribution will either overestimate or underestimate the predictions of random effects. We illustrate the results using a real data application from an intensive pharmacokinetic study.
“…Classical tests such as the likelihood ratio and score tests cannot be easily applied to this testing problem because it requires testing on the boundary of parameter space. Bootstrap and permutation tests have been suggested for testing random effects in linear and generalised linear mixed models; however, little is known on testing random effects in nonlinear mixed‐effects models. Developing such tests for random effects in nonlinear mixed‐effects models will be very helpful.…”
Nonlinear mixed-effects models are being widely used for the analysis of longitudinal data, especially from pharmaceutical research. They use random effects which are latent and unobservable variables so the random-effects distribution is subject to misspecification in practice. In this paper, we first study the consequences of misspecifying the random-effects distribution in nonlinear mixed-effects models. Our study is focused on Gauss-Hermite quadrature, which is now the routine method for calculation of the marginal likelihood in mixed models. We then present a formal diagnostic test to check the appropriateness of the assumed random-effects distribution in nonlinear mixed-effects models, which is very useful for real data analysis. Our findings show that the estimates of fixed-effects parameters in nonlinear mixed-effects models are generally robust to deviations from normality of the random-effects distribution, but the estimates of variance components are very sensitive to the distributional assumption of random effects. Furthermore, a misspecified random-effects distribution will either overestimate or underestimate the predictions of random effects. We illustrate the results using a real data application from an intensive pharmacokinetic study.
“…Again, care ought to be taken when calculating the caught variance with associated or correlated loadings. Note that assessing and testing a significant variance in correlated models is a nonstandard testing problem [11][12][13][14].…”
High dimensional data are rapidly growing in many different disciplines, particularly in natural language processing. The analysis of natural language processing requires working with high dimensional matrices of word embeddings obtained from text data. Those matrices are often sparse in the sense that they contain many zero elements. Sparse principal component analysis is an advanced mathematical tool for the analysis of high dimensional data. In this paper, we study and apply the sparse principal component analysis for natural language processing, which can effectively handle large sparse matrices. We study several formulations for sparse principal component analysis, together with algorithms for implementing those formulations. Our work is motivated and illustrated by a real text dataset. We find that the sparse principal component analysis performs as good as the ordinary principal component analysis in terms of accuracy and precision, while it shows two major advantages: faster calculations and easier interpretation of the principal components. These advantages are very helpful especially in big data situations.
“…; see, for example, Crainiceanu and Ruppert () Drikvandi et al. () Drikvandi et al. (); Fitzmaurice, Lipsitz, and Ibrahim (); Giampaoli and Singer (); Lee and Braun (); Miller (); Saville and Herring (); Sinha (); Stram and Lee (); Verbeke and Molenberghs ().…”
Section: Introductionmentioning
confidence: 99%
“…There is a large literature on testing random effects when measurement errors are assumed to be i.i.d. ; see, for example, Crainiceanu and Ruppert (2004) Drikvandi et al (2012) Drikvandi et al (2013); Fitzmaurice, Lipsitz, and Ibrahim (2007); Giampaoli and Singer (2009); Lee and Braun (2012); Miller (1977); Saville and Herring (2009); Sinha (2009); Stram and Lee (1994); Verbeke and Molenberghs (2003). It is well understood that the main challenge with testing random effects is that the null hypothesis puts the true values of variance components on the boundary of parameter space, and hence the asymptotic chi-squared distribution of the classical tests such as likelihood ratio, Wald, and score tests is incorrect.…”
Section: Introductionmentioning
confidence: 99%
“…It is important to test for the need of random effects in linear mixed‐effects models to decide which random effects should be included or excluded from the model. While several practical examples on testing random effects are given in (Drikvandi, Khodadadi, & Verbeke, ; Drikvandi et al., ), there are some theoretical and computational reasons why such a test on random effects is important. For example, if unnecessary random effects are included in the model, the parameter estimates will not be efficient.…”
In linear mixed‐effects models, random effects are used to capture the heterogeneity and variability between individuals due to unmeasured covariates or unknown biological differences. Testing for the need of random effects is a nonstandard problem because it requires testing on the boundary of parameter space where the asymptotic chi‐squared distribution of the classical tests such as likelihood ratio and score tests is incorrect. In the literature several tests have been proposed to overcome this difficulty, however all of these tests rely on the restrictive assumption of i.i.d. measurement errors. The presence of correlated errors, which often happens in practice, makes testing random effects much more difficult. In this paper, we propose a permutation test for random effects in the presence of serially correlated errors. The proposed test not only avoids issues with the boundary of parameter space, but also can be used for testing multiple random effects and any subset of them. Our permutation procedure includes the permutation procedure in Drikvandi, Verbeke, Khodadadi, and Partovi Nia (2013) as a special case when errors are i.i.d., though the test statistics are different. We use simulations and a real data analysis to evaluate the performance of the proposed permutation test. We have found that random slopes for linear and quadratic time effects may not be significant when measurement errors are serially correlated.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.