Deriving statistical models to predict one variable from one or more other variables, or predictive modeling, is an important activity in obesity and nutrition research. To determine the quality of the model, it is necessary to quantify and report the predictive validity of the derived models. Conducting validation of the predictive measures provides essential information to the research community about the model. Unfortunately, many articles fail to account for the nearly inevitable reduction in predictive ability that occurs when a model derived on one dataset is applied to a new dataset. Under some circumstances, the predictive validity can be reduced to nearly zero. In this overview, we explain why reductions in predictive validity occur, define the metrics commonly used to estimate the predictive validity of a model (e.g., R2, mean squared error, sensitivity, specificity, receiver operating characteristic, concordance index), and describe methods to estimate the predictive validity (e.g., cross-validation, bootstrap, adjusted and shrunken R2). We emphasize that methods for estimating the expected reduction in predictive ability of a model in new samples are available and this expected reduction should always be reported when new predictive models are introduced.
BackgroundChildhood growth is a cornerstone of pediatric research. Statistical models need to consider individual trajectories to adequately describe growth outcomes. Specifically, well-defined longitudinal models are essential to characterize both population and subject-specific growth. Linear mixed-effect models with cubic regression splines can account for the nonlinearity of growth curves and provide reasonable estimators of population and subject-specific growth, velocity and acceleration.MethodsWe provide a stepwise approach that builds from simple to complex models, and account for the intrinsic complexity of the data. We start with standard cubic splines regression models and build up to a model that includes subject-specific random intercepts and slopes and residual autocorrelation. We then compared cubic regression splines vis-à-vis linear piecewise splines, and with varying number of knots and positions. Statistical code is provided to ensure reproducibility and improve dissemination of methods. Models are applied to longitudinal height measurements in a cohort of 215 Peruvian children followed from birth until their fourth year of life.ResultsUnexplained variability, as measured by the variance of the regression model, was reduced from 7.34 when using ordinary least squares to 0.81 (p < 0.001) when using a linear mixed-effect models with random slopes and a first order continuous autoregressive error term. There was substantial heterogeneity in both the intercept (p < 0.001) and slopes (p < 0.001) of the individual growth trajectories. We also identified important serial correlation within the structure of the data (ρ = 0.66; 95 % CI 0.64 to 0.68; p < 0.001), which we modeled with a first order continuous autoregressive error term as evidenced by the variogram of the residuals and by a lack of association among residuals. The final model provides a parametric linear regression equation for both estimation and prediction of population- and individual-level growth in height. We show that cubic regression splines are superior to linear regression splines for the case of a small number of knots in both estimation and prediction with the full linear mixed effect model (AIC 19,352 vs. 19,598, respectively). While the regression parameters are more complex to interpret in the former, we argue that inference for any problem depends more on the estimated curve or differences in curves rather than the coefficients. Moreover, use of cubic regression splines provides biological meaningful growth velocity and acceleration curves despite increased complexity in coefficient interpretation.ConclusionsThrough this stepwise approach, we provide a set of tools to model longitudinal childhood data for non-statisticians using linear mixed-effect models.Electronic supplementary materialThe online version of this article (doi:10.1186/s12982-015-0038-3) contains supplementary material, which is available to authorized users.
BackgroundChanges in Quality of Life (QOL) measures over time with treatment of obesity have not previously been described for youth. We describe the changes from baseline through two follow up visits in youth QOL (assessed by the Pediatric Quality Life Inventory, PedsQL4.0), teen depression (assessed by the Patient Health Questionnaire, PHQ9A), Body Mass Index (BMI) and BMI z-score. We also report caregiver proxy ratings of youth QOL.MethodsA sample of 267 pairs of youth and caregiver participants were recruited at their first visit to an outpatient weight-treatment clinic that provides care integrated between a physician, dietician, and mental health provider; of the 267, 113 attended a visit two (V2) follow-up appointment, and 48 attended visit three (V3). We investigated multiple factors longitudinally experienced by youth who are overweight and their caregivers across up to three different integrated care visits. We determined relationships at baseline in QOL, PHQ9A, and BMI z-score, as well as changes in variables over time using linear mixed models with time as a covariate.ResultsOverall across three visits the results indicate that youth had slight declines in relative BMI, significant increases in their QOL and improvements in depression.ConclusionsWe encourage clinicians and researchers to track youth longitudinally throughout treatment to investigate not only youth’s BMI changes, but also psychosocial changes including QOL.
The newly derived models provide a personalized prediction of long-term success from early weight-loss variables. The predictions improve on existing fixed percent-weight-loss thresholds. Future research is needed to explore model application for informing treatment approaches during early intervention.
Facilitative linguistic input directly connected to children’s interest and focus of attention has become a recommended component of interventions for young children with autism spectrum disorder (ASD). This longitudinal correlational study used two assessment time points and examined the association between parental undemanding topic-continuing talk related to the child’s attentional focus (i.e., follow-in comments) and later receptive language for 37 parent–child dyads with their young (mean = 21 months, range 15–24 months) children with autism symptomology. The frequency of parental follow-in comments positively predicted later receptive language after considering children’s joint attention skills and previous receptive language abilities.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.