We introduce a quantile-adaptive framework for nonlinear variable screening with high-dimensional heterogeneous data. This framework has two distinctive features: (1) it allows the set of active variables to vary across quantiles, thus making it more flexible to accommodate heterogeneity; (2) it is model-free and avoids the difficult task of specifying the form of a statistical model in a high dimensional space. Our nonlinear independence screening procedure employs spline approximations to model the marginal effects at a quantile level of interest. Under appropriate conditions on the quantile functions without requiring the existence of any moments, the new procedure is shown to enjoy the sure screening property in ultra-high dimensions. Furthermore, the quantile-adaptive framework can naturally handle censored data arising in survival analysis. We prove that the sure screening property remains valid when the response variable is subject to random right censoring. Numerical studies confirm the fine performance of the proposed method for various semiparametric models and its effectiveness to extract quantile-specific information from heteroscedastic data.
Research has shown that personality traits associated with impulsivity influence alcohol use during emerging adulthood, yet relatively few studies have examined how distinct facets of impulsivity are associated with alcohol use and abuse. We examine the influence of impulsivity traits on four patterns of alcohol use including frequency of alcohol use, alcohol-related problems, binge drinking, and alcohol use disorders (AUDs) in a community sample of young individuals (N = 190). In multivariate regression analyses that controlled for peer and parental alcohol use, psychological distress, and developmental correlates (i.e., college, marriage, employment) in emerging adulthood, we found that urgency and sensation seeking were consistently related to all four constructs of alcohol use. The present study suggests that distinct impulsivity traits may play different roles in escalation of alcohol use and development of AUDs during emerging adulthood.
The coronavirus pandemic has rapidly evolved into an unprecedented crisis. The susceptible-infectious-removed (SIR) model and its variants have been used for modeling the pandemic. However, time-independent parameters in the classical models may not capture the dynamic transmission and removal processes, governed by virus containment strategies taken at various phases of the epidemic. Moreover, few models account for possible inaccuracies of the reported cases. We propose a Poisson model with time-dependent transmission and removal rates to account for possible random errors in reporting and estimate a time-dependent disease reproduction number, which may reflect the effectiveness of virus control strategies. We apply our method to study the pandemic in several severely impacted countries, and analyze and forecast the evolving spread of the coronavirus. We have developed an interactive web application to facilitate readers’ use of our method.
Lower income was most closely associated with prevalence and progression of CKD, and lower education was significantly associated with its prevalence. Evidence for other indicators was inconclusive.
Identifying important biomarkers that are predictive for cancer
patients’ prognosis is key in gaining better insights into the
biological influences on the disease and has become a critical component of
precision medicine. The emergence of large-scale biomedical survival studies,
which typically involve excessive number of biomarkers, has brought high demand
in designing efficient screening tools for selecting predictive biomarkers. The
vast amount of biomarkers defies any existing variable selection methods via
regularization. The recently developed variable screening methods, though
powerful in many practical setting, fail to incorporate prior information on the
importance of each biomarker and are less powerful in detecting marginally weak
while jointly important signals. We propose a new conditional screening method
for survival outcome data by computing the marginal contribution of each
biomarker given priorily known biological information. This is based on the
premise that some biomarkers are known to be associated with disease outcomes a
priori. Our method possesses sure screening properties and a vanishing false
selection rate. The utility of the proposal is further confirmed with extensive
simulation studies and analysis of a diffuse large B-cell lymphoma dataset. We
are pleased to dedicate this work to Jack Kalbfleisch, who has made instrumental
contributions to the development of modern methods of analyzing survival
data.
Summary
Traditional variable selection methods are compromised by overlooking useful information on covariates with similar functionality or spatial proximity, and by treating each covariate independently. Leveraging prior grouping information on covariates, we propose partition-based screening methods for ultrahigh-dimensional variables in the framework of generalized linear models. We show that partition-based screening exhibits the sure screening property with a vanishing false selection rate, and we propose a data-driven partition screening framework with unavailable or unreliable prior knowledge on covariate grouping and investigate its theoretical properties. We consider two special cases: correlation-guided partitioning and spatial location- guided partitioning. In the absence of a single partition, we propose a theoretically justified strategy for combining statistics from various partitioning methods. The utility of the proposed methods is demonstrated via simulation and analysis of functional neuroimaging data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.