Bradley A. Warner scite author profile

A variety of methods and algorithms are available for estimating the parameters of generalized linear models in the presence of missing values. However, there is little information on how an already fitted model can be used to predict outcomes for new observations with missing covariate data. Dropping the observations with missing values is a widespread practice with serious statistical and non-statistical implications. One solution is to fit separate regression models, or submodels, to each pattern of missing covariates. In practice, for any iterative regression method, this approach is computationally intensive. We propose a simple methodology to predict outcomes for individuals with incomplete information based on the estimated coefficients and covariance from the already fitted model. This method does not require revisiting the data set used to build the original model and works by generating a first-order approximation of any submodel coefficient estimates. This is achieved by applying the SWEEP operator to an augmented covariance matrix obtained from the original model. We refer to this approach as the one-step sweep (OSS) method. The methodology is demonstrated using data from the Department of Veterans Affairs Continuous Improvement in Cardiac Surgery Program (CICSP). These data contain 30-day mortality, the outcome of interest, and risk information for over 14,000 patients who underwent coronary artery bypass grafting (CABG) surgery over a four-year period. Using complete data from the first 3.5 years of this study period, a logistic regression model was built. This model was then used to predict mortality for patients undergoing CABG in the most recent 6 months. To evaluate the performance of the OSS method, we randomly generated observations with missing covariates in the 6-month prediction database. We use this simulation to demonstrate that the computationally efficient OSS substantially reduces the error in risk-adjusted mortality created when cases with incomplete information are eliminated. Lastly, we derive the relationship between the OSS method and data imputation.
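The idea of sweeping an augmented covariance matrix can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `sweep` and `submodel_coefficients`, the sign convention of the sweep, the zero in the corner of the augmented matrix, and the synthetic data are all assumptions made for illustration. The example uses ordinary least squares, where sweeping on the missing covariate reproduces the refitted submodel exactly; for a logistic or other generalized linear model, the abstract describes the result as a first-order approximation only.

```python
import numpy as np


def sweep(a, k):
    """Return a copy of the symmetric matrix `a` swept on pivot index `k`.

    Uses one common symmetric convention: the pivot becomes -1/d, the pivot
    row and column are divided by d, and every other entry a[i, j] becomes
    a[i, j] - a[i, k] * a[k, j] / d.
    """
    a = np.asarray(a, dtype=float)
    d = a[k, k]
    if d == 0:
        raise ValueError("cannot sweep on a zero pivot")
    out = a - np.outer(a[:, k], a[k, :]) / d
    out[k, :] = a[k, :] / d
    out[:, k] = a[:, k] / d
    out[k, k] = -1.0 / d
    return out


def submodel_coefficients(beta, cov, missing):
    """Approximate the coefficients of the submodel without `missing` covariates.

    `beta` and `cov` are the full-model coefficient estimates and their
    covariance matrix. The augmented matrix [[cov, beta], [beta', 0]] is swept
    on each missing index; the last column then holds the submodel
    coefficients in the rows of the covariates that were kept.
    """
    beta = np.asarray(beta, dtype=float)
    p = beta.shape[0]
    aug = np.zeros((p + 1, p + 1))
    aug[:p, :p] = cov
    aug[:p, p] = beta
    aug[p, :p] = beta
    for k in missing:
        aug = sweep(aug, k)
    dropped = set(missing)
    keep = [j for j in range(p) if j not in dropped]
    return keep, aug[keep, p]


# Synthetic linear-regression check (illustrative data, not CICSP data).
rng = np.random.default_rng(0)
X = np.column_stack([np.ones(200), rng.normal(size=(200, 3))])
X[:, 3] += 0.5 * X[:, 1]  # make two covariates correlated
y = X @ np.array([1.0, 2.0, -1.0, 0.5]) + rng.normal(size=200)

xtx_inv = np.linalg.inv(X.T @ X)  # proportional to the covariance of beta
beta_full = xtx_inv @ X.T @ y

# Pretend covariate 3 is missing for a new patient: sweep it out.
keep, beta_sub = submodel_coefficients(beta_full, xtx_inv, missing=[3])

# Reference answer: refit the submodel from the original data.
beta_refit = np.linalg.lstsq(X[:, keep], y, rcond=None)[0]
print(np.allclose(beta_sub, beta_refit))
```

Only the full-model coefficients and their covariance enter `submodel_coefficients`, which mirrors the abstract's point that the original data set need not be revisited; the refit on `X[:, keep]` is included here solely as a check.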

Identification of risk factors for increased cost, charges, and length of stay for cardiac patients

MaWhinney

Brown

Malcolm

et al. 2000

The Annals of Thoracic Surgery

What's The Point? The Benefits of Grading Without Points

Brilleslyper

Ghrist

Holcomb

et al. 2012

PRIMUS

No continuous relationship between veterans affairs hospital coronary artery bypass grafting surgical volume and operative mortality

Prospective prediction in the presence of missing data
