2016
DOI: 10.1177/0962280213511027
|View full text |Cite
|
Sign up to set email alerts
|

Multiple imputation in the presence of high-dimensional data

Abstract: Missing data are frequently encountered in biomedical, epidemiologic and social research. It is well known that a naive analysis without adequate handling of missing data may lead to bias and/or loss of efficiency. Partly due to its ease of use, multiple imputation has become increasingly popular in practice for handling missing data. However, it is unclear what is the best strategy to conduct multiple imputation in the presence of high-dimensional data. To answer this question, we investigate several approach… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
64
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
7
2

Relationship

2
7

Authors

Journals

citations
Cited by 72 publications
(64 citation statements)
references
References 41 publications
(50 reference statements)
0
64
0
Order By: Relevance
“…Different penalty specifications give rise to various regularized regression methods. Zhao and Long (2013) 20 investigated the use of regularized regression for MI including lasso 21 , elastic net 22 (EN), and adaptive lasso 23 (Alasso). They also developed MI using a Bayesian lasso approach.…”
mentioning
confidence: 99%
“…Different penalty specifications give rise to various regularized regression methods. Zhao and Long (2013) 20 investigated the use of regularized regression for MI including lasso 21 , elastic net 22 (EN), and adaptive lasso 23 (Alasso). They also developed MI using a Bayesian lasso approach.…”
mentioning
confidence: 99%
“…Some recent work by Zhao and Long 49 and Deng et al 50 has investigated imputation methods in the presence of high-dimensional data, but methods in this area are largely under-developed and additional research is urgently needed.…”
Section: Discussionmentioning
confidence: 99%
“…Serum 25(OH)D level (ng/ml), ethnicity and gender were the only variables in the dataset that did not have any missing information. It has now been recognised that complete case analysis without adequate handling of missing data may lead to biased results, or reduced power and precision of estimates (Zhao and Long, 2016). We assumed that the missing variables were missing at random (MAR).…”
Section: Multiple Imputationmentioning
confidence: 99%
“…We imputed the missing values using multiple imputation by chained equations (MICE) (Deng et al, 2016). MICE has been shown to be a robust method for dealing with missing data across empirical and longitudinal studies (He et al, 2011;Zhao and Long, 2016). In the MICE procedure a series of regression models are run whereby each variable with missing data is modelled according to its distribution (Azur et al, 2011); for continuous variables, this would be a multivariable linear regression; and for binary variables, a logistic regression.…”
Section: Multiple Imputationmentioning
confidence: 99%