The impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study

Martín-Merino, Elisa; Calderón‐Larrañaga, Amaia; Hawley, Samuel; Poblador‐Plou, Beatriz; Llorente-García, Ana; Petersen, Irene; Prieto-Alhambra, Daniel

doi:10.2147/clep.s154914

Cited by 12 publications

(19 citation statements)

References 15 publications

(27 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…[33][34][35][36] In the literature, many statistical analyses and simulation articles have indicated that either multiple imputation techniques or analyses that account for missing data are superior to complete case analyses. [33][34][35][36][37] However, we noticed that such techniques are counterintuitive to many readers. Consequently, we have frequently been asked by journal reviewers to report complete cases, despite literature advising otherwise.…”

Section: Discussionmentioning

confidence: 99%

Routine Health Outcome Measurement: Development, Design, and Implementation of the Hand and Wrist Cohort

Selles¹,

Wouters²,

Poelstra³

et al. 2020

Plastic &Amp; Reconstructive Surgery

View full text Add to dashboard Cite

outine measurement of the outcome of clinical care is increasingly considered important in health care. It is a key aspect of value-based health care, patient-centered care, and other quality-of-care initiatives. 1 For example, the Dutch government strives to have objective outcome data on 50 percent of all health care in 2022, 2 and in Sweden, outcome measurements have been part of a national registry for years. 3 The goals of routine outcome measurement are multiple, including improving communication and treatment guidance at the patient level, in addition to benchmarking of outcome at the level of individual clinicians or treatment centers. This benchmark information may help to establish priorities in resource allocation, and provide clinicians and managers with valuable feedback on performance. Furthermore, routine outcome measurement systems generate large data sets that can be used in scientific research. These "big data" can help provide knowledge on, for example, comparative effectiveness, predictive factors of outcome, and psychometric properties of measurement instruments. Although routine outcome measurement has been advocated for years, implementation in clinical practice is limited because of several

show abstract

Section: Discussionmentioning

confidence: 99%

Routine Health Outcome Measurement: Development, Design, and Implementation of the Hand and Wrist Cohort

Selles¹,

Wouters²,

Poelstra³

et al. 2020

Plastic &Amp; Reconstructive Surgery

View full text Add to dashboard Cite

show abstract

“…In summarizing the use of EHR data to develop risk prediction models, Goldstein et al [ 9 ] found that only 58 of the 90 studies evaluated addressed missing data prior to analysis. The simplest approaches toward managing missing values involve selecting subsets of the data that contain complete information [ 11 , 12 ], and using stratified mean imputation used to fill-in missing values [ 13 ]. Others have designed functions to interpolate longitudinal variables with limited individual-level variability that are typically not dependent on other covariates [ 14 ].…”

Section: Introductionmentioning

confidence: 99%

“…Simpler approaches toward EHR imputation must consider whether missing values are missing completely at random (MCAR), missing at random (MAR), or missing not at random (MNAR) [ 14 ]. Conditional imputation methods may be used to account for these dependencies, most effectively if missing data are MAR [ 10 , 12 , 15 ]. While they may improve completeness and predictive precision, these methods may be computationally intensive when applied to large-scale EHR data with significant amounts of missing values.…”

Section: Introductionmentioning

confidence: 99%

A multi-step approach to managing missing data in time and patient variant electronic health records

Cesare

Were

2022

BMC Res Notes

View full text Add to dashboard Cite

Objective Electronic health records (EHR) hold promise for conducting large-scale analyses linking individual characteristics to health outcomes. However, these data often contain a large number of missing values at both the patient and visit level due to variation in data collection across facilities, providers, and clinical need. This study proposes a stepwise framework for imputing missing values within a visit-level EHR dataset that combines informative missingness and conditional imputation in a scalable manner that may be parallelized for efficiency. Results For this study we use a subset of data from AMPATH representing information from 530,812 clinic visits from 16,316 Human Immunodeficiency Virus (HIV) positive women across Western Kenya who have given birth. We apply this process to a set of 84 clinical, social and economic variables and are able to impute values for 84.6% of variables with missing data with an average reduction in missing data of approximately 35.6%. We validate the use of this imputed dataset by predicting National Hospital Insurance Fund (NHIF) enrollment with 94.8% accuracy.

show abstract

“…8,9 For example, one study showed that risk estimates of venous thromboembolism associated with anti-osteoporotic medications were substantially affected by the use of different strategies for the handling of missing data, leading to differences in the direction of treatment effect estimates. 8 Missing data can arise at several stages within a multi-database pharmacoepidemiologic study. Like in a single database study, data may not be recorded at the stage of data entry into the database.…”

Section: Introductionmentioning

confidence: 99%

“…14 Methods to account for sporadically missing data, such as multiple imputation (MI) and inverse probability weighting, are widely known. 8,15 To handle systematically missing data, a practical approach is to exclude the missing variable from the analyses or exclude an entire database. 8 A recently proposed alternative is multi-level MI (MLMI), which can account for both sporadically and systematically missing data.…”

Section: Introductionmentioning

confidence: 99%

A systematic review of how missing data are handled and reported in multi‐database pharmacoepidemiologic studies

Hunt

Gardarsdóttir

Bazelier

et al. 2021

Pharmacoepidemiology and Drug

View full text Add to dashboard Cite

Purpose: Pharmacoepidemiologic multi-database studies (MDBS) provide opportunities to better evaluate the safety and effectiveness of medicines. However, the issue of missing data is often exacerbated in MDBS, potentially resulting in bias and precision loss. We sought to measure how missing data are being recorded and addressed in pharmacoepidemiologic MDBS.Methods: We conducted a systematic literature search in PubMed for pharmacoepidemiologic MDBS published between 1st January 2018 and 31st December 2019. Included studies were those that used ≥2 distinct databases to assess the same safety/effectiveness outcome associated with a drug exposure. Outcome variables extracted from the studies included strategies to execute a MDBS, reporting of missing data (type, bias evaluation) and the methods used to account for missing data.Results: Two thousand seven hundred and twenty-six articles were identified, and 62 studies were included: using data from either North America (56%), Europe (31%), multiple regions (11%) or East-Asia (2%). Thirty-five (56%) articles reported missing data: 11 of these studies reported that this could have introduced bias and 19 studies reported a method to address missing data. Thirteen (68%) carried out a complete case analysis, 2 (11%) applied multiple imputation, 2 (11%) used both methods, 1 (5%) used mean imputation and 1 (5%) substituted information from a similar variable.Conclusions: Just over half of the recent pharmacoepidemiologic MDBS reported missing data and two-thirds of these studies reported how they accounted for it. We should increase our vigilance for database completeness in MDBS by reporting and addressing the missing data that could introduce bias.

show abstract

The impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study

Cited by 12 publications

References 15 publications

Routine Health Outcome Measurement: Development, Design, and Implementation of the Hand and Wrist Cohort

Routine Health Outcome Measurement: Development, Design, and Implementation of the Hand and Wrist Cohort

A multi-step approach to managing missing data in time and patient variant electronic health records

A systematic review of how missing data are handled and reported in multi‐database pharmacoepidemiologic studies

Contact Info

Product

Resources

About