2019
DOI: 10.1103/physrevphyseducres.15.020106
|View full text |Cite
|
Sign up to set email alerts
|

Missing data and bias in physics education research: A case for using multiple imputation

Abstract: Physics education researchers (PER) commonly use complete-case analysis to address missing data. For complete-case analysis, researchers discard all data from any student who is missing any data. Despite its frequent use, no PER article we reviewed that used complete-case analysis provided evidence that the data met the assumption of missing completely at random (MCAR) necessary to ensure accurate results. Not meeting this assumption raises the possibility that prior studies have reported biased results with i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
49
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 56 publications
(50 citation statements)
references
References 30 publications
1
49
0
Order By: Relevance
“…The size of the dataset after each step in the filtering process is shown in Table . After filtering, 58% of the students had matched pretest and posttest scores, which fell in the range of typical participation rates in the literature (Nissen, Donatello, & Van Dusen, in press).…”
Section: Methodsmentioning
confidence: 86%
“…The size of the dataset after each step in the filtering process is shown in Table . After filtering, 58% of the students had matched pretest and posttest scores, which fell in the range of typical participation rates in the literature (Nissen, Donatello, & Van Dusen, in press).…”
Section: Methodsmentioning
confidence: 86%
“…The final dataset included data from 5959 students in 112 courses at 17 institutions with missing data for 15% of the pretests and 30% of the post-tests. This resulted in 55% of the responses having matched pretest and post-test which falls in the middle of the 30% to 80% range of matched data reported in the PER literature [37]. We calculated pretest and post-test scores using the total percentage correct of all the items on the assessment.…”
Section: Data Collection and Preparationmentioning
confidence: 98%
“…We used Multivariate Imputations by Chained Equa-tions (MICE) [61] with predictive mean matching (pmm) to impute the missing closed-response data. These methods have been discussed previously in [62,63] and so we do not elaborate on them here. For each respondent, we used the levels of the lab they were enrolled in (FY or BFY), the type of lab they were enrolled in (CTLabs or other), and the score on the closed-response survey they completed to estimate their missing score.…”
Section: Appendix A: Multiple Imputation Analysismentioning
confidence: 99%
“…MICE operates by imputing our dataset M times, creating M complete datasets, each containing data from 4084 students. Each of these M datasets will have somewhat different values for the imputed data [62,63]. If the calculation is not prohibitive, it has been recommended that M be set to the average percentage of missing data [63], which in our case is 27.…”
Section: Appendix A: Multiple Imputation Analysismentioning
confidence: 99%
See 1 more Smart Citation