Maureen A. Guarcello scite author profile

Observational studies require matching across groups over multiple confounding variables. Across the literature, matching algorithms fail to handle this issue. In this way, missing values are regularly imputed prior to being considered in the matching process. However, imputing is not always practical, forcing us to drop an observation due to the deficiency of the chosen algorithm, decreasing the power of the study, and possibly failing to capture crucial latent information. We propose a missing data mechanism to incorporate within an iterative multivariate matching method. The underlying framework utilizes random forest, implemented with surrogate splits as a natural tool in constructing a distance matrix where there might be missing values. The out-put is then easily fed into an optimal matching algorithm. We apply this method to evaluate the effectiveness of Supplemental Instruction (SI) sessions, a voluntary program where students seek additional help, in a large enrollment, bottleneck introductory business statistics course. This is an observational study with two groups, those who attend multiple SI sessions and those who do not, and, as typical in educational data mining, challenged by missing data. Additionally, we perform a data simulation on missingness to further demonstrate the efficacy of our proposed approach.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Maureen A. Guarcello

Balancing Student Success: Assessing Supplemental Instruction Through Coarsened Exact Matching

Estimating a Dose-Response Relationship in Quasi-Experimental Student Success Studies

Estimating the optimal treatment regime for student success programs

Stacked Ensemble Learning for Propensity Score Methods in Observational Studies

Causal inference in the presence of missing data using a random forest‐based matching algorithm

Contact Info

Product

Resources

About