When exploring missing data techniques in a realistic scenario, the current literature is limited: most studies only consider consequences with data missing on a single variable. This simulation study compares the relative bias of two commonly used missing data techniques when data are missing on more than one variable. Factors varied include type of missingness (MCAR, MAR), degree of missingness (10%, 25%, and 50%), and where missingness occurs (one predictor, two predictors, or two predictors with overlap). Using a real dataset, cells are systematically deleted to create various scenarios of missingness so that parameter estimates from listwise deletion and multiple imputation may be compared to the "true" estimates from the full dataset. Results suggest the multiple imputation works well, even when the imputation model itself is missing data. Accessed 7,222 times on https://pareonline.net from August 12, 2014 to December 31, 2019. For downloads from January 1, 2020 forward, please click on the PlumX Metrics link to the right.
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.
Rubright, Jonathan D.; Nandakumar, Ratna; and Gluttin, Joseph J.
"A Simulation Study of Missing Data with Multiple Missing X’s,"
Practical Assessment, Research, and Evaluation: Vol. 19
, Article 10.
Available at: https://scholarworks.umass.edu/pare/vol19/iss1/10