TY - JOUR
T1 - A Comparison of Imputation Strategies for Ordinal Missing Data on Likert Scale Variables
AU - Wu, Wei
AU - Jia, Fan
AU - Enders, Craig
N1 - Funding Information:
Funding: This work was supported by Grant No. 1053160 from the NSF.
Publisher Copyright:
© 2015, Copyright © Taylor & Francis Group, LLC.
PY - 2015/9/3
Y1 - 2015/9/3
N2 - This article compares a variety of imputation strategies for ordinal missing data on Likert scale variables (number of categories = 2, 3, 5, or 7) in recovering reliability coefficients, mean scale scores, and regression coefficients of predicting one scale score from another. The examined strategies include imputing using normal data models with naïve rounding/without rounding, using latent variable models, and using categorical data models such as discriminant analysis and binary logistic regression (for dichotomous data only), multinomial and proportional odds logistic regression (for polytomous data only). The result suggests that both the normal model approach without rounding and the latent variable model approach perform well for either dichotomous or polytomous data regardless of sample size, missing data proportion, and asymmetry of item distributions. The discriminant analysis approach also performs well for dichotomous data. Naïvely rounding normal imputations or using logistic regression models to impute ordinal data are not recommended as they can potentially lead to substantial bias in all or some of the parameters.
AB - This article compares a variety of imputation strategies for ordinal missing data on Likert scale variables (number of categories = 2, 3, 5, or 7) in recovering reliability coefficients, mean scale scores, and regression coefficients of predicting one scale score from another. The examined strategies include imputing using normal data models with naïve rounding/without rounding, using latent variable models, and using categorical data models such as discriminant analysis and binary logistic regression (for dichotomous data only), multinomial and proportional odds logistic regression (for polytomous data only). The result suggests that both the normal model approach without rounding and the latent variable model approach perform well for either dichotomous or polytomous data regardless of sample size, missing data proportion, and asymmetry of item distributions. The discriminant analysis approach also performs well for dichotomous data. Naïvely rounding normal imputations or using logistic regression models to impute ordinal data are not recommended as they can potentially lead to substantial bias in all or some of the parameters.
KW - missing data
KW - multiple imputation
KW - ordinal data
UR - http://www.scopus.com/inward/record.url?scp=84944153645&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84944153645&partnerID=8YFLogxK
U2 - 10.1080/00273171.2015.1022644
DO - 10.1080/00273171.2015.1022644
M3 - Article
AN - SCOPUS:84944153645
SN - 0027-3171
VL - 50
SP - 484
EP - 503
JO - Multivariate Behavioral Research
JF - Multivariate Behavioral Research
IS - 5
ER -