Relative efficiency of joint-model and full-conditional-specification multiple imputation when conditional models are compatible: The general location model.
Hughes, Rachael A
Stat Methods Med Res
MetadataShow full item record
Seaman, S., & Hughes, R. A. (2018). Relative efficiency of joint-model and full-conditional-specification multiple imputation when conditional models are compatible: The general location model.. Stat Methods Med Res, 27 (6), 1603-1614. https://doi.org/10.1177/0962280216665872
Estimating the parameters of a regression model of interest is complicated by missing data on the variables in that model. Multiple imputation is commonly used to handle these missing data. Joint model multiple imputation and full-conditional specification multiple imputation are known to yield imputed data with the same asymptotic distribution when the conditional models of full-conditional specification are compatible with that joint model. We show that this asymptotic equivalence of imputation distributions does not imply that joint model multiple imputation and full-conditional specification multiple imputation will also yield asymptotically equally efficient inference about the parameters of the model of interest, nor that they will be equally robust to misspecification of the joint model. When the conditional models used by full-conditional specification multiple imputation are linear, logistic and multinomial regressions, these are compatible with a restricted general location joint model. We show that multiple imputation using the restricted general location joint model can be substantially more asymptotically efficient than full-conditional specification multiple imputation, but this typically requires very strong associations between variables. When associations are weaker, the efficiency gain is small. Moreover, full-conditional specification multiple imputation is shown to be potentially much more robust than joint model multiple imputation using the restricted general location model to mispecification of that model when there is substantial missingness in the outcome variable.
Compatibility, Gibbs sampler, chained equations, congeniality, informative margins, linear discriminant analysis, log linear model, missing data, Algorithms, Bias, Biomedical Research, Data Interpretation, Statistical, Models, Statistical, Randomized Controlled Trials as Topic, Regression Analysis
External DOI: https://doi.org/10.1177/0962280216665872
This record's URL: https://www.repository.cam.ac.uk/handle/1810/277051
Attribution 4.0 International
Licence URL: http://creativecommons.org/licenses/by/4.0/
Recommended or similar items
The following licence files are associated with this item: