Survey Methodology
Multiple imputation of missing values in household data with structural zeros
Archived Content
Information identified as archived is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please "contact us" to request a format other than those available.
by Olanrewaju Akande, Jerome Reiter and Andrés F. BarrientosNote 1
- Release date: June 27, 2019
Abstract
We present an approach for imputation of missing items in multivariate categorical data nested within households. The approach relies on a latent class model that (i) allows for household-level and individual-level variables, (ii) ensures that impossible household configurations have zero probability in the model, and (iii) can preserve multivariate distributions both within households and across households. We present a Gibbs sampler for estimating the model and generating imputations. We also describe strategies for improving the computational efficiency of the model estimation. We illustrate the performance of the approach with data that mimic the variables collected in typical population censuses.
Key Words: Categorical; Census; Edit; Latent; Mixture; Nonresponse.
Table of contents
- Section 1. Introduction
- Section 2. Review of the NDPMPM model
- Section 3. Handling missing data using the NDPMPM
- Section 4. Strategies for speeding up the MCMC sampler
- Section 5. Empirical study
- Section 6. Discussion
- Acknowledgements
- Appendix
- References
How to cite
Akande, O., Reiter, J. and Barrientos, A.F. (2019). Multiple imputation of missing values in household data with structural zeros. Survey Methodology, Statistics Canada, Catalogue No. 12-001-X, Vol. 45, No. 2. Paper available at https://www150.statcan.gc.ca/n1/pub/12-001-x/2019002/article/00005-eng.htm.
Note
- Date modified: