Bayesian multiple imputation for large-scale categorical data with structural zeros

Articles and reports: 12-001-X201400114002
Description:

We propose an approach for multiple imputation of items missing at random in large-scale surveys with exclusively categorical variables that have structural zeros. Our approach is to use mixtures of multinomial distributions as imputation engines, accounting for structural zeros by conceiving of the observed data as a truncated sample from a hypothetical population without structural zeros. This approach has several appealing features: imputations are generated from coherent, Bayesian joint models that automatically capture complex dependencies and readily scale to large numbers of variables. We outline a Gibbs sampling algorithm for implementing the approach, and we illustrate its potential with a repeated sampling study using public use census microdata from the state of New York, U.S.A.

Issue Number: 2014001
Author(s): Manrique-Vallier, Daniel; Reiter, Jerome P.
Main Product: Survey Methodology
Format Release date More information
HTML June 27, 2014
PDF June 27, 2014