Survey Methodology
A new double hot-deck imputation method for missing values under boundary conditions
Archived Content
Information identified as archived is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please "contact us" to request a format other than those available.
by Yousung Park and Tae Yeon KwonNote 1
- Release date: June 30, 2020
Abstract
In surveys, logical boundaries among variables or among waves of surveys make imputation of missing values complicated. We propose a new regression-based multiple imputation method to deal with survey nonresponses with two-sided logical boundaries. This imputation method automatically satisfies the boundary conditions without an additional acceptance/rejection procedure and utilizes the boundary information to derive an imputed value and to determine the suitability of the imputed value. Simulation results show that our new imputation method outperforms the existing imputation methods for both mean and quantile estimations regardless of missing rates, error distributions, and missing-mechanisms. We apply our method to impute the self-reported variable “years of smoking” in successive health screenings of Koreans.
Key Words: Hot-deck; Two-sided boundary conditions; Multiple imputation; Item nonresponse.
Table of contents
- Section 1. Introduction
- Section 2. Double hot-deck boundary information matching proportioned residual draw
- Section 3. Simulation
- Section 4. Empirical analysis
- Section 5. Conclusion
- Acknowledgements
- Appendix
- References
How to cite
Park, Y., and Kwon, T.Y. (2020). A new double hot-deck imputation method for missing values under boundary conditions. Survey Methodology, Statistics Canada, Catalogue No. 12-001-X, Vol. 46, No. 1. Paper available at http://www.statcan.gc.ca/pub/12-001-x/2020001/article/00006-eng.htm.
Note
- Date modified: