Statistical Inference for a Finite Population Mean with Machine Learning-Based Imputation for Missing Survey Data

Articles and reports: 11-522-X202500100025
Description: National statistical offices have increasingly adopted machine learning (ML) for its potential to improve survey estimates. ML techniques offer significant advantages, notably the ability to manage high-dimensional data and to capture complex, nonlinear relationships, thereby enhancing the overall quality of survey statistics. In this article, following the approach of Chernozhukov et al. (2018), we describe a double debiased machine learning framework that enables valid statistical inference when imputed estimators are derived from ML procedures. Simulation results suggest that the proposed framework performs well in a wide range of scenarios.
Issue Number: 2025001
Author(s): Haziza, David; Dagdoug, Mehdi
Main Product: Statistics Canada International Symposium Series: Proceedings
Format Release date More information
PDF September 8, 2025

Related information

Subjects and keywords

Subjects

Keywords