Comments on “Statistical inference with non-probability survey samples”
Section 1. Introduction
Many thanks to Changbao Wu for his stimulating review and assessment of methods for making inferences from nonprobability samples. I especially appreciate his thoughtful examination of the strong assumptions needed to derive the bias and variance of estimates.
Wu reviews three approaches for estimating the finite population mean of a variable that is measured in a nonprobability sample of size Because this sample is not representative of the population (and hence the sample mean is likely biased for estimating each approach relies on information from a high-quality probability sample of size does not measure but it contains a set of auxiliary variables that are also observed in
In the model-based predictive approach, a model is developed on to predict from The mass imputation (MI) estimator, for example, uses the model to impute an estimate of for every member of the probability sample Then the population total of is estimated by where is the design weight of unit in
In the inverse propensity weighting (IPW) approach, a model is developed predicting the probability that population unit appears in as a function of Then unit in is assigned weight and the population total is estimated by
Wu also reviews a “doubly robust” estimator of that, by combining the predictive and IPW estimators, is approximately unbiased under the assumptions if either model is correctly specified. In this discussion, I will concentrate on the predictive and IPW approaches because these methods generalize more easily for multivariate analyses and estimating population characteristics other than means.
In Section 2, I explore assumptions needed for inference from nonprobability samples and diagnostics for assessing them. Then, in Section 3, I look at some questions to ask when deciding which approach (if any) to use for inference.
- Date modified: