Survey Methodology
Use of nonprobability samples for official statistics, state of the art
by Danny Pfeffermann and Michael SverchkovNote 1
- Release date: June 30, 2025
Abstract
Tightened budgets, continuing decrease of response rates in traditional probability surveys and increasing pressure by users for more timely data, has stimulated research on the use of nonprobability sample data, such as administrative records, web scraping, mobile phone data and voluntary internet surveys, for inference on finite population parameters like means and totals. These data are often easier, faster and cheaper to collect than traditional probability samples. However, a major concern with the use of this kind of data for official statistics is their nonrepresentativeness due to possible selection bias, which if not accounted for properly, could bias the inference. In this article, we review and discuss methods considered in the literature to deal with this problem and propose new methods, distinguishing between methods based on integration of the nonprobability sample with an appropriate probability sample, and methods that base the inference solely on the nonprobability sample. Empirical illustrations, based on simulated data are provided.
Key Words: Empirical likelihood; Probability and nonprobability samples; Sample integration; Selection bias.
Table of contents
- Section 1. Introduction
- Section 2. Integration of nonprobability and probability samples
- Section 3. Inference from a nonprobability sample without integration
- Section 4. A new (old) approach for inference from a nonprobability sample
- Section 5. Simulation study
- Section 6. Concluding remarks
- References
How to cite
Pfeffermann, D., and Sverchkov, M. (2025). Use of nonprobability samples for official statistics, state of the art. Survey Methodology, 51(1), 169-196. Paper available at http://www.statcan.gc.ca/pub/12-001-x/2025001/article/00008-eng.pdf.
Note
- Date modified: