“Optimal” calibration weights under unit nonresponse in survey sampling
Section 1. Introduction

Table of contents

In a survey the response (nonresponse) mechanism for units is in reality unknown. To avoid defining a proper probability measure which might not be meaningful or realistic, one usually discusses the nonresponse situation in terms of a propensity for a unit to participate. To be able to take into account the possible nonresponse effect on estimators, it is however the practice to treat the propensities as probabilities to be estimated (e.g., propensity scores). This can be done for individual units, for groups of units or as an “average” over the whole response set.

For example, in Haziza and Lesage (2016) two main approaches are discussed: calibration weighting with and without foregoing propensity score weighting, the former case involving model-based estimation. The authors warn against potential negative effects on the bias and variance for the resulting estimators when not taking into account the propensities. (These two options of weighting are referred to by the authors as two-step and one-step procedures, respectively not to be mistaken for the two- and single-step calibrations as defined by Särndal and Lundström (2005).) However, in the simulation study by Haziza and Lesage (2016) the sampling design plays no role, since there $n = N$ and the focus is solely on how the auxiliary information relates to the study variable and the nonresponse mechanism.

In this paper we propose to use a nonresponse version of what in the full response case is called the (design-based) optimal regression estimator. The underlying distance measure is a quadratic form with a more complex structure (see Andersson and Thorburn (2015)) than the one leading to the GREG estimator (see Deville and Särndal (1992)). As it turns out there is also room for refinement in terms of the average response propensity (probability) when constructing the distance measure under nonresponse, which leads to a modified “optimal” estimator.

1.1 Outline of the paper

Section 2 starts with an introduction to the calibration idea under full response before dealing with the nonresponse situation. Three estimators of a population total are mainly considered: the GREG related estimator and two versions of the “optimal” estimator. Some theoretical results for the resulting bias follows. Section 3 contains a simulation study where simple random sampling and Poisson sampling are used for illustration. The Poisson design enables us to construct and investigate a situation where the auxiliary information is involved in the design as well as in the nonresponse mechanism. We also illustrate the risks of using an incorrect model when estimating individual propensities. We end with concluding remarks in Section 4.

1.2 Notation and setup

We will start with a population $U$ of size $N$ from which we take a probability sample $s$ of size $n_{s}$ with inclusion probabilities $π_{1}, \dots, π_{N} .$ Nonresponse means that we only observe the response set $r$ of size $n_{r} .$ Our aim is to estimate the study variable total $t_{y} = \sum_{U} y_{k} .$ We assume access to an auxiliary variable vector $x$ of dimension $J,$ where either $x = x^{*}$ and ${(x_{k}^{*})}_{k \in U}$ are known (the population level) or $x = x^{o}$ and ${(x_{k}^{o})}_{k \in s}$ are known (the sample level) or possibly a mixture of these cases: $x = {(x^{*}^{'} , x^{o}^{'})}^{'} .$

ISSN : 1492-0921

Editorial policy

Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.

Submission of Manuscripts

Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).

Note of appreciation

Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.

Standards of service to the public

Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.

Copyright

Published by authority of the Minister responsible for Statistics Canada.

Use of this publication is governed by the Statistics Canada Open Licence Agreement.

Catalogue No. 12-001-X

Frequency: Semi-annual

Ottawa

Date modified:: 2019-12-17

Language selection

Search and menus

Search

“Optimal” calibration weights under unit nonresponse in survey sampling
Section 1. Introduction

1.1 Outline of the paper

1.2 Notation and setup

“Optimal” calibration weights under unit nonresponse in survey sampling Section 1. Introduction

1.1 Outline of the paper

1.2 Notation and setup

Editorial policy

Submission of Manuscripts

Note of appreciation

Standards of service to the public

Copyright

“Optimal” calibration weights under unit nonresponse in survey sampling
Section 1. Introduction