“Optimal” calibration weights under unit nonresponse in survey sampling
Section 2. Calibration estimation

2.1 Calibration estimators under full response

Starting with the full response situation $(r = s)$ and following the procedure as established by Deville and Särndal (1992), the calibration estimator is defined as

${\hat{t}}_{y cal} = \sum_{s} w_{k s} y_{k},$

where the sample dependent weights $w_{k s}$ are chosen so that

$\sum_{s} w_{k s} x_{k} = t_{x}, (the calibration equation) (2.1)$

while also minimizing the quadratic distance measure

${(w_{s} - w_{0 s})}^{'} R (w_{s} - w_{0 s}),$

where $w_{s} = {(w_{k s})}_{k \in s},$ $w_{0 s} = {(1 / π_{k})}_{k \in s} = {(d_{k})}_{k \in s}$ and $R$ is diagonal. (Alternative distance measures are considered in both Deville and Särndal (1992) and Haziza and Lesage (2016).)

In other words, given the constraint (2.1) the $w_{k s}$ should be “as close as possible” to the design weights $d_{k},$ which is desirable since $\sum_{s} d_{k} y_{k}$ is an unbiased estimator of $t_{y} .$

The resulting weights are

$w_{s} = w_{0 s} + R^{- 1} x^{'} {(X R^{- 1} X^{'})}^{- 1} (t_{x} - {\hat{t}}_{x}) .$

It turns out that the model assisted homoskedastic GREG estimator ${\hat{t}}_{y r}$ (Särndal, Swensson and Wretman (1992)) is a calibration estimator for which

$R = {(w_{0 s} I_{n_{s}})}^{- 1},$

where $I_{n_{s}}$ is the unit diagonal matrix of size $n_{s} .$

Another calibration estimator is the optimal regression estimator ${\hat{t}}_{y opt}$ (see e.g., Rao (1994) and Montanari (1998)), for which

$R = {(\frac{π_{k l} - π_{k} π_{l}}{π_{k l} π_{k} π_{l}})}_{k, l \in s}^{- 1},$

as shown by Andersson and Thorburn (2005).

Asymptotically, this estimator has (in a design-based sense) minimum variance among linear regression type estimators.

2.2 Calibration estimators under nonresponse

In the nonresponse case, a possible calibration estimator is

$\sum_{r} w_{k r} y_{k},$

where it should hold that

$\sum_{r} w_{k r} x_{k} = X, (2.2)$

where $X = \sum_{U} x_{k}^{*},$ if the auxiliary information is known up to the population level. Otherwise, $X = \sum_{s} d_{k} x_{k}^{o},$ the unbiased estimator of $t_{x} .$ (We can also combine the two types of information in the constraint $X .)$

For a variety of cases weights fulfilling the requirement (2.2) are presented by e.g., Särndal and Lundström (2005). Using the direct approach, where all information is used in one single calibration, we get

$w_{k r} = d_{k} (1 + x_{k}^{'} {(\sum_{r} d_{k} x_{k} x_{k}^{'})}^{- 1} (X - \sum_{r} d_{k} x_{k})) . (2.3)$

The resulting estimator will henceforth be denoted ${\hat{t}}_{y cal} .$ (Other approaches, including two-step procedures, are presented and investigated by e.g., Andersson and Särndal (2016).)

An evident question to ask is: What is the underlying distance measure generating these weights? Särndal and Lundström (2005) do not comment on this particular issue, but according to Lundström and Särndal (1999), we should choose $“ w_{k}$ ‘as close as possible’ to the $d_{k} ”,$ which does not seem quite adequate under nonresponse. Going back to Lundström (1997) we will find that the corresponding distance measure is actually

${(w_{r} - w_{0 r})}^{'} {(w_{0 r} I_{n_{r}})}^{- 1} (w_{r} - w_{0 r}),$

where $w_{r} = {(w_{k r})}_{k \in r}$ and $w_{0 r} = {(d_{k})}_{k \in r} .$

If we have a random mechanism generating the response set $r$ from the sample $s$ with probabilities $θ_{k}$ of inclusion, we can view the nonresponse situation as a two-phase design and this is the assumption we will make in the following. Then we should minimize the distance between $w_{k r}$ and $d_{k} \cdot (1 / θ_{k}) .$ Using some modelling $θ_{k}$ can be estimated by ${\hat{θ}}_{k},$ to be put to use for the distance minimization. But in this paper we will not go in the direction of model-based inference. In order to reduce the bias effect under nonresponse one could instead in the distance measure think of comparing $w_{k r}$ not with $d_{k},$ but with $d_{k, alt} = d_{k} \cdot c,$ where $c$ is a constant larger than 1, aiming to compensate for the “average” nonresponse effect.

However, Lundström (1997) shows that in many important cases, namely when one can find a vector $μ$ for which $μ^{'} x_{k} =1,$ for all $k,$ the multiplicative increase in $d_{k, alt}$ implies the same resulting calibration weights $w_{k r} .$ This follows from the result that if $μ^{'} x_{k} =1,$ for all $k \in U,$ we can simplify the expression (2.3) of $w_{k r}$ as

$w_{k r} = d_{k} x_{k}^{'} {(\sum_{r} d_{k} x_{k} x_{k}^{'})}^{- 1} X .$

Thus, we have an invariance property for the weights. The result holds also when the population is partitioned into groups and the initial weights are inflated with a constant within each group. Note that if we include a constant, e.g., “ 1” , as a first component of the auxiliary vector $x_{k},$ we can simply let $μ^{'} = (1, 0, \dots, 0)$ to achieve $μ^{'} x_{k} =1.$

With this as a background we propose to use alternative “optimal” weights resulting from the distance measure

${(w_{r} - w_{0 r})}^{'} {(\frac{π_{k l} - π_{k} π_{l}}{π_{k l} π_{k} π_{l}})}_{k, l \in r}^{- 1} (w_{r} - w_{0 r}),$

leading to ${\hat{t}}_{y opt} .$ $(π_{k l}$ denotes the inclusion probability for the pair $(k, l)) .$

It is to be observed that as for the full response situation, there are cases for which the “optimal” weights are identical to (2.3), as e.g., under simple random sampling.

Using quotation marks around optimal is deliberate, but under full response optimal has a very clear meaning. As mentioned earlier, the optimal regression estimator has asymptotically minimum variance among linear regression estimators. Adding nonresponse where the nonresponse mechanism is at least partially unknown, makes it difficult to define optimality criteria in a proper way.

For this “optimal” measure it might be fruitful to replace $d_{k}$ with $d_{k, alt},$ where we include in $d_{k, alt}$ the reciprocal of an estimate of the average response probability ${\bar{θ}}_{U} = \sum_{U} θ_{k} / N .$ One simple candidate is

${\hat{\bar{θ}}}_{U} = n_{r} / n_{s},$

thus yielding $d_{k, alt} = d_{k} \cdot (n_{s} / n_{r}) .$ Another natural choice is

${\hat{\bar{θ}}}_{U} = \sum_{r} d_{k} / \sum_{s} d_{k}, (2.4)$

since $E (\sum_{s} d_{k}) = N$ and $E (\sum_{r} d_{k}) = \sum_{U} θ_{k} = N \bar{θ},$ which lead to $E (\sum_{r} d_{k} / \sum_{s} d_{k}) \approx {\bar{θ}}_{U} .$ The resulting modified estimator is denoted by ${\hat{t}}_{y optm} .$ (Also observe that $E (n_{r} / n_{s}) \approx \sum_{U} (θ_{k} / d_{k}) / \sum_{U} (1 / d_{k}) .$

In the following simulation study we will focus on a sampling design where generally ${\hat{t}}_{y cal} \neq {\hat{t}}_{y opt},$ namely Poisson sampling. The independence of drawings simplifies the “optimal” distance measure:

$\sum_{r} \frac{π_{k}^{2}}{1 - π_{k}} {(w_{k r} - d_{k})}^{2} = \sum_{r} \frac{{(w_{k r} - d_{k})}^{2}}{d_{k} (d_{k} - 1)}$

and minimization yields

$w_{k r} = d_{k} (1 + (d_{k} - 1) x_{k}^{'} {(\sum_{r} d_{k} (1 - d_{k}) x_{k} x_{k}^{'})}^{- 1} (X - \sum_{r} d_{k} x_{k})) .$

For the modified “optimal” estimator $d_{k}$ is replaced by $d_{k alt} = d_{k} \cdot (1 / {\hat{\bar{θ}}}_{U}),$ with ${\hat{\bar{θ}}}_{U}$ as in (2.4).

2.2.1 Bias for calibration estimators under nonresponse

We can write ${\hat{t}}_{y cal}$ as

${\hat{t}}_{y cal} = \sum_{r} d_{k} y_{k} + {\hat{B}}_{U; θ} (X - \sum_{r} d_{k} x_{k}), (2.5)$

where ${\hat{B}}_{U; θ} = (\sum_{r} d_{k} x_{k}^{'} y_{k}) {(\sum_{r} d_{k} x_{k} x_{k}^{'})}^{- 1} .$ In order to arrive at an approximate expression for the bias of ${\hat{t}}_{y cal}$ and subsequently ${\hat{t}}_{y opt}$ and ${\hat{t}}_{y optm},$ we follow the derivation in Särndal and Lundström (2005) and first note that ${\hat{t}}_{y cal}$ can be rewritten as

${\hat{t}}_{y cal} = \sum_{r} d_{k} y_{k} + B_{U; θ} (X - \sum_{r} d_{k} x_{k}) + ({\hat{B}}_{U; θ} - B_{U; θ}) (X - \sum_{r} d_{k} x_{k}),$

where $B_{U; θ} = (\sum_{U} θ_{k} x_{k}^{'} y_{k}) {(\sum_{U} θ_{k} x_{k} x_{k}^{'})}^{- 1} .$

If we let ${\hat{t}}_{y cal} - t_{y} = A_{1} + A_{2},$ where $A_{1} = \sum_{r} d_{k} y_{k} - t_{y} + B_{U; θ} (X - \sum_{r} d_{k} x_{k})$ and $A_{2} = ({\hat{B}}_{U; θ} - B_{U; θ}) (X - \sum_{r} d_{k} x_{k}),$ it can further be shown that

$A_{1} = \sum_{r} d_{k} e_{θ k} - \sum_{U} e_{θ k} + B_{U; θ}^{o} (\sum_{s} d_{k} x_{k}^{o} - \sum_{U} x_{k}^{o}),$

where $e_{θ k} = y_{k} - B_{U; θ} x_{k}$ and $B_{U; θ}^{o} = {(\sum_{U} θ_{k} x_{k}^{o} x_{k}^{o}^{'})}^{- 1} \sum_{U} θ_{k} x_{k}^{o} y_{k} .$

Then

$E ({\hat{t}}_{y cal}) - t_{y} \approx E (A_{1}) = \sum_{U} θ_{k} e_{θ k} - \sum_{U} e_{θ k} = - \sum_{U} (1 - θ_{k}) e_{θ k},$

since it can be argued that ${\hat{B}}_{U; θ}$ is a consistent estimator of $B_{U; θ}$ and therefore $E (A_{2}) \approx 0.$

The approximation for the bias of ${\hat{t}}_{y cal}$ is called the nearbias:

$nearbias ({\hat{t}}_{y cal}) = - \sum_{U} (1 - θ_{k}) e_{θ k} .$

The nearbias of ${\hat{t}}_{y cal}$ is zero if $θ_{k} =1,$ for all $k \in U$ and/or $y_{k} = B_{U; θ} x_{k},$ for all $k \in U .$

Then, if we consider ${\hat{t}}_{y opt},$ we have that

${\hat{t}}_{y opt} = \sum_{r} d_{k} y_{k} + (X - \sum_{r} d_{k} x_{k}) {\hat{C}}_{U; θ}, (2.6)$

where

${\hat{C}}_{U; θ} = (\sum_{k \in r} \sum_{l \in r} \frac{π_{k l} - π_{k} π_{l}}{π_{k l}} \frac{x_{k}^{'}}{π_{k}} \frac{y_{l}}{π_{l}}) {(\sum_{k \in r} \sum_{l \in r} \frac{π_{k l} - π_{k} π_{l}}{π_{k l}} \frac{x_{k}}{π_{k}} \frac{x_{l}^{'}}{π_{l}})}^{- 1} .$

Since ${\hat{t}}_{y opt}$ can be written as (2.6), which is of the same form as for ${\hat{t}}_{y cal}$ in (2.5), we will again arrive at the nearbias expression

$nearbias ({\hat{t}}_{y opt}) = - \sum_{U} (1 - θ_{k}) e_{θ k}, (2.7)$

where $e_{θ k} = y_{k} - C_{U; θ} x_{k}$ and with $θ_{k l}$ denoting the response probability for the pair $(k, l):$

$C_{U; θ} = (\sum_{k \in U} \sum_{l \in U} θ_{k l} (π_{k l} - π_{k} π_{l}) \frac{x_{k}^{'}}{π_{k}} \frac{y_{l}}{π_{l}}) {(\sum_{k \in U} \sum_{l \in U} θ_{k l} (π_{k l} - π_{k} π_{l}) \frac{x_{k}}{π_{k}} \frac{x_{l}^{'}}{π_{l}})}^{- 1} .$

If we use the alternative weighting $d_{k, alt} = d_{k} \cdot (1 / \hat{\bar{θ}}) = d_{k} \cdot (\sum_{s} d_{k} / \sum_{r} d_{k}),$ we get that

nearbias $({\hat{t}}_{y optm}) = E (\sum_{r} d_{k, alt} e_{θ k} - \sum_{U} e_{θ k}) \approx \sum_{U} \frac{θ_{k}}{{\bar{θ}}_{U}} e_{θ k} - \sum_{U} e_{θ k} = - \sum_{U} (1 - \frac{θ_{k}}{{\bar{θ}}_{U}}) e_{θ k},$

where $\sum_{U} (1 - (θ_{k} / {\bar{θ}}_{U})) =0,$ to be compared with (2.7), where $\sum_{U} (1 - θ_{k}) = N (1 - {\bar{θ}}_{U}) .$

Unless $μ^{'} x_{k} =1,$ for all $k \in U,$ an equivalent expression can be obtained for ${\hat{t}}_{y cal} .$ On the other hand, if the restriction $μ^{'} x_{k} =1,$ for all $k \in U$ does hold, it can be shown (Särndal and Lundström (2005)) that

$nearbias ({\hat{t}}_{y cal}) = - \sum_{U} e_{θ k},$

which holds independently of the sampling design and which is a result completely in line with the aforementioned invariance property of the calibration weights.

ISSN : 1492-0921

Editorial policy

Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.

Submission of Manuscripts

Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).

Note of appreciation

Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.

Standards of service to the public

Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.

Copyright

Published by authority of the Minister responsible for Statistics Canada.

Use of this publication is governed by the Statistics Canada Open Licence Agreement.

Catalogue No. 12-001-X

Frequency: Semi-annual

Ottawa

Date modified:: 2019-12-17

Language selection

Search and menus

Search

“Optimal” calibration weights under unit nonresponse in survey sampling
Section 2. Calibration estimation

2.1 Calibration estimators under full response

2.2 Calibration estimators under nonresponse

2.2.1 Bias for calibration estimators under nonresponse

“Optimal” calibration weights under unit nonresponse in survey sampling Section 2. Calibration estimation

2.1 Calibration estimators under full response

2.2 Calibration estimators under nonresponse

2.2.1 Bias for calibration estimators under nonresponse

Editorial policy

Submission of Manuscripts

Note of appreciation

Standards of service to the public

Copyright

“Optimal” calibration weights under unit nonresponse in survey sampling
Section 2. Calibration estimation