Browse by

4. Two-step calibration weighting

Phillip S. Kott and Dan Liao

4.1 Calibration weighting in two steps

In practice, the components of $x_{k}$ are often 0/1 group-membership identifiers, and the groups are mutually exclusive and exhaustive. In that situation, $g^{T} x_{k}$ can only take on $P$ values. Almost any weight-adjustment function, $α (g^{T} x_{k}),$ will yield equivalent results. An example is the linear function, $α (g^{T} x_{k}) = 1 + g^{T} x_{k},$ of Lundström and Särndal (1999).

One popular weight-adjustment function that sometimes cannot be used (note the italicized “almost” in the previous paragraph) is $α (g^{T} x_{k}) = 1 + \exp (g^{T} x_{k}),$ which assumes response is a logistic function of $x_{k} .$ The problem is that this weight-adjustment function cannot return values less than unity. We noted in the previous section, that sometimes one may need $α_{k}$ to be less than 1. A routine that tries to use $α (g^{T} x_{k}) = 1 + \exp (g^{T} x_{k})$ and fit the calibration equations will fail.

This can be a particular problem when assuming a logistic response model and trying to calibrate to the population in a single step. There may be a component of $z_{k},$ say $z_{k a},$ that is always nonnegative, but the original sample and response set are such that $\sum_{R} d_{k} z_{k a} > \sum_{U} z_{k a}$ even though $\sum_{R} d_{k} z_{k a}$ cannot exceed $\sum_{S} d_{k} z_{k a} .$ Thus, calibrating to the population will always fail because no $α_{k}$ can be less than 1.

Calibrating to the original sample, by contrast, need not fail, since $\sum_{R} d_{k} z_{k a} \leq \sum_{S} d_{k} z_{k a} .$ This suggest that one calibrates first to the original sample, which removes the response bias if the assumed response model holds, and then to the population, which removes the response bias if the prediction model holds. Estevao and Särndal (2002) discuss a variety of ways to calibrate in steps, but we focus on a single method here.

A second advantage of calibration weighting in two steps can be realized even when the calibration variables used in both steps are the same or a subset of those used in the single step. This happens when the response model holds, and the linear prediction model is only roughly true. Some version or “optimal” estimation can then be used in the second calibration-weighting step to increase efficiency. Rao (1994) introduced the notion of the optimal regression estimator. It was put into calibration-weighting form and discussed further in Bankier (2002) and Kott (2009, Section 4.2). Detail and how this can be done are provided in Sections 4.2 and 5.

4.2 Estimation and variance estimation when calibrating in two steps

In this subsection, we start with a fairly general two-step calibration estimator for a total and then address estimating its variance. The first calibration-weighting step, which is to the original sample, employs $x_{1 k}$ as the vector of response-model variables and $z_{1 k}$ as the calibration vector. Each has $P_{1}$ components. The weight-adjustment function has the form described in equation (2.4) with $g_{1}$ now replacing $g .$ The calibration equation is $\sum_{R} d_{k} α (g_{1}^{T} x_{1 k}) z_{1 k} = \sum_{S} d_{k} z_{1 k} .$

The second calibration-weighting step, which is to the population, employs $x_{2 k}$ and $z_{2 k},$ each with $P_{2}$ components. The nonresponse bias under the response model is removed in the first step. For the weight-adjustment function for the second step, we propose using

$h_{k} (g_{2}^{T} x_{2 k}) = \frac{ℓ_{k} + \exp (g_{2}^{T} x_{2 k})}{1 + \exp (g_{2}^{T} x_{2 k}) / u_{k}}, (4.1)$

where $u_{k} > ℓ_{k} > 0$ may be set almost at whim (but see below). The right-hand side of equation (4.1) can vary across the $k$ (and so can depend on $d_{k}$ and $α_{k}),$ yet $h_{k} (0) = {h^{'}}_{k} (0) = 1,$ making it asymptotically indistinguishable from the linear function: $1 + g_{2}^{T} x_{2 k} .$ For simplicity, we will call $h_{k} (g_{2}^{T} x_{2 k})$ and ${h^{'}}_{k} (g_{2}^{T} x_{2 k}), h_{k}$ and ${h^{'}}_{k}$ respectively. From a quasi-sampling-design viewpoint, both are asymptotically identical to unity. The second calibration equation is $\sum_{S} d_{k} h_{k} (g_{2}^{T} x_{2 k}) z_{2 k} = \sum_{U} z_{2 k} .$ Because this equation must hold, there are limits on the available choices for $u_{k}$ and $ℓ_{k}$ in equation (4.1).

A good simultaneous variance estimator for $t_{y} = \sum_{R} w_{k} y_{k} = \sum_{R} d_{k} α (g_{1}^{T} x_{1 k}) h_{k} (g_{2}^{T} x_{2 k}) y_{k}$ is (as we shall see)

$\begin{array}{l} v (t_{y}) = & \sum_{k, j \in S} (1 - \frac{π_{k} π_{j}}{π_{k j}}) [d_{k} (z_{1 k}^{T} b_{1} + α_{k} h_{k} e_{1 k})] [d_{j} (z_{1 j}^{T} b_{1} + α_{j} h_{j} e_{1 j})] \\ + \sum_{k \in R} d_{k} (h_{k}^{2} α_{k}^{2} - h_{k} α_{k}) e_{1 k}^{2}, \end{array} (4.2)$

where

$e_{2 k} = y_{k} - z_{2 k}^{T} {(\sum_{S} d_{j} α_{j} {h^{'}}_{j} x_{2 j} z_{2 j}^{T})}^{- 1} \sum_{S} d_{j} α_{j} {h^{'}}_{j} x_{2 j} y_{j}, (4.3)$

$b_{1} = {(\sum_{S} d_{f} {α^{'}}_{f} x_{1 f} z_{1 f}^{T})}^{- 1} \sum_{S} d_{f} {α^{'}}_{f} h_{f} x_{1 f} e_{2 f}, (4.4)$

and

$e_{1 k} = e_{2 k} - x_{1 k}^{T} b_{1} . (4.5)$

Let $x_{k}$ now be the vector composed of the non-duplicated components of $x_{1 k}$ and $x_{2 k}$ and define $z_{k}$ analogously. Sufficient conditions for (4.2) to be a simultaneous variance estimator include the corresponding components of equation (4.1) depending on whether either the response model in equation (2.4) holds with $x_{1 k}$ replacing $x_{k}$ or the prediction model is $E (y_{k} | x_{k}, z_{k}) = z_{2 k}^{T} β_{2},$ whether or not $k$ is sampled or responds if sampled, and the $ε_{2 k} = y_{k} - z_{2 k}^{T} β_{2}$ are uncorrelated random variables with variances equal to $σ_{2 k}^{2} = z_{2 k}^{T} η_{2},$ where $η_{2}$ need not be specified other than having finite components. Now, both $N^{- 1} \sum_{R} d_{k} α^{'} (g_{1}^{T} x_{1 k}) z_{1 k} x_{1 k}^{T}$ and $N^{- 1} \sum_{R} d_{k} {h^{'}}_{k} (g_{2}^{T} x_{2 k}) z_{2 k} x_{2 k}^{T}$ are assumed to be of full rank and bounded as the sample size grows arbitrarily large.

The variance estimator in equation (4.2) is almost the same as the estimator in (3.1): $x_{k}$ has been replaced with $x_{1 k}$ and $z_{k}$ with $z_{1 k},$ while $h_{k} e_{2 k}$ substitutes for $y_{k}$ (we will get to a small difference shortly). Observe that $e_{2 k}$ is effectively an expression of the “residual” from the second calibration-weighting step. This residual is multiplied by the weight-adjustment factor $h_{k},$ which is asymptotically unity from the quasi-sampling-design-based perspective and a constant from the prediction-model viewpoint. The product is then used to create the first-step “regression-coefficient” $b_{1}$ in equation (4.4) and its accompanying “residual” $e_{1 k}$ in equation (4.5). We do the second step regression first because $t_{y} - T_{y} = \sum_{R} w_{k} y_{k} - \sum_{U} y_{k} = \sum_{R} w_{k} e_{2 k} - \sum_{U} e_{2 k} .$

It is for estimating the prediction model of $t_{y}$ as an estimator of $T_{y}, \sum_{S} (w_{k}^{2} - w_{k}) σ_{2 k}^{2},$ that the last appearance of $h_{k}$ on the right-hand side of equation (4.2) is not squared, as it would be if $h_{k} e_{2 k}$ substituted for $y_{k}$ everywhere. From a quasi-design viewpoint, $h_{k}$ is asymptotically identical to unity, so whether or not it is squared makes no asymptotic difference.

Observe that the ${h^{'}}_{j}$ have been inserted in equation (4.3) for the same reason as $α^{'}$ was inserted into $b$ in equation (3.1). Since the ${h^{'}}_{j}$ are asymptotically unity, however, they are not really needed (and serve no function whatever from a prediction-model viewpoint). A similar argument applies to the $h_{f}$ in equation (4.4): they are asymptotically unity from the quasi-sampling-design viewpoint (and part of an estimate of 0 from a prediction-model viewpoint).

Previous | Next

Date modified:: 2015-11-27

Language selection

Search and menus

Search

Publications

Survey Methodology

Browse by

4. Two-step calibration weighting

4.1 Calibration weighting in two steps

4.2 Estimation and variance estimation when calibrating in two steps