A method to find an efficient and robust sampling strategy under model uncertainty
Section 5. The risk measure under the Generalized Regression Estimator
The difference estimator
(2.1) requires that
is fully
specified in order to calculate
which is
undesirable from a practical standpoint. The generalized regression (GREG)
estimator is an alternative that allows for the estimation of all or some of
the components of
at the cost of
introducing a small bias. In this section we adapt the material in Sections 2
to 4 to strategies using the GREG estimator.
We define the generalized
regression estimator in a more general way than in Särndal et al. (1992)
as follows. Let
be a weight
defined by the statistician and
where
is fixed and
is to be
estimated. Let also
and
The GREG estimator is
An approximation to the design MSE of the GREG estimator is of the form
(2.2) with
where
and
Example 1. Let us consider the case where
Let
and
In this case we obtain
Letting the exponents
we obtain the classical expression of the GREG
estimator found in Särndal et al. (1992).
Example 2. The case with only one auxiliary variable, i.e.,
with
and
is known as the regression estimator. In this
case we obtain the well known result that the design MSE can be approximated by
expression (2.2) with
where
and
The misspecified model
Let us consider again the
situation where the statistician uses the working model (2.4) but the true
model is of the form (3.1) with
where
is the
counterpart of the fixed component
The following
result states a condition under which Result 1 is valid for the GREG
estimator.
Result 2. If
is assumed when
is the true superpopulation model,
as
and
converges to some
as
then
where
Proof. Note that if
then
Thus,
In turn, if
then
Thus
Therefore,
which, by Result 1, is (5.2).
Example 3 (Continuation of Example 1). Let the working model be as in
Example 1 and the true model be
Let also
and
In this case,
where
and (5.2)
becomes
Example 4 (Continuation of Example 2). Let the working model be as in
Example 2 and the true model be
with
and
It can be shown that (5.2) becomes
with
and
Note that (5.4) does not depend on
For the particular case
developed in Examples 2 and 4, where
and
an alternative
approximation of
is (Proof in
the Appendix)
where
with
and
and
are, respectively, the correlation
coefficients between
and
and between
and
The latter is unknown but often some decent
guess about it is available.
The approximation of
in (5.6) is
more convenient than the one in (4.2) as now we have that (5.4) is approximated
by
with
given by (5.5). This expression depends
neither on the intercept
nor the parameter
and the slope
becomes a proportionality constant that can be
ignored.
The risk measure
As in Section 4, the
asymptotic model expected MSE of the GREG estimator given by Result 2 can
be seen as the loss incurred by assuming that
is the true
parameter when it is, in fact,
Assuming a
prior distribution on
the risk (4.1)
can be calculated.