5. The proposed estimator and its variance estimation
Alina Matei and M. Giovanna Ranalli
Previous | Next
Recall
that we have a variable of particular interest
and that item
nonresponse is present for it. If we wish to estimate the population total
of
then a naive
estimator that does not correct neither for unit nor for item nonresponse is
given by
Reweighting
item responders is also an approach to handle item nonresponse. Moustaki and
Knott (2000) propose to weight item responders by the inverse of the fitted
probability of item response
assuming
Therefore, a
possible adjustment weight for item and unit nonresponse associated with unit
is given by
We propose using
the three-phase estimator adjusted for item and unit nonresponse via
reweighting given by
where
is provided by
Model (4.4), and
by Model (4.2).
Proposals that use imputation of
values for
to deal with
item nonresponse are also considered but not reported for reasons of space. They
are available from the Authors upon request.
The
properties of the proposed estimator (5.2) depend on the assumptions made about
the unit and the item nonresponse mechanisms. In particular, Estimator (5.2)
assumes a second phase of sampling with unknown response probabilities. If we
ignore estimation of
in Model (4.4),
the results in Kim and Kim (2007) on design consistency of the two-phase
estimator that uses estimated response probabilities hold here as well when considering
maximum likelihood estimates for the parameters
and
Again, ignoring
estimation of the latent variable
and using
marginal maximum likelihood estimates for the parameters
and
in Model (4.2),
estimator
will be
consistent if the models for unit and item nonresponse probabilities are
correctly specified.
We
can consider replication methods for variance estimation of the proposed
estimator and combine proposals for two-phase sampling (Kim, Navarro and Fuller 2006) and for generalized calibration in the presence of nonresponse
(Kott 2006). In particular, the replicate variance estimator can be written as
where
is the
version of
based on the
observations included in the
replicate,
is the number of
replications,
is a factor
associated with replicate
determined by
the replication method. The
replicate of
can be written
as
where
denotes the
replicate weight for the
unit in the
replication.
These replicate weights are computed using a two-step procedure.
First,
note that, if we ignore for the moment the presence of item nonresponse, the
two-phase estimator
has weights
with,
(see Equation
(4.4)). Let
be the first
phase estimate of the total of variable
defined as
Then, parameters
and
are such that
This procedure is equivalent to obtaining unweighted maximum likelihood
estimates, but is convenient to set it as a non-linear generalized calibration
problem. In this way, it is possible to use the approach in Kott (2006),
combined with that in Kim et al. (2006), to obtain replicate weights using
the following steps.
Step 1: Compute the first phase
estimate of the total of
with
observation
deleted, i.e.,
where
is the classical
jackknife replication weight for unit
in replication
Compute the
jackknife weights for the second phase sampling using
as a benchmark. In
particular,
are chosen to be
with
and
such that
This procedure provides weights that are very similar to those considered
in Kott (2006) and can be computed using existing software that handles
generalized calibration.
Item nonresponse is handled similarly by considering
(compare Equation
(4.3)). A major approximation here is to assume that, given
parameters
and
are estimated
using a classical logistic model (instead of a 2PL model) and are such that
where
and
Another drawback
is that auxiliary variables
depend on
and, therefore,
different sets of weights have to be produced for the different variables of
interest.
Step 2: Third phase jackknife weights
are obtained by first computing the second phase estimate of the total of
with unit
removed by using
weights coming from Step 1, i.e.,
Then, using
as a benchmark,
are chosen to be
with
and
computed via
Previous | Next