How to decompose the non-response variance: A total survey error approach
Section 3. Unit-level error decomposition of variance components
This section describes the approach used to evaluate the contribution of a given nonresponding unit,
to
the estimated total variance for the estimation of a total for a given
variable.
The unit-level error decomposition,
of the total variance for a given unit,
is
defined as the difference between the estimated total variance, and the projected total variance,
i.e.,
The
superscript
is used to indicate projected quantities when unit
is
converted to a respondent. So,
can be
seen as the expected gain, in terms of total variance, of converting a
nonrespondent unit
to a
respondent.
In order to get
is moved from
to
generating the new partition
of the sample from
where
and
as illustrated in Figure 3.1.

Description for Figure 3.1
Figure illustrating the sample partitions. Partition where and Unit is moved from to generating the new partition of the sample from where and
Some assumptions are necessary to decompose the variance
components. It is recognized that these assumptions may not perfectly hold in reality. However, they can be used to
generate accurate results, as
shown in the simulation in Section 4. The required assumptions are:
- Projected reported value: let
be converted to a response and let
- Projected imputation parameters:
and
- Projected imputation relationship
matrix:
and
if
or if
or
otherwise. Similarly,
if
or
otherwise.
Assumption
1 implies that if a nonresponding unit,
would have been converted to a respondent, its
reported value is equal to its
imputed value. This is not true
generally, but the imputed value is our best estimate. The expectation is
that this imputed value is close enough to the reported value to estimate the
error on the variance components. This assumption will have an impact when the sampling variance is decomposed.
Assumption
2 states that the estimated parameters of the imputation model would remain
unchanged if
were a respondent. In the case of a consistent
imputation model parameter estimator, this assumption becomes more realistic
when
is larger.
Finally, assumption 3 means that the imputation
relationship between nonrespondents
and respondents remains unchanged, except when unit
is involved. In other words, the converted
unit,
is no longer imputed from respondents, but will not be used to impute other
nonresponding units. Figure 3.2
shows how assumption 3 is reflected in terms of the phi matrix.

Description for Figure 3.2
Figure illustrating the initial and projected imputation relationship
phi matrices. Columns of the initial relationship matrix are units including unit Rows are units Columns of the projected relationship matrix are units excluding unit Rows are units including unit The row associated with unit is a row of 0. Other values in matrix are the same as the ones in matrix .
Therefore, the compensation weight,
of a
responding unit,
is projected as
The marginal weight from the converted unit
is withdrawn
from the original compensation
weight,
to obtain the new
Note
that
because
under
assumption 3. As mentioned above, it means that
isn’t used to impute nonrespondents.
In the next subsections, the unit-level error
decomposition for unit
is computed for the four variance components,
as described in Section 2.3.
3.1 Unit-level error decomposition
of the naive sampling variance
The quantity
depends on the
values,
the final weights and the first-order and second-order selection probabilities.
The unit-level error decomposition of the naive sampling variance
component
is trivial since the assumption that unit
goes from
to
does not change weights and selection
probabilities. Under assumption 1, the projected reported value
is set to
so that
when
is converted to a responding unit. Consequently, the decomposition of
is given by
This result is consistent with the idea that the naive sampling
variance point estimate will likely change,
but it is not expected to decrease with an extra responding unit.
3.2 Unit-level
decomposition of the correction to the sampling variance component
The unit-level error decomposition for unit
of the correction to the sampling variance
component,
is given by
Under assumption 2,
so that
The astute reader will notice that the actual sampling
variance (not its estimation) should not be impacted by whether or not a unit
is a respondent. However, we decided to include the impact of a unit on the
sampling variance estimation in order to be coherent in the way we treat the
three components
and
3.3 Unit-level
decomposition of the non-response variance component
The unit-level error decomposition for unit
of the non-response variance component
is given by
Under assumptions 2 and 3,
and
This can be rewritten as
Using formula
(3.1), this becomes
3.4 Unit-level
decomposition of the mixed variance component
Finally, the impact of unit
on the variance component term,
is given by
This equation can be rewritten as follows, under
assumptions 2 and 3 and equation (3.1)
In Section 2.3, the estimation of the total
variance,
has been defined as
Similarly, the impact of unit
on
is defined as
where
and
are respectively given by equations (3.2),
(3.3), (3.4) and (3.5).
It can be observed (proofs are given in the appendix) that
and
However, this linear relation doesn’t hold for
This property is important to consider
because, for
and
the
sum of the unit-level errors on all nonresponding units,
is equal to the corresponding estimated variance component. In the case of non-response
variance component, the sum of the errors is different than
The difference is given by
This difference can be relatively small, especially in
business surveys characterized with asymmetric data. This is the case when
This is in line with the results shown by
Mills et al. (2013).
Overall, the total variance can be considered as
approximately linear in terms of the unit-level errors, especially in the case
of sample surveys where
and
are significant contributors to the total
variance.