Small area benchmarked estimation under the basic unit level model when the sampling rates are non‑negligible
Section 4. Simulation study
We report in this section the results of a design-based
simulation study as it is in line with measures that are computed by the
National Statistical Offices. A design-based study is one where a fixed finite
population is first generated using an assumed model, and then for each
simulation run, a sample is drawn employing the fixed finite population. The
aim of the simulation study is to evaluate the properties of the benchmarked
estimators described in Section 3 in terms of design bias and design mean
squared error. We considered two scenarios: Scenario 1 corresponds to the case of correct modeling, whereas Scenario 2 corresponds to the case of incorrect modelling.
Model diagnostics such as those given in Rao and Molina (2015, pages 114-118),
can be used to test whether the models are correct or not. Such model
diagnostics include residual analysis to detect departures from the assumed
model, selection of auxiliary variables for the model, and case-deletion
diagnostics to detect influential observations.
4.1 Simulation set-up
for generating the finite populations
For each scenario, we considered five populations. Each
population had
30 small areas, with
100 population units within each small area.
The populations corresponding to Scenario 1 were created using the following model
where
and
For
generating the populations in Scenario 2,
we split the 30 small areas into three equal groups of small areas, denoted as
for
The
first group
contains
areas
the
second group
contains
areas
and the
third group
contains
areas
The
model within a given group is given by
where
for areas
for
areas
and
for
areas
Both (4.1)
and (4.2) use the auxiliary variable
whose
values
were
generated from an exponential distribution with mean equal to 5 and variance
equal to 25.
The random components in (4.1) and (4.2) were generated
from the normal distributions
and
The five populations corresponding to Scenario 1, denoted as A1, B1, C1,
D1 and E1, were generated based on (4.1) and the following variance parameters
doublets: i.
for population A1; ii.
for population B1; iii.
for population C1; iv.
for population D1; and
for population E1. Note that, for populations
A1 through E1, the value of
is kept fixed, whereas the values for
vary. The
are chosen to obtain the following variance
ratios
as 0.01, 0.05, 0.1, 0.2 and 1. The five
populations in Scenario 2,
denoted as A2, B2, C2, D2 and E2, were generated based on (4.2) with the same
variance parameters doublets as for Scenario 1.
A stratified sampling design was used by drawing
independent probability proportional to size samples (pps) of size
within the
small area. The small area sample sizes were
taken
for
The selection probabilities were computed as
where the size measures are
We used Conditional Poisson Sampling (CPS) to
select the pps samples within each small area (see Tillé (2006), Chapter 5).
The basic design weights are given by
In Scenario 1,
we fitted the nested regression model (4.1) and its augmented version to pps
sampling data selected from one of the five populations generated with model (4.1).
This scenario represents correct modeling as the model fitted and the model
used to generate the finite population coincide. In Scenario 2, we fitted the nested regression model (4.1) and
its augmented version to pps sampling data selected from one of the five
populations generated with model (4.2). This scenario represents incorrect
modeling as the model fitted and the model used to generate the finite
population do not coincide.
We selected
30,000 stratified pps samples from each of the
ten finite populations: populations A1 to E1 corresponding to Scenario 1, and populations A2 to
E2 corresponding to Scenario 2.
For
let
and
denote respectively the estimates of
given by the truncated REML method and its
re-parameterized version, that correspond to the
sample. The starting values in equation (B.2)
were
and
Equation (B.2) reached convergence in less
than 15 iterations for all the populations and both scenarios. Based on the
simulated samples selected in each of the five
populations corresponding to Scenario 1,
we computed the Monte Carlo value of the probability to obtain a zero truncated
REML estimate for
as
where
is an
indicator function with value 1 if condition
holds,
and 0 otherwise.
Table 4.1 displays the Monte Carlo values of the
probability to get a zero estimate for
It can be seen that the simulated probability
can be as high as 0.47 for
0.01. As
increases, this empirical probability
decreases. Table 4.1 clearly shows that estimates
cannot be used to compute the restricted EBLUP
and YR estimators for samples selected in populations A1, B1, C1 and D1.
Table 4.1
Values of
Scenario 1
Table summary
This table displays the results of Values of
Scenario 1 Pop A1
, Pop B1
, Pop C1
, Pop D1
and Pop E1
(appearing as column headers).
|
Pop A1
|
Pop B1
|
Pop C1
|
Pop D1
|
Pop E1
|
|
|
0.47 |
0.40 |
0.21 |
0.06 |
0.00 |
Figure 4.1 displays the number of iterations to
convergence of the Fisher-scoring algorithm for the estimate
of
The algorithm stops when the value of
is less than
where
represents the
iteration computed with equation (B.2) in
Appendix B. The percentages of Figure 4.1 are based only on samples with a
truncated REML estimate of
that is
We only considered populations A1, B1, C1 and
D1, as these four populations have non‑negligible probabilities for
to be null. Figure 4.1 clearly shows that
the convergence is attained in a maximum of 11 iterations.

Description for Figure 4.1
Figure showing the percentage of iterations to converge in samples with
Four bar charts are presented for Pop A1, Pop B1, Pop C1 and Pop D1. The percentage from 0 to 100 is on the y-axis. The number of iterations to converge is on the x-axis, ranging from 4 to 10 for Pop A1 and Pop B1, from 4 to 11 for Pop C1 and from 4 to 9 for Pop D1. The convergence is attained in a maximum of 11 iterations. The convergence appears to be faster for Pop A1 and Pop B1 and slightly slower for Pop D1.
4.2 Comparison between
the benchmarked estimators
The aim of the simulation study is to compare the
benchmarked estimators described in Section 3 in terms of design bias and
design mean squared error. We used both scenarios as we wanted to check how
benchmarking protects against incorrect modeling. Furthermore, we considered
the benchmark to two GREG estimators:
and
Estimator
has weights given by (3.2) calibrated on the
auxiliary vector
associated with the small area model. It
follows that estimator
corresponds to the case
The second GREG estimator
has weights given by (3.2) based on auxiliary
vector
where the values
were generated from an exponential
distribution with mean equal to 5 and variance equal to 25, and independently
of the values
It follows that estimator
corresponds to the case
since the auxiliary variable
associated with the unit-level model (4.1) do
not belong to vector
used to obtain the weights associated with
For a fixed finite population, let
be the mean of the small area
and
a generic estimator of
We denote by
the value of
based on the
simulated sample, for
The estimators described in Section 3
respect the benchmark property regardless of the method used to estimate the
variance components. Since the restricted benchmarked estimators are based on
estimates
we decided to use reREML for computing
for each estimator
evaluated in this simulation study.
We considered the following performance measures:
Average Absolute Relative Bias
Average Relative Root Mean Squared Error
This portion of the simulation is summarized in four
tables. We provide the results separately for Scenarios 1 and 2.
The results for the case when the benchmarking is to
(the case
are summarized in Tables 4.2 (Scenario 1) and 4.3 (Scenario 2). Those for the case
when the benchmarking is to
(the case
are summarized in Tables 4.4 (Scenario 1) and 4.5 (Scenario 2).
Benchmarking to
(the case
)
We computed
the
and
for two non benchmarked estimators,
and
as well as their corresponding estimators
benchmarked to
For
we have three benchmarked estimators
and
given respectively by equations (3.5), (3.8)
and (3.13). For
the corresponding benchmarked estimators are
and
given respectively by equations (3.5), (3.9)
and (3.15).
We first discuss their properties when the model is
correct (Scenario 1). Comparing
the
across all the estimators in Table 4.2,
we observe that there is not much difference between the estimators. The EBLUP
estimators have somewhat smaller
than the estimators based on the YR procedure.
The benchmarked estimator
has the smallest
whereas the
of the benchmarked estimators
and
are identical to those of
The
values associated with estimators
and
are close, whereas estimator
has a somewhat larger relative bias,
especially for larger values of
For all the estimators, the
increase as
increases: slight exceptions occur when
Next, we report on the
As expected, the smallest
are associated with
whereas estimator
has somewhat larger
values due to the use of survey weights under
correct modeling. Benchmarking results in an increase of the
Note that the
of the benchmarked estimators
and
given in Sections 3.1 and 3.2
respectively, are higher than those associated with the restricted methods
and
given in Sections 3.3 and 3.4
respectively. The naïve ratio procedures
and
have
that are quite comparable to those of the
benchmarked that use the restricted methods. The
increase as
increases.
We conclude the following in the case
and when the small area model is correctly
specified. The restricted benchmarked or ratio type estimators perform better
than those that use an augmented model for EBLUP or a modified YR method. When
the restricted or the ratio benchmarking techniques are used, the resulting
estimators have bias values that are similar to those associated with their non
benchmarked versions, whereas their mean squared error values are slightly
larger than those of the non benchmarked versions. The small area estimators and
the GREG estimator
are based on the same auxiliary variables,
whereas the model is correct. Consequently,
and
do not have to be severely modified to achieve
benchmarking to
Table 4.2
(%) and
(%) for Scenario 1: the benchmark to
Table summary
This table displays the results of
(%) and
(%) for Scenario 1: the benchmark to
. The information is grouped by Estimator (appearing as row headers), Measure, Pop A1
, Pop B1
, Pop C1
, Pop D1
and Pop E1
(appearing as column headers).
| Estimator |
Measure |
Pop A1
|
Pop B1
|
Pop C1
|
Pop D1
|
Pop E1
|
|
|
|
1.1 |
1.9 |
2.3 |
2.7 |
2.6 |
|
|
2.7 |
3.4 |
3.9 |
4.9 |
6.5 |
|
|
|
1.2 |
2.0 |
2.4 |
2.9 |
3.1 |
|
|
3.1 |
3.7 |
4.2 |
5.3 |
7.2 |
|
|
|
1.1 |
1.9 |
2.3 |
2.7 |
2.6 |
|
|
3.2 |
3.8 |
4.3 |
5.2 |
6.9 |
|
|
|
1.2 |
2.0 |
2.4 |
2.9 |
3.1 |
|
|
3.1 |
3.7 |
4.3 |
5.3 |
7.4 |
|
|
|
1.0 |
1.6 |
2.1 |
2.4 |
2.3 |
|
|
9.6 |
9.8 |
10.1 |
11.1 |
13.9 |
|
|
|
1.2 |
2.0 |
2.5 |
3.0 |
3.7 |
|
|
3.5 |
4.8 |
5.4 |
11.7 |
14.5 |
|
|
|
1.1 |
1.9 |
2.3 |
2.7 |
2.6 |
|
|
3.2 |
3.8 |
4.3 |
5.3 |
7.0 |
|
|
|
1.2 |
2.0 |
2.4 |
2.9 |
3.2 |
|
|
3.1 |
3.7 |
4.3 |
5.3 |
7.5 |
The results for not using the correct model are given in
Table 4.3. The value of
does not have much impact on the
and
across all estimators. The
and
of the EBLUP estimators, whether they
are benchmarked or not, are higher than those associated with the YR
estimators. It follows that if we have incorrect modeling, the use of the YR
estimators is recommended. Since
and the estimators based on the YR procedure
use the same vector of auxiliary information, it follows that there is not much
difference in terms of
and
between the non benchmarked estimator
and its benchmarked versions,
and
However, it can be noticed that the benchmarked estimator
has the smallest
values, whereas the restricted benchmarked
estimator
has the smallest
Table 4.3
(%) and
(%) for Scenario 2: the benchmark to
Table summary
This table displays the results of
(%) and
(%) for Scenario 2: the benchmark to
. The information is grouped by Estimator (appearing as row headers), Measure, Pop A2
, Pop B2
, Pop C2
, Pop D2
and Pop E2
(appearing as column headers).
| Estimator |
Measure |
Pop A2
|
Pop B2
|
Pop C2
|
Pop D2
|
Pop E2
|
|
|
|
42.3 |
42.7 |
43.2 |
43.0 |
41.5 |
|
|
59.8 |
60.5 |
61.1 |
60.6 |
59.0 |
|
|
|
13.5 |
13.8 |
13.8 |
13.6 |
13.5 |
|
|
42.8 |
43.2 |
43.5 |
43.2 |
42.4 |
|
|
|
42.9 |
43.4 |
43.9 |
43.6 |
42.1 |
|
|
61.2 |
61.9 |
62.7 |
62.1 |
60.3 |
|
|
|
13.8 |
14.1 |
14.1 |
13.9 |
13.8 |
|
|
43.9 |
44.4 |
44.7 |
44.4 |
43.5 |
|
|
|
19.8 |
20.2 |
20.2 |
20.2 |
19.6 |
|
|
66.2 |
66.7 |
67.6 |
67.3 |
66.6 |
|
|
|
10.9 |
10.6 |
11.5 |
12.5 |
10.7 |
|
|
47.3 |
47.6 |
48.1 |
47.9 |
47.8 |
|
|
|
41.2 |
41.8 |
41.8 |
41.7 |
40.6 |
|
|
58.2 |
59.0 |
59.1 |
58.9 |
57.4 |
|
|
|
12.5 |
12.7 |
12.6 |
12.5 |
12.5 |
|
|
42.4 |
42.9 |
43.1 |
42.9 |
42.1 |
Benchmarking to
(the case
)
The
results of this case are given in Tables 4.4 and 4.5 for Scenarios 1 and 2, respectively. The weighting is with
respect to
given by (3.2) .We investigated the following
four estimators
and
that are benchmarked to
The first two estimators,
and
are given by equations (3.5) and (3.13)
respectively, while the last two,
and
are given by equations (3.5) and (3.15).
In
Table 4.4, we summarize the average ARB and RRMSE values when the model is
correct. That is, both the sample and the population data respect model (4.1).
We first discuss their properties in terms of the
Comparing the
across all the estimators in Table 4.4,
we observe once more that, under correct modeling, the original EBLUP
estimator,
has the smallest
The
increase when benchmarking is required, and
this is different from what we noticed from Table 4.2. There is not much
difference in terms of
between the benchmarked estimators obtained
using ratio adjustment methods,
and
and those obtained by restricted methods,
and
The
increase as
increases: slight exceptions occur when
Next,
we report on the
As expected, the smallest
are associated with
which is optimal under correct modeling.
Benchmarking results in an increase of
Note that the
associated with all four benchmarking
procedures in Table 4.4 are quite high compared to the
associated with the non benchmarked estimators
and
The estimators
and
have similar efficiency, whereas
and
have
values that are somewhat larger than those of
and
The
increase as
increases.
When
there are larger differences between the small
area estimators based on model (2.2) that uses the vector
and the GREG estimator that uses
Notice that we considered a somewhat extreme
situation when
and
have no variable in common. It follows that
the modifications needed to obtain benchmarked estimators are more accentuated
in this case as compared to the case
This explains why in Table 4.4 the
benchmarked estimators have significantly larger
and
values than the estimators that are not
benchmarked to
Table 4.4
(%) and
(%) for Scenario 1: the benchmark to
Table summary
This table displays the results of
(%) and
(%) for Scenario 1: the benchmark to
. The information is grouped by Estimator (appearing as row headers), Measure, Pop A1
, Pop B1
, Pop C1
, Pop D1
and Pop E1
(appearing as column headers).
| Estimator |
Measure |
Pop A1
|
Pop B1
|
Pop C1
|
Pop D1
|
Pop E1
|
|
|
|
1.1 |
1.9 |
2.3 |
2.7 |
2.6 |
|
|
2.7 |
3.4 |
3.9 |
4.9 |
6.5 |
|
|
|
1.2 |
2.0 |
2.4 |
2.9 |
3.1 |
|
|
3.1 |
3.7 |
4.2 |
5.3 |
7.2 |
|
|
|
4.2 |
4.3 |
4.5 |
4.9 |
4.6 |
|
|
13.0 |
13.2 |
13.5 |
14.0 |
14.6 |
|
|
|
4.2 |
4.3 |
4.5 |
5.0 |
4.8 |
|
|
13.0 |
13.2 |
13.5 |
14.0 |
14.0 |
|
|
|
4.2 |
4.3 |
4.5 |
5.0 |
4.8 |
|
|
13.1 |
13.3 |
13.5 |
14.1 |
15.0 |
|
|
|
4.2 |
4.3 |
4.6 |
5.1 |
5.0 |
|
|
13.5 |
13.7 |
13.8 |
14.5 |
16.2 |
The
impact of using an incorrect model is given in Table 4.5. We see that
suffers the most in terms of both
and
because the EBLUP procedure assumes that the
model is correct. The benchmarked versions of EBLUP,
and
also have high
and
Although
the original You and Rao (2002)
estimator,
has much smaller
than the EBLUP estimator, its
is fairly high. The
and
associated with the ratio benchmarked version of
are a bit higher than those associated with
The benchmarked YR estimator,
which is based on the restricted procedure
given in Section 3.4, has an
that is the smallest amongst the estimators in
Table 4.5. Due to benchmarking, its
is slightly larger than the one associated
with
Table 4.5
(%) and
(%) for Scenario 2: the benchmark to
Table summary
This table displays the results of
(%) and
(%) for Scenario 2: the benchmark to
. The information is grouped by Estimator (appearing as row headers), Measure, Pop A2
, Pop B2
, Pop C2
, Pop D2
and Pop E2
(appearing as column headers).
| Estimator |
Measure |
Pop A2
|
Pop B2
|
Pop C2
|
Pop D2
|
Pop E2
|
|
|
|
42.3 |
42.6 |
43.2 |
43.0 |
41.6 |
|
|
59.8 |
60.4 |
61.1 |
60.7 |
59.1 |
|
|
|
13.6 |
13.6 |
13.9 |
13.7 |
13.5 |
|
|
42.8 |
43.1 |
43.5 |
43.3 |
42.4 |
|
|
|
43.8 |
44.4 |
44.9 |
44.6 |
43.3 |
|
|
65.4 |
66.1 |
67.0 |
66.4 |
64.5 |
|
|
|
15.0 |
15.2 |
15.6 |
15.2 |
14.9 |
|
|
47.9 |
48.2 |
48.7 |
48.3 |
47.3 |
|
|
|
37.3 |
38.0 |
38.1 |
37.8 |
37.1 |
|
|
57.4 |
58.2 |
58.5 |
58.2 |
56.7 |
|
|
|
9.9 |
10.1 |
10.4 |
10.0 |
10.1 |
|
|
43.4 |
43.8 |
44.2 |
43.9 |
43.1 |