5 Simulation study

J.N.K. Rao, F. Verret and M.A. Hidiroglou

We conducted a small simulation study on the performance of the proposed WEE estimators under the simple nested error mean model, using $μ = 0.5$ , $σ_{v}^{2} = 0.5$ and $σ_{e}^{2} = 2.0.$ The population consists of $N =$ 1,000 clusters, each containing $M_{i} = M = 100$ elements. A two-stage sampling design with $n =$ 50 sample clusters and $m_{i} = m = 5$ sample elements from each sample cluster is used. Clusters are selected by simple random sampling, and the elements within clusters by the Rao-Sampford probability proportional to size (PPS) sampling method (Rao 1965 and Sampford 1967) with specified size measures $z_{i j} .$ The size measures are chosen to reflect different levels of informativeness.

Following Asparouhov (2006), we considered both invariant and non-invariant selections. For invariant selection, the size measure $z_{i j}$ depends only on the level 1 errors and is invariant across clusters. In particular, we let

$z_{i j} = {(1 + \exp {- 0.5 [e_{i j} / α + e_{i j}^{*} {(1 - α^{- 2})}^{1 / 2}]})}^{- 1}, (5.1)$

where $e_{i j}^{*}$ is independent of $e_{i j}$ but with the same distribution, $N (0, σ_{e}^{2} = 2.0)$ . For non-invariant selection, the size measure $z_{i j}$ depends on both level 1 and level 2 errors and hence non-invariant across clusters. In particular, we replace $e_{i j}$ and $e_{i j}^{*}$ in (3.7) by $v_{i} + e_{i j}$ and $v_{i}^{*} + e_{i j}^{*}$ respectively, where $v_{i}^{*}$ is independent of $v_{i}$ but with the same distribution $N (0, σ_{v}^{2} = 0.5)$ . We considered four values of $α$ in (5.1): $α = 1, 2, 3, \infty,$ where $α = \infty$ corresponds to non-informative sampling within each cluster, $α = 1$ corresponds to the most informative sampling and informativeness decreases as $α$ increases.

We used the design-model ( $p m$ ) approach to simulate $R = 1, 000$ samples for each specified $α$ and separately for invariant and non-invariant selections. Under this approach, we generated a population with $N = 1, 000$ and $M_{i} = M = 100$ from the model and then selected a two-stage sample of elements as specified above. The two-step process was repeated $R = 1, 000$ times to simulate $1, 000$ samples.

5.1 Performance of estimators

From each sample, we computed the estimates of $μ, σ_{v}^{2}$ and $σ_{e}^{2}$ using REML, weighted scaling methods A and A1, the proposed WEE method and the alternative method of Korn and Graubard (abbreviated KG). Biases and variances of the estimators were computed from the $1, 000$ estimates. Performance of alternative estimators is judged using two performance measures: Bias ratio = BR = (Bias)/ (square root of variance) and relative root mean squared error = RRMSE = (square root of MSE)/ (true parameter value). Tables 5.1, 5.2 and 5.3 respectively report the BR values of the estimators of $μ, σ_{v}^{2}$ and $σ_{e}^{2}$ . RRMSE values of the estimators of $μ, σ_{v}^{2}$ and $σ_{e}^{2}$ are reported in Tables 5.4, 5.5 and 5.6 respectively.

Table 5.1
Bias ratio (%) of estimators of $μ$
Table summary
This table displays bias ratio (%) of estimators of $μ$ . The information is grouped by $α$ (appearing as row headers), invariant and non-invariant (appearing as column headers).
$α$	Invariant			Non-invariant
$α$	REML	A	A1/WEE/KG	REML	A	A1/WEE/KG
1	346.5	80.2	2.2	370.9	83.9	3.0
2	167.7	40.1	0.3	172.3	45.3	6.1
3	114.3	30.7	4.5	114.9	30.8	4.8
$\infty$	2.0	2.5	2.1	-1.5	-2.4	-2.2

Table 5.1 reports bias ratio (%) of the estimators of $μ$ based on REML, weight-scaling methods A and A1, KG and WEE. Note that in the case of $μ$ , estimators A1, KG and WEE (WCL) are identical. Results in Table 5.1 show that BR is similar for invariant and non-invariant selections and that BR of REML and A decrease as $α$ increases. Further, REML leads to large bias under informative sampling, even for $α = 3$ ; for example, BR for REML ranges from 114% to 346% under invariant selection. Method A also leads to significant BR under informative sampling; for example BR for A ranges from 30.8% to 83.9% under non-invariant selection. On the other hand, BR of WEE, A1 and KG does not depend on $α$ and it is small ( $| B R | < 6 %$ ). Under non-informative sampling, REML performs well as expected ( $| B R | < 3 %$ ).

Table 5.2
Bias ratio (%) of estimators of $σ_{v}^{2}$
Table summary
This table displays Bias ratio (%) of estimators of $σ_{v}^{2}$ . The information is grouped by $α$ (appearing as row headers), REML, A, A1, WEE, KG (appearing as column headers).
$α$	REML	A	A1	WEE	KG
Invariant Selection
1	0.6	59.5	59.3	-8.5	33.2
2	0.5	24.5	26.3	-10.0	8.0
3	-3.4	16.1	18.2	-13.6	0.4
$\infty$	-0.1	14.8	17.1	-8.9	0.6
Non-invariant Selection
1	-49.0	50.1	58.9	-4.4	24.0
2	-10.9	24.6	28.7	-7.0	7.1
3	-4.0	20.0	22.7	-7.8	4.6
$\infty$	-1.3	12.8	13.9	-13.3	-1.6

Turning to the estimation of $σ_{v}^{2}$ , we first note that the proportion of times the estimate of $σ_{v}^{2}$ is negative is zero in the simulations for all four values of $α$ and for all the estimation methods (REML, A, A1, WEE and KG). Table 5.2 reports BR values of the estimators of $σ_{v}^{2}$ . It shows that the BR of REML is not affected by $α$ under invariant selection, but is affected under non-invariant selection. In the latter case, REML leads to serious underestimation for $α = 1$ (BR= -49%) but |BR| decreases as $α$ increases. Table 5.2 also shows that methods A and A1 do not perform well under informative sampling (BR ranging from 16% to 60%). KG did not perform well for $α = 1$ (BR=33% under invariant selection and BR=24% under non-invariant selection). On the other hand, WEE performs well for all values of $α$ (BR ranging from -4% to -13%) although underestimation is consistent across values of $α$ .

Table 5.3
Bias ratio (%) of estimators of $σ_{e}^{2}$
Table summary
This table displays Bias ratio (%) of estimators of $σ_{e}^{2}$ . The information is grouped by $α$ (appearing as row headers), REML, A, A1, WEE, KG (appearing as column headers).
$α$	REML	A	A1	WEE	KG
Invariant Selection
1	-106.9	-118.4	-66.9	2.4	-71.2
2	-22.7	-43.6	-34.3	2.1	-16.5
3	-9.4	-31.7	-28.4	2.9	-6.5
$\infty$	-0.4	-21.8	-23.8	0.3	0.4
Non-invariant Selection
1	-115.3	-131.3	-79.6	-6.9	-82.6
2	-30.4	-51.1	-43.3	-7.6	-23.9
3	-12.5	-34.9	-32.2	-2.3	-10.3
$\infty$	1.1	-20.2	-21.8	2.6	1.6

Table 5.3 reports BR values of the estimators of $σ_{e}^{2}$ . It shows that BR values are similar for invariant and non-invariant selections, as in the case of $μ$ . REML and KG lead to serious underestimation when $α = 1$ (BR= -107% for REML and BR= -71% for KG under invariant selection), but |BR| decreases as $α$ increases and becomes negligible for $α = \infty$ . Estimators A and A1 perform poorly for all values of $α$ including $α = \infty$ . On the other hand, WEE performs well for all values of $α$ with |BR|<8%. It appears that the instability introduced by the scale factor (2.9) might have contributed to the large |BR| for methods A and A1 even for the case of non-informative sampling $(α = \infty)$ .

Table 5.4
Relative root mean squared error (%) of estimators of $μ$
Table summary
This table displays relative root mean squared error (%) of estimators of $μ$ . The information is grouped by $α$ (appearing as row headers), invariant, non-invariant (appearing as column headers).
$α$	Invariant			Non-invariant
$α$	REML	A	A1/WEE/KG	REML	A	A1/WEE/KG
1	93.3	35.9	29.4	92.5	35.4	29.2
2	51.6	29.3	27.8	52.8	30.4	28.9
3	40.5	28.2	27.5	40.8	28.7	28.1
$\infty$	25.8	26.1	26.5	26.6	27.3	27.7

Relative root mean squared error

Table 5.4 shows that the RRMSE (%) values for estimators of $μ$ are similar for invariant and non-invariant selections and that RRMSE of REML and A decrease as $α$ increases. For informative sampling with $α = 1$ , RRMSE for REML is large relative to RRMSE for WEE (A1 and KG) due to large BR. For example, RRMSE=93% for REML compared to RRMSE=29% for WEE. As expected, REML has the smallest RRMSE under non-informative sampling, but the increase in RRMSE for the other methods is quite small. Also, RRMSE of WEE (A1 and KG) depends on $α$ .

Table 5.5
Relative root mean squared error (%) of estimators of $σ_{v}^{2}$
Table summary
This table displays Relative root mean squared error (%) of estimators of $σ_{v}^{2}$ . The information is grouped by $α$ (appearing as row headers), REML, A, A1, WEE, KG (appearing as column headers).
$α$	REML	A	A1	WEE	KG
Invariant Selection
1	36.5	47.3	51.1	43.6	43.8
2	37.1	39.7	41.1	40.5	39.5
3	36.3	37.3	38.7	39.5	37.8
$\infty$	35.8	36.9	38.1	38.7	37.2
Non-invariant Selection
1	36.7	44.6	52.6	43.4	41.5
2	35.6	37.9	40.4	39.3	37.7
3	37.0	38.7	40.4	40.2	38.8
$\infty$	36.6	37.2	38.0	39.0	37.8

Turning to RRMSE of estimators of $σ_{v}^{2}$ , Table 5.5 shows that REML performs well for all $α$ under invariant selection due to small BR in this case. We also note that KG and WEE are comparable in terms of RRMSE for all values of $α$ . Table 5.5 also shows that A and A1 lead to somewhat larger RRMSE for $α = 1$ : 51% for A1 and 47% for A under invariant selection compared to 44% for WEE.

Table 5.6
Relative root mean squared error (%) of estimators of $σ_{e}^{2}$
Table summary
This table displays relative root mean squared error (%) of estimators of $σ_{e}^{2}$ . The information is grouped by $α$ (appearing as row headers), REML, A, A1, WEE, KG (appearing as column headers).
$α$	REML	A	A1	WEE	KG
Invariant Selection
1	13.5	14.5	12.8	13.9	12.9
2	9.7	10.4	10.4	11.0	10.0
3	9.5	10.0	10.1	10.7	9.8
$\infty$	10.1	10.3	10.5	11.1	10.3
Non-invariant Selection
1	13.7	14.8	12.9	13.2	13.0
2	10.0	10.9	10.9	11.3	10.3
3	9.7	10.4	10.7	11.2	10.2
$\infty$	10.3	10.6	10.8	11.4	10.7

Table 5.6 gives RRMSE values of the estimators of $σ_{e}^{2}$ and we note that the values are similar for invariant and non- invariant selections. It also shows that RRMSE values are comparable for methods WEE, A, Al and KG even though in terms of bias ratio A, Al and KG performed poorly relative to WEE. This is due to larger variance for WEE compared to other methods. For example, in the case of invariant selection and $α = 1$ we have the following variances for WEE, KG and REML: 0.0771, 0.0438 and 0.0339 with corresponding bias ratios (%) from Table 5.3: 2.4, -71.2, and -106.9.

5.2 Performance of variance estimator

We now report some simulation results on the relative bias of the linearization variance estimator (3.12) of the WEE (WCL) estimator ${\hat{θ}}_{w}$ . We first repeated the two-step process $R_{1} = 2, 000$ times and computed $v_{L}^{(r)} ({\hat{θ}}_{w})$ from each two-stage sample $r = 1, ..., 2, 000.$ The averages of the diagonal elements of $E {v_{L} ({\hat{θ}}_{w})} \approx v_{L} ({\hat{θ}}_{w}) = R_{1}^{- 1} \sum_{r = 1}^{R_{1}} v_{L}^{(r)} ({\hat{θ}}_{w})$ are denoted by ${\bar{v}}_{L} ({\hat{μ}}_{w}), {\bar{v}}_{L} ({\hat{σ}}_{v w}^{2})$ and ${\bar{v}}_{L} ({\hat{σ}}_{e w}^{2})$ respectively. We then generated $R_{2} = 10, 000$ independent samples and computed the empirical mean squared error (MSE) of the three estimators ${\hat{μ}}_{w}, {\hat{σ}}_{v w}^{2}$ and ${\hat{σ}}_{e w}^{2} .$ We have $MSE ({\hat{μ}}_{w}) \approx R_{2}^{- 1} {\sum_{r = 1}^{R_{2}} ({\hat{μ}}_{w}^{(r)} - μ)}^{2}$ where ${\hat{μ}}_{w}^{(r)}$ is the estimate of $μ$ from the r-th simulated sample, and similar expressions for $MSE ({\hat{σ}}_{v w}^{2})$ and $MSE ({\hat{σ}}_{e w}^{2}) .$

The relative bias of $v_{L} ({\hat{μ}}_{w})$ is calculated as

$RB {v_{L} ({\hat{μ}}_{w})} = [{\bar{v}}_{L} ({\hat{μ}}_{w}) / MSE ({\hat{μ}}_{w})] - 1$

and similarly $RB {v_{L} ({\hat{σ}}_{v w}^{2})}$ and $RB {v_{L} ({\hat{σ}}_{e w}^{2})}$ were calculated. Table 5.7 reports the RB values of the three variance estimators for invariant and non-invariant selections and $α = 1, 2, 3, \infty .$ It is clear from Table 5.7 that the linearization variance estimator performs well over all combinations with $| RB | < 10 % .$

Table 5.7
Relative bias (%) of variance estimators
Table summary
This table displays relative bias (%) of variance estimators. The information is grouped by $α$ (appearing as row headers), $v_{L} ({\hat{μ}}_{w})$ , $v_{L} ({\hat{σ}}_{v w}^{2})$ , $v_{L} ({\hat{σ}}_{e w}^{2})$ (appearing as column headers).
$α$	$v_{L} ({\hat{μ}}_{w})$	$v_{L} ({\hat{σ}}_{v w}^{2})$	$v_{L} ({\hat{σ}}_{e w}^{2})$
Invariant Selection
1	-3.0	-6.2	-7.5
2	-5.2	-4.5	-3.1
3	-1.3	-3.8	-1.8
$\infty$	-0.9	-2.5	-2.0
Non-invariant Selection
1	-3.8	-8.3	-4.2
2	-4.5	-5.8	-7.3
3	-4.3	-4.6	-5.7
$\infty$	-2.4	-2.7	-2.9

Previous | Next

Date modified:: 2017-09-20

Language selection

Search and menus

Search

Publications

Survey Methodology

Browse by

5 Simulation study

5.1 Performance of estimators

5.2 Performance of variance estimator