On combining independent probability samples
Section 4. Simulation examples
Two Monte-Carlo simulation examples are presented here. In the first example we combine two Poisson samples using inclusion probabilities approximately proportional to the target variable. In the second example we combine an unstratified simple random sample with a stratified simple random sample.
4.1 Combining two Poisson samples
We generate a population of size with an auxiliary variable The target variable is generated as where Two sets of inclusion probabilities are generated where and We let the expected sample sizes be and For simplicity we let both designs be Poisson designs (where units are selected independently). This allows us to calculate exactly the variances for both separate estimators (and thus the optimal linear combination) and for the combined samples with single and multiple count. For the strategies with linear combination using estimated variances, we performed a Monte-Carlo simulation with 1,000,000 repeated sample selections. True variances for the two separate HT estimators, the SC/MC estimators for the combined samples and the optimal linear combination of the separate estimators are presented (Table 4.1). Simulation results for the different linear combinations with estimates variances are also presented (Table 4.1).
| Estimator | Bias (Rel. bias) | MSE |
|---|---|---|
| 0 | 1,053,083 | |
| 0 | 596,069 | |
| 0 | 361,088 | |
| 0 | 380,929 | |
| Optimal | 0 | 380,626 |
| Separate | -92.8 (-2.24%) | 412,248 |
| Pooled SC | 1.6 (+0.04%) | 381,106 |
| Pooled MC | 1.6 (+0.04%) | 381,106 |
Using combined (pooled) variance estimators reduced both the bias and the variance for a linear combination in comparison to using separate variance estimators. For this example, the linear combination with pooled variance estimation came very close to the optimal linear combination in performance. The negative bias with separate variance estimators is mainly due to a positive correlation between the total estimator and its variance estimator under the Poisson design. For this setting, the best result was obtained by combining the samples using a single count.
4.2 Combining an unstratified SRS with a stratified SRS
Here we generated a population of size with two strata of sizes and The target variable was generated as follows. In stratum 1 there were 500 equal to zero and the other 100 were drawn from In stratum 2 there were 300 equal to zero and the other 100 were drawn from The first sample is an unstratified simple random sample of size and the second sample is a stratified simple random sample with stratum sample sizes and The variances for both separate HT estimators and for the combined samples with single and multiple count were calculated exactly. A Monte-Carlo simulation with 10,000 repetitions was performed to evaluate the performance of a linear combination estimator with estimated variances. The results are presented in Table 4.2. Bias is reduced by using a linear combination with pooled variance estimators compared with using separate variance estimators. Also for this setting, the best result was obtained by combining the samples using a single count.
| Estimator | Bias (Rel. bias) | MSE |
|---|---|---|
| 0 | 516,835 | |
| 0 | 498,321 | |
| 0 | 248,888 | |
| 0 | 253,789 | |
| Optimal | 0 | 253,704 |
| Separate | -77 (-3%) | 287,680 |
| Pooled SC | 9 (+0.4%) | 257,229 |
| Pooled MC | 9 (+0.4%) | 257,217 |
- Date modified: