Sample empirical likelihood approach under complex survey design with scrambled responses
Section 4. Simulation study
In the simulation study,
we consider finite population
for
10,000.
is uniformly
distributed over [0, 1] and
with
Four functions
are listed
below:
(A).
(B).
(C).
(D).
where
is the binary indicator function for condition
such that
if condition
is satisfied and 0 otherwise.
We generated
5,000 Monte
Carlo samples from Poisson sampling with inclusion probabilities
where the size
variable
with
We considered
sample sizes
40, 50, 100 and
200. For each Monte Carlo sample, the
scrambled responses
were generated
with
0.6, and
Suppose we only
observe
and
in the sample.
The performance of the HJ estimator and the proposed SEL estimator were
compared with the estimate population mean of
which is
The results are
shown in Table 4.1.
We computed Monte Carlo bias
Monte Carlo
standard error
with
and Monte Carlo
mean squared error
For variance
estimation, we calculated coverage rate, average length of interval estimates,
and percentage of relative bias of variance estimators
Results
obtained from the simulation are given in Table 4.1.
Table 4.1
Simulation results of Monte Carlo bias (MCB), Monte Carlo standard error (MCSE), and Monte Carlo mean squared error (MCMSE), coverage rate, average length of 95% confidence intervals, and relative bias (RB) for the Hájek (HJ) estimator and sample empirical likelihood (SEL) estimator
Table summary
This table displays the results of Simulation results of Monte Carlo bias (MCB). The information is grouped by Setting (appearing as row headers), MCB, MCSE, MCMSE, Coverage Rate, Avg Length and RB (appearing as column headers).
| Setting |
MCB |
MCSE |
MCMSE |
Coverage Rate |
Avg Length |
RB |
| Model |
|
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
|
|
40 |
0.0035 |
0.0005 |
0.123 |
0.076 |
0.015 |
0.006 |
0.936 |
0.940 |
0.470 |
0.283 |
-0.027 |
-0.075 |
| 50 |
0.0026 |
0.0006 |
0.110 |
0.069 |
0.012 |
0.005 |
0.939 |
0.941 |
0.420 |
0.255 |
-0.024 |
-0.078 |
| 100 |
0.0009 |
0.0003 |
0.077 |
0.048 |
0.006 |
0.002 |
0.946 |
0.950 |
0.300 |
0.183 |
0.007 |
-0.000 |
| 200 |
0.0006 |
-0.0002 |
0.054 |
0.033 |
0.003 |
0.001 |
0.944 |
0.954 |
0.211 |
0.130 |
-0.010 |
0.000 |
|
|
40 |
0.0006 |
0.0007 |
0.083 |
0.085 |
0.007 |
0.007 |
0.937 |
0.937 |
0.319 |
0.314 |
-0.020 |
-0.098 |
| 50 |
-0.0004 |
-0.0008 |
0.074 |
0.075 |
0.005 |
0.006 |
0.939 |
0.944 |
0.286 |
0.283 |
-0.014 |
-0.066 |
| 100 |
-0.0002 |
-0.0001 |
0.053 |
0.053 |
0.003 |
0.003 |
0.941 |
0.947 |
0.203 |
0.203 |
-0.036 |
-0.057 |
| 200 |
-0.0007 |
-0.0006 |
0.037 |
0.037 |
0.001 |
0.001 |
0.945 |
0.949 |
0.144 |
0.144 |
0.002 |
-0.013 |
|
|
40 |
0.0022 |
0.0011 |
0.138 |
0.091 |
0.019 |
0.008 |
0.926 |
0.939 |
0.512 |
0.344 |
-0.081 |
-0.068 |
| 50 |
0.0056 |
0.0028 |
0.119 |
0.081 |
0.014 |
0.007 |
0.941 |
0.942 |
0.460 |
0.312 |
-0.018 |
-0.045 |
| 100 |
0.0011 |
0.0003 |
0.084 |
0.058 |
0.007 |
0.003 |
0.945 |
0.943 |
0.327 |
0.222 |
-0.011 |
-0.053 |
| 200 |
-0.0002 |
-0.0006 |
0.059 |
0.041 |
0.003 |
0.002 |
0.950 |
0.952 |
0.230 |
0.157 |
-0.010 |
-0.028 |
|
|
40 |
0.0040 |
0.0012 |
0.119 |
0.080 |
0.014 |
0.006 |
0.938 |
0.937 |
0.460 |
0.296 |
-0.007 |
-0.089 |
| 50 |
0.0008 |
0.0002 |
0.107 |
0.071 |
0.012 |
0.005 |
0.943 |
0.943 |
0.413 |
0.267 |
-0.020 |
-0.069 |
| 100 |
0.0007 |
0.0006 |
0.075 |
0.049 |
0.006 |
0.002 |
0.942 |
0.945 |
0.293 |
0.190 |
-0.013 |
-0.036 |
| 200 |
-0.0003 |
-0.0002 |
0.053 |
0.034 |
0.003 |
0.001 |
0.946 |
0.957 |
0.206 |
0.135 |
-0.018 |
0.029 |
For model
and
SEL has a
smaller Monte Carlo bias, Monte Carlo standard error, and Monte Carlo mean
squared error, especially for small sample sizes
40 or 50). For model
the two methods
have comparable performance. For all four models, we found that, for most of
the cases (14 of 16) the SEL estimators had a coverage rate higher than or
equal to that of the HJ estimator, while the average length of confidence
interval was shorter compared with the average length obtained with the HJ
estimator. Both methods provided small relative biases of variance estimators.
Overall, the proposed SEL outperformed HJ for most cases.
To test the sensitivity
of the proposed approach, under current simulation study setups, we added
noise,
to the
simulation. Then,
with
0, 0.1, 0.3, 0.5, 0.7, 0.9, 1,
and
Suppose we only
observe
and
(the scrambled
response of
in the sample,
the HJ estimator and SEL estimator were again compared. The results are shown
in Tables 4.2 and 4.3. We found that as
decreases, the
coverage rates of the SEL estimator are smaller than those of the HJ estimator,
and the average length of CI for SEL estimator is not shorter than that of the
HJ estimator. Therefore, the SEL estimator has better performance than the HJ
estimator, provided that most of the information is contained in the current
covariate.
Table 4.2
Simulation results of the Hájek (HJ) estimator and sample empirical likelihood (SEL) estimator after adding noise
Table summary
This table displays the results of Simulation results of the Hájek (HJ) estimator and sample empirical likelihood (SEL) estimator after adding noise. The information is grouped by Setting (appearing as row headers),
,
and
(appearing as column headers).
| Setting |
|
|
|
| Coverage Rate |
Avg Length |
Coverage Rate |
Avg Length |
Coverage Rate |
Avg Length |
| Model |
|
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
|
|
40 |
0.924 |
0.903 |
1.419 |
1.368 |
0.926 |
0.911 |
1.289 |
1.251 |
0.938 |
0.928 |
1.045 |
1.022 |
| 50 |
0.926 |
0.915 |
1.292 |
1.256 |
0.928 |
0.920 |
1.146 |
1.125 |
0.937 |
0.930 |
0.958 |
0.938 |
| 100 |
0.940 |
0.935 |
0.927 |
0.927 |
0.941 |
0.935 |
0.839 |
0.838 |
0.948 |
0.943 |
0.679 |
0.668 |
| 200 |
0.949 |
0.941 |
0.651 |
0.657 |
0.942 |
0.943 |
0.589 |
0.593 |
0.948 |
0.948 |
0.478 |
0.469 |
|
|
40 |
0.942 |
0.943 |
1.872 |
1.909 |
0.929 |
0.930 |
1.328 |
1.358 |
0.933 |
0.933 |
1.455 |
1.458 |
| 50 |
0.935 |
0.937 |
1.704 |
1.732 |
0.933 |
0.937 |
1.181 |
1.206 |
0.931 |
0.935 |
1.327 |
1.325 |
| 100 |
0.941 |
0.947 |
1.191 |
1.202 |
0.942 |
0.949 |
0.843 |
0.854 |
0.945 |
0.948 |
0.931 |
0.927 |
| 200 |
0.949 |
0.952 |
0.841 |
0.845 |
0.949 |
0.955 |
0.593 |
0.597 |
0.948 |
0.948 |
0.645 |
0.640 |
|
|
40 |
0.917 |
0.899 |
1.438 |
1.382 |
0.925 |
0.906 |
1.313 |
1.273 |
0.933 |
0.922 |
1.044 |
1.020 |
| 50 |
0.922 |
0.908 |
1.297 |
1.264 |
0.928 |
0.916 |
1.154 |
1.131 |
0.939 |
0.935 |
0.927 |
0.911 |
| 100 |
0.937 |
0.928 |
0.960 |
0.958 |
0.941 |
0.935 |
0.838 |
0.838 |
0.940 |
0.938 |
0.660 |
0.654 |
| 200 |
0.940 |
0.940 |
0.674 |
0.679 |
0.945 |
0.944 |
0.615 |
0.619 |
0.945 |
0.941 |
0.474 |
0.467 |
|
|
40 |
0.903 |
0.885 |
1.226 |
1.167 |
0.912 |
0.894 |
0.994 |
0.947 |
0.927 |
0.909 |
0.518 |
0.511 |
| 50 |
0.921 |
0.912 |
1.093 |
1.057 |
0.917 |
0.912 |
0.902 |
0.870 |
0.928 |
0.918 |
0.460 |
0.457 |
| 100 |
0.931 |
0.925 |
0.805 |
0.802 |
0.936 |
0.935 |
0.646 |
0.644 |
0.935 |
0.931 |
0.337 |
0.338 |
| 200 |
0.941 |
0.939 |
0.581 |
0.585 |
0.936 |
0.939 |
0.460 |
0.462 |
0.945 |
0.946 |
0.236 |
0.237 |
Table 4.3
Simulation results of the Hájek (HJ) estimator and sample empirical likelihood (SEL) estimator after adding noise
Table summary
This table displays the results of Simulation results of the Hájek (HJ) estimator and sample empirical likelihood (SEL) estimator after adding noise. The information is grouped by Setting (appearing as row headers),
,
and
(appearing as column headers).
| Setting |
|
|
|
| Coverage Rate |
Avg Length |
Coverage Rate |
Avg Length |
Coverage Rate |
Avg Length |
| Model |
|
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
HJ |
SEL |
|
|
40 |
0.934 |
0.933 |
1.002 |
0.934 |
0.933 |
0.935 |
1.091 |
0.959 |
0.937 |
0.940 |
1.292 |
1.096 |
| 50 |
0.935 |
0.936 |
0.902 |
0.841 |
0.939 |
0.935 |
0.979 |
0.862 |
0.936 |
0.938 |
1.156 |
0.986 |
| 100 |
0.947 |
0.948 |
0.635 |
0.596 |
0.944 |
0.949 |
0.697 |
0.616 |
0.946 |
0.948 |
0.820 |
0.705 |
| 200 |
0.951 |
0.949 |
0.451 |
0.421 |
0.947 |
0.945 |
0.493 |
0.437 |
0.951 |
0.951 |
0.579 |
0.500 |
|
|
40 |
0.933 |
0.936 |
2.371 |
2.139 |
0.938 |
0.934 |
3.418 |
2.469 |
0.933 |
0.942 |
5.095 |
2.980 |
| 50 |
0.940 |
0.941 |
2.148 |
1.938 |
0.940 |
0.937 |
3.057 |
2.210 |
0.945 |
0.944 |
4.583 |
2.687 |
| 100 |
0.939 |
0.941 |
1.493 |
1.345 |
0.948 |
0.946 |
2.196 |
1.588 |
0.948 |
0.951 |
3.223 |
1.916 |
| 200 |
0.942 |
0.942 |
1.054 |
0.938 |
0.944 |
0.947 |
1.545 |
1.113 |
0.949 |
0.947 |
2.264 |
1.356 |
|
|
40 |
0.939 |
0.935 |
1.004 |
0.940 |
0.935 |
0.937 |
1.101 |
0.970 |
0.939 |
0.947 |
1.288 |
1.093 |
| 50 |
0.937 |
0.935 |
0.890 |
0.832 |
0.938 |
0.942 |
0.978 |
0.864 |
0.936 |
0.940 |
1.152 |
0.982 |
| 100 |
0.946 |
0.945 |
0.635 |
0.595 |
0.951 |
0.952 |
0.698 |
0.616 |
0.948 |
0.952 |
0.821 |
0.706 |
| 200 |
0.949 |
0.950 |
0.450 |
0.420 |
0.943 |
0.948 |
0.493 |
0.437 |
0.952 |
0.952 |
0.579 |
0.500 |
|
|
40 |
0.937 |
0.942 |
0.365 |
0.358 |
0.936 |
0.941 |
0.362 |
0.354 |
0.932 |
0.938 |
0.362 |
0.354 |
| 50 |
0.935 |
0.939 |
0.326 |
0.322 |
0.939 |
0.943 |
0.325 |
0.320 |
0.938 |
0.947 |
0.324 |
0.320 |
| 100 |
0.941 |
0.948 |
0.232 |
0.230 |
0.948 |
0.953 |
0.230 |
0.229 |
0.941 |
0.946 |
0.231 |
0.229 |
| 200 |
0.947 |
0.948 |
0.165 |
0.164 |
0.942 |
0.944 |
0.163 |
0.163 |
0.949 |
0.951 |
0.163 |
0.163 |
ISSN : 1492-0921
Editorial policy
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.
© Her Majesty the Queen in Right of Canada as represented by the Minister of Industry, 2021
Use of this publication is governed by the Statistics Canada Open Licence Agreement.
Catalogue No. 12-001-X
Frequency: Semi-annual
Ottawa