Model based inference using ranked set samples
Section 5. Empirical results
In
this section, we conduct a simulation study to check the finite sample
properties of the estimator for different values of simulation parameters. Data
sets are generated from normal
and log normal
super populations. We consider two different
finite populations with population sizes
150 and
1,000 to see the
impact of population sizes on the estimators. Sample and set size combinations
are selected to be (10, 2), (15, 3),
(20, 4), (25, 5). The quality of ranking information is modeled
through a perceptual error model in Dell and Clutter (1972). The Dell and Clutter
model considers two variables, the variable of interest
and a correlated ranking variable
The ranking variable is modeled through an
additive model
where
is a random noise generated independently with
respect to
To implement the perceptual error model, we
generate a set (size
of simple random sample,
from the true population of interest with mean
and variance
Another set (size
of random numbers are generated from a normal
distribution with mean zero and variance,
The perceptual error model is then defined by
The random numbers
are ranked with respect to the first
components
and the second components are taken to be the
judgment ranked order statistics
The quality of the ranking information is
controlled by the correlation coefficient between
and
Since the units are ranked based on
concomitant variable
the ranking model is equivalent to concomitant
ranking in Section 3. In the simulation study, we used
for perfect ranking and
0.75, 0.50
for imperfect ranking.
In
each replication of the simulation, a finite population of size
is generated from the normal super population
with specified mean
and standard deviation
A ranked set sample is then constructed from
this finite population, a realization from normal super population, with
specified set and cycle sizes. The quality of ranking information in each RSS
sample is controlled generating random noise vector
with specified
(or equivalently
in the perceptual error model. The simulation
size is taken to be 50,000.
Simulation
results are presented in Tables 5.1, 5.2, 5.3 and 5.4. There are several
features that need to be discussed in these tables. For different
and sample size combinations
the relative efficiencies of the RSS estimator
with respect to the SRS estimator are given by
where
and
are the MSPE of SRS and RSS sample means from
the simulation study under a super population model in equation (2.1),
respectively. In equation (5.1), the relative efficiency values
greater than one indicate that the RSS
estimator is more efficient than the SRS estimator. In all these tables, the
RSS sample mean estimator performs better than the SRS estimator. Its
efficiency is an increasing function of set size
and correlation coefficient
as expected. Under the concomitant cost model,
if the cost ratio of obtaining a unit in RSS and a unit in SRS is less than the
RP values in Table 4.1, the RSS sample mean has higher efficiency than the
SRS sample mean.
The
impact of the finite population size
can be observed by comparing the efficiency
results in Tables 5.1 and 5.2 for the normal super population and Tables 5.3
and 5.4 for the lognormal super population. When
0.50, relative
efficiencies
are higher in Table 5.1
than Table 5.2
In Table 5.1, finite population
correction factor is smaller than the finite population correction factor in
Table 5.2. Hence, the reduction in MSPE is smaller in RSS estimator.
Similar effect is also observed in Tables 5.3 and 5.4.
The
simulation study also investigated the properties of the MSPE estimator of RSS
sample mean estimator. Theoretical value of the MSPE estimator is given under
the heading
when
1.0. The
simulated (unbiased) MSPE estimate is given in columns 5 (6) in Tables 5.1-5.4.
It is very clear that simulated and unbiased MSPE estimates are almost
identical when
as expected. Under perfect ranking
theoretical MSPE values, and the simulated and
unbiased MSPE estimates are all close to each other within the simulation
variation.
The
coverage probabilities of the confidence intervals are given under the heading
in column 7 in Tables 5.1-5.4. In
Tables 5.1 and 5.2, the coverage probabilities of the confidence intervals
based on
approximation are reasonably
close to the nominal coverage probability 0.950. On the other hand, the
coverage probabilities in Tables 5.3 and 5.4 are smaller than the nominal
coverage probability 0.95 for lognormal super population. The coverage
probabilities are getting closer to nominal values when the sample size
increases. This indicates that for skewed populations, sample sizes should be
large enough to have a reasonable coverage probability for the confidence
intervals.
Table 5.1
MSPE estimate and
relative efficiency of RSS sample estimator, and coverage probability of a 95%
confidence interval of population mean. Data sets are generated from a normal
super population with
and population
size
150
Table summary
This table displays the results of MSPE estimate and relative efficiency of RSS sample estimator. The information is grouped by (appearing as row headers), , Est. from equations, Est. from simu., UE estimates, Coverage prb. and Relative eff. (appearing as column headers).
|
|
Est. from equations |
Est. from simu. |
UE estimates |
Coverage prb. |
Relative eff. |
|
|
|
|
|
|
| 2.0 |
0.50 |
- |
1.493 |
1.355 |
1.365 |
0.949 |
1.102 |
| 3.0 |
0.50 |
- |
0.960 |
0.840 |
0.833 |
0.947 |
1.143 |
| 4.0 |
0.50 |
- |
0.693 |
0.572 |
0.578 |
0.948 |
1.213 |
| 5.0 |
0.50 |
- |
0.533 |
0.435 |
0.432 |
0.948 |
1.226 |
| 2.0 |
0.75 |
- |
1.493 |
1.195 |
1.205 |
0.949 |
1.250 |
| 3.0 |
0.75 |
- |
0.960 |
0.675 |
0.674 |
0.947 |
1.423 |
| 4.0 |
0.75 |
- |
0.693 |
0.433 |
0.436 |
0.946 |
1.600 |
| 5.0 |
0.75 |
- |
0.533 |
0.302 |
0.304 |
0.945 |
1.768 |
| 2.0 |
1.00 |
0.984 |
1.493 |
0.974 |
0.984 |
0.948 |
1.534 |
| 3.0 |
1.00 |
0.451 |
0.960 |
0.455 |
0.451 |
0.940 |
2.111 |
| 4.0 |
1.00 |
0.234 |
0.693 |
0.233 |
0.235 |
0.936 |
2.971 |
| 5.0 |
1.00 |
0.124 |
0.533 |
0.125 |
0.126 |
0.922 |
4.273 |
Table 5.2
MSPE estimate and
relative efficiency of RSS sample estimator, and coverage probability of a 95%
confidence interval of population mean. Data sets are generated from a normal
super population with
and population
size
1,000
Table summary
This table displays the results of MSPE estimate and relative efficiency of RSS sample estimator. The information is grouped by (appearing as row headers), , Est. from equations, Est. from simu., UE estimate, Coverage prb. and Relative eff. (appearing as column headers).
|
|
Est. from equations |
Est. from simu. |
UE estimate |
Coverage prb. |
Relative eff. |
|
|
|
|
|
|
| 2.0 |
0.50 |
- |
1.584 |
1.461 |
1.455 |
0.950 |
1.084 |
| 3.0 |
0.50 |
- |
1.051 |
0.931 |
0.924 |
0.949 |
1.129 |
| 4.0 |
0.50 |
- |
0.784 |
0.665 |
0.670 |
0.950 |
1.180 |
| 5.0 |
0.50 |
- |
0.624 |
0.524 |
0.522 |
0.950 |
1.191 |
| 2.0 |
0.75 |
- |
1.584 |
1.304 |
1.295 |
0.949 |
1.215 |
| 3.0 |
0.75 |
- |
1.051 |
0.770 |
0.765 |
0.948 |
1.365 |
| 4.0 |
0.75 |
- |
0.784 |
0.525 |
0.526 |
0.951 |
1.494 |
| 5.0 |
0.75 |
- |
0.624 |
0.392 |
0.395 |
0.951 |
1.590 |
| 2.0 |
1.00 |
1.075 |
1.584 |
1.075 |
1.076 |
0.950 |
1.473 |
| 3.0 |
1.00 |
0.541 |
1.051 |
0.538 |
0.541 |
0.951 |
1.954 |
| 4.0 |
1.00 |
0.325 |
0.784 |
0.327 |
0.325 |
0.949 |
2.398 |
| 5.0 |
1.00 |
0.215 |
0.624 |
0.217 |
0.215 |
0.948 |
2.877 |
Table 5.3
MSPE estimate and relative efficiency of RSS sample
estimator, and coverage probability of a 95% confidence interval of population
mean. Data sets are generated from a log-normal super population with
and population
size
150
Table summary
This table displays the results of MSPE estimate and relative efficiency of RSS sample estimator. The information is grouped by (appearing as row headers), , Est. from equations, Est. from simu., UE estimate, Coverage prb. and Relative eff. (appearing as column headers).
|
|
Est. from equations |
Est. from simu. |
UE estimate |
Coverage prb. |
Relative eff. |
|
|
|
|
|
|
| 2.0 |
0.50 |
- |
0.436 |
0.400 |
0.400 |
0.852 |
1.089 |
| 3.0 |
0.50 |
- |
0.280 |
0.243 |
0.242 |
0.869 |
1.153 |
| 4.0 |
0.50 |
- |
0.202 |
0.160 |
0.162 |
0.883 |
1.262 |
| 5.0 |
0.50 |
- |
0.156 |
0.117 |
0.116 |
0.886 |
1.336 |
| 2.0 |
0.75 |
- |
0.436 |
0.371 |
0.372 |
0.855 |
1.176 |
| 3.0 |
0.75 |
- |
0.280 |
0.216 |
0.217 |
0.867 |
1.300 |
| 4.0 |
0.75 |
- |
0.202 |
0.146 |
0.146 |
0.874 |
1.388 |
| 5.0 |
0.75 |
- |
0.156 |
0.103 |
0.103 |
0.878 |
1.514 |
| 2.0 |
1.00 |
0.362 |
0.436 |
0.361 |
0.364 |
0.839 |
1.207 |
| 3.0 |
1.00 |
0.201 |
0.280 |
0.197 |
0.198 |
0.849 |
1.423 |
| 4.0 |
1.00 |
0.128 |
0.202 |
0.128 |
0.127 |
0.847 |
1.586 |
| 5.0 |
1.00 |
0.086 |
0.156 |
0.085 |
0.085 |
0.845 |
1.833 |
Table 5.4
MSPE estimate and relative efficiency of RSS sample
estimator, and coverage probability of a 95% confidence interval of population
mean. Data sets are generated from a log-normal super population with
and population
size
1,000
Table summary
This table displays the results of MSPE estimate and relative efficiency of RSS sample estimator. The information is grouped by (appearing as row headers), , Est. from equations, Est. from simu., UE estimate, Coverage prb. and Relative eff. (appearing as column headers).
|
|
Est. from equations |
Est. from simu. |
UE estimate |
Coverage prb. |
Relative eff. |
|
|
|
|
|
|
| 2.0 |
0.50 |
- |
0.462 |
0.432 |
0.433 |
0.851 |
1.070 |
| 3.0 |
0.50 |
- |
0.307 |
0.263 |
0.263 |
0.868 |
1.164 |
| 4.0 |
0.50 |
- |
0.229 |
0.189 |
0.190 |
0.882 |
1.208 |
| 5.0 |
0.50 |
- |
0.182 |
0.141 |
0.141 |
0.889 |
1.296 |
| 2.0 |
0.75 |
- |
0.462 |
0.413 |
0.413 |
0.852 |
1.119 |
| 3.0 |
0.75 |
- |
0.307 |
0.240 |
0.238 |
0.868 |
1.276 |
| 4.0 |
0.75 |
- |
0.229 |
0.171 |
0.170 |
0.878 |
1.337 |
| 5.0 |
0.75 |
- |
0.182 |
0.129 |
0.129 |
0.884 |
1.415 |
| 2.0 |
1.00 |
0.389 |
0.462 |
0.387 |
0.386 |
0.839 |
1.195 |
| 3.0 |
1.00 |
0.228 |
0.307 |
0.225 |
0.227 |
0.852 |
1.364 |
| 4.0 |
1.00 |
0.154 |
0.229 |
0.155 |
0.155 |
0.857 |
1.479 |
| 5.0 |
1.00 |
0.113 |
0.182 |
0.113 |
0.113 |
0.862 |
1.614 |
ISSN : 1492-0921
Editorial policy
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.
© Her Majesty the Queen in Right of Canada as represented by the Minister of Industry, 2018
Use of this publication is governed by the Statistics Canada Open Licence Agreement.
Catalogue No. 12-001-X
Frequency: Semi-annual
Ottawa