Unequal probability inverse sampling
Section 6. Unequal probability sampling without replacementUnequal probability inverse sampling
Section 6. Unequal probability sampling without replacement
For the draw without replacement, the first
problem is determining the design. One option is to use the method by Ohlsson
(1995) called sequential Poisson sampling. This method involves generating
uniform
random variables in the interval
denoted
Next, we
select the
units
corresponding to the smallest values of
This
method has the advantage of being usable for any sample size and providing a
sequence of samples that are included in each other. Unfortunately, it only satisfies
approximately the fixed inclusion probabilities. However, the approximations
are very accurate according to the simulations given in Ohlsson (1995).
Methods have also been proposed by Sampford
(1962) and Pathak (1964). We propose an exact solution to the problem in the
sense that the inclusion probabilities are exactly satisfied. We begin by
calculating the inclusion probabilities for a design of fixed size
with
inclusion probabilities proportional to a strictly positive auxiliary variable
The
probabilities are determined by
where
is determined such that
A simple
algorithm for calculating these probabilities is described in Tillé (2006, page
19), among others. The probabilities can be calculated simply using the function
inclusionprobabilities in the R sampling package.
A sequential selection method must therefore
select a sample of size
with
inclusion probabilities
It must
then make it possible to go from size
to size
by
simply selecting an additional unit such that the completed sample has an
inclusion probability of
It
appears that the only method that allows that to be achieved is the elimination
method (Tillé 1996). This method starts with the entire population (the list of
occupations) and eliminates one unit in each step. In step
the unit
is eliminated from among the remaining units with the probability
This method can
thus be used to create a sequence of samples included in each other that verify
the inclusion probabilities in relation to their size.
Therefore, we can simply apply the elimination
method for sample size
so that
the algorithm successively eliminates all the units. Taking them in the reverse
order of elimination, we obtain a sequence of units. The first
units of
the sequence are selected with inclusion probability
The
appendix contains a function written in R that can be used to generate this
sequence. The code is executed in a simulation that shows that the
probabilities obtained through simulations by applying this function are equal
to the fixed inclusion probabilities for all sample sizes.
6.2 Inverse or negative design with unequal
probabilities
Now that the design is defined, the inverse
design can be defined. The units in the list of occupations are taken using the
elimination method until
occupations in the enterprise are selected. In
this case, the probability distribution of the number of failures
seems
impossible to calculate. Calculating the conditional inclusion probability
is also
problematic.
However, we can proceed by analogy and estimate
the inclusion probabilities on the basis of expression (5.1) developed for the
case with replacement, where
can
simply be replaced by
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.