Unequal probability inverse sampling

Section 6. Unequal probability sampling without replacement

6.1 Sequential sampling without replacement

For sampling without replacement, the first problem is to determine the design. One option is the sequential Poisson sampling method of Ohlsson (1995). This method generates $M$ uniform random variables on the interval $[0,1],$ denoted $u_{i k},$ and then selects the $n$ units with the smallest values of $u_{i k} / π_{k | i} .$ It has the advantage of working for any sample size and of producing a sequence of nested samples. Unfortunately, it satisfies the fixed inclusion probabilities only approximately, although the simulations in Ohlsson (1995) indicate that the approximation is very accurate.
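The ranking step of sequential Poisson sampling can be sketched as follows. This is a minimal illustration in Python, not the implementation of Ohlsson (1995): the function name `sequential_poisson_order` and its arguments are hypothetical, and `pi` is assumed to hold strictly positive target probabilities (or size measures proportional to them).

```python
import random

def sequential_poisson_order(pi, rng=None):
    """Rank units by u_k / pi_k, smallest first (illustrative sketch).

    pi  : list of strictly positive target probabilities (assumed input)
    rng : optional random.Random instance, for reproducibility
    """
    rng = rng or random.Random()
    if any(p <= 0 for p in pi):
        raise ValueError("all entries of pi must be strictly positive")
    # One uniform draw per unit, divided by that unit's probability.
    keys = [(rng.random() / p, k) for k, p in enumerate(pi)]
    keys.sort()
    # The first n entries of the returned order form the sample of size n.
    return [k for _, k in keys]

order = sequential_poisson_order([0.2, 0.5, 0.3], random.Random(1))
sample_of_two = order[:2]
```

Because every sample is a prefix of the same ordering, the sample of size $n$ is always contained in the sample of size $n + 1$, which is the nesting property described above.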

Methods have also been proposed by Sampford (1962) and Pathak (1964). We propose an exact solution to the problem in the sense that the inclusion probabilities are exactly satisfied. We begin by calculating the inclusion probabilities for a design of fixed size $n$ with inclusion probabilities proportional to a strictly positive auxiliary variable $b_{k}, k \in L .$ The probabilities are determined by

$π_{k | i} (n) = \min (1, C_{n} \frac{b_{k}}{\sum_{l \in L} b_{l}}),$

where $C_{n}$ is determined such that

$\sum_{k \in L} π_{k | i} (n) = \sum_{k \in L} \min (1, C_{n} \frac{b_{k}}{\sum_{l \in L} b_{l}}) = n .$

A simple algorithm for calculating these probabilities is described in Tillé (2006, page 19), among others. The probabilities can be calculated simply using the function inclusionprobabilities in the R sampling package.
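As an illustration, the probabilities $π_{k | i} (n)$ can be computed by repeatedly capping at 1 the units whose proportional share reaches 1 and redistributing the remaining sample size among the other units. The Python sketch below is an assumption about how such an algorithm may be written; it is not the code of the R function, and the tolerance `eps` is a hypothetical parameter added to absorb floating-point error.

```python
def inclusion_probabilities(b, n, eps=1e-12):
    """Return pi_k(n) = min(1, C_n * b_k / sum(b)) such that sum(pi) = n.

    b : list of strictly positive auxiliary values b_k
    n : fixed sample size, 0 <= n <= len(b)
    """
    N = len(b)
    if not 0 <= n <= N:
        raise ValueError("n must lie between 0 and len(b)")
    if any(x <= 0 for x in b):
        raise ValueError("all b_k must be strictly positive")
    pi = [0.0] * N
    free = list(range(N))   # units not yet capped at 1
    remaining = n           # sample size still to share among free units
    while free and remaining > 0:
        total = sum(b[k] for k in free)
        # Units whose proportional share reaches 1 are capped at 1.
        capped = {k for k in free if remaining * b[k] / total >= 1.0 - eps}
        if not capped:
            for k in free:
                pi[k] = remaining * b[k] / total
            break
        for k in capped:
            pi[k] = 1.0
        free = [k for k in free if k not in capped]
        remaining -= len(capped)
    return pi

pi_2 = inclusion_probabilities([1, 2, 3], 2)   # approximately [1/3, 2/3, 1]
```

For $b = (1, 2, 3)$ and $n = 2$, the proportional shares are $(1/3, 2/3, 1)$, so the third unit is capped and the remaining size 1 is shared between the first two units in proportion to $b_{k}$.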

A sequential selection method must therefore select a sample of size $n$ with inclusion probabilities $π_{k | i} (n) .$ It must then allow the size to be increased from $n$ to $n + 1$ by selecting a single additional unit, such that the completed sample has inclusion probabilities $π_{k | i} (n + 1) .$ The only method that appears to achieve this is the elimination method (Tillé 1996). This method starts with the entire population (the list of occupations) and eliminates one unit in each step. In step $j =1, \dots, N,$ unit $k$ is eliminated from among the remaining units with probability

$1 - \frac{π_{k | i} (N - j)}{π_{k | i} (N - j + 1)} .$

This method can thus be used to create a sequence of nested samples, each of which satisfies the inclusion probabilities corresponding to its size.

Therefore, we can simply apply the elimination method for sample size $n =1$ so that the algorithm successively eliminates all the units. Taking them in the reverse order of elimination, we obtain a sequence of units. The first $n$ units of the sequence are selected with inclusion probability $π_{k | i} (n) .$ The appendix contains a function written in R that can be used to generate this sequence. The code is executed in a simulation that shows that the probabilities obtained through simulations by applying this function are equal to the fixed inclusion probabilities for all sample sizes.
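The construction of the sequence can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the R function from the appendix: the names `inclusion_probabilities` and `elimination_sequence` are hypothetical, the helper uses a simple capping algorithm, and the elimination probabilities are passed as weights to `random.choices`, which renormalizes them to guard against rounding error.

```python
import random

def inclusion_probabilities(b, n, eps=1e-12):
    """pi_k(n) = min(1, C_n * b_k / sum(b)) with sum(pi) = n (sketch)."""
    pi = [0.0] * len(b)
    free, remaining = list(range(len(b))), n
    while free and remaining > 0:
        total = sum(b[k] for k in free)
        capped = {k for k in free if remaining * b[k] / total >= 1.0 - eps}
        if not capped:
            for k in free:
                pi[k] = remaining * b[k] / total
            break
        for k in capped:
            pi[k] = 1.0
        free = [k for k in free if k not in capped]
        remaining -= len(capped)
    return pi

def elimination_sequence(b, rng=None):
    """Eliminate units one by one down to size 1, then reverse the order."""
    rng = rng or random.Random()
    N = len(b)
    if N == 0 or any(x <= 0 for x in b):
        raise ValueError("b must be non-empty and strictly positive")
    remaining = list(range(N))
    eliminated = []
    pi_prev = inclusion_probabilities(b, N)        # all equal to 1
    for j in range(1, N):                          # sizes N-1, ..., 1
        pi_next = inclusion_probabilities(b, N - j)
        # Elimination probability 1 - pi_k(N-j) / pi_k(N-j+1).
        weights = [max(0.0, 1.0 - pi_next[k] / pi_prev[k]) for k in remaining]
        victim = rng.choices(remaining, weights=weights, k=1)[0]
        remaining.remove(victim)
        eliminated.append(victim)
        pi_prev = pi_next
    eliminated.append(remaining[0])                # last surviving unit
    return eliminated[::-1]                        # reverse elimination order

sequence = elimination_sequence([1, 2, 3], random.Random(0))
first_two = sequence[:2]   # sample of size 2 taken from the sequence
```

Repeating the call many times and recording how often each unit appears among the first $n$ entries gives an empirical check of the inclusion probabilities $π_{k | i} (n)$, in the spirit of the simulation mentioned above.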

6.2 Inverse or negative design with unequal probabilities

Now that the design is defined, the inverse design can be defined. The units in the list of occupations are taken using the elimination method until $r$ occupations in the enterprise are selected. In this case, the probability distribution of the number of failures $X_{i}$ seems impossible to calculate. Calculating the conditional inclusion probability $E (A_{i k} | X_{i})$ is also problematic.

However, we can proceed by analogy and estimate the inclusion probabilities on the basis of expression (5.1) developed for the case with replacement, where $p_{i k}$ can simply be replaced by

$\frac{π_{k | i} (r + X_{i})}{r + X_{i}} .$

Therefore, we obtain

$\widehat{1 / π_{k | i}} = \begin{cases} \dfrac{(r - 1) (r + X_{i})}{r (X_{i} + r - 1) π_{k | i} (r + X_{i})} & \text{if } k \in F_{i} \\ \dfrac{r + X_{i}}{(X_{i} + r - 1) π_{k | i} (r + X_{i})} & \text{if } k \in D_{i} . \end{cases}$
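The two cases of this estimator can be evaluated with a short Python function. This is a sketch only: the function name and arguments are hypothetical, `pi_at_stop` is assumed to be the value $π_{k | i} (r + X_{i})$ already computed for the unit, and the flag `in_F` indicates whether $k$ belongs to $F_{i}$ (otherwise $k$ is taken to belong to $D_{i}$).

```python
def inverse_inclusion_estimate(pi_at_stop, r, x, in_F):
    """Estimate 1 / pi_{k|i} by analogy with the with-replacement case.

    pi_at_stop : pi_{k|i}(r + X_i), assumed strictly positive
    r          : required number of selected occupations in the enterprise
    x          : observed number of failures X_i
    in_F       : True if k is in F_i, False if k is in D_i
    """
    if pi_at_stop <= 0:
        raise ValueError("pi_at_stop must be strictly positive")
    if r < 1 or x < 0 or x + r - 1 <= 0:
        raise ValueError("need r >= 1, x >= 0 and x + r > 1")
    # Factor shared by both cases: (r + X) / ((X + r - 1) * pi(r + X)).
    common = (r + x) / ((x + r - 1) * pi_at_stop)
    return (r - 1) / r * common if in_F else common

w_F = inverse_inclusion_estimate(0.5, r=3, x=2, in_F=True)    # (2/3) * 2.5
w_D = inverse_inclusion_estimate(0.5, r=3, x=2, in_F=False)   # 2.5
```

The two cases differ only by the factor $(r - 1) / r$, so the shared part is computed once.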

ISSN: 1492-0921

Journal: Survey Methodology

Copyright

Published by authority of the Minister responsible for Statistics Canada.

Use of this publication is governed by the Statistics Canada Open Licence Agreement.

Catalogue No. 12-001-X

Frequency: semi-annual

Ottawa

Date modified: 2016-12-20
