Unequal probability inverse sampling Section 2. Formalization of the problem
The following notation is used:
- a population of enterprises, i.e., may denote the population of enterprises in an economic region),
- the list of occupations,
- the number of occupations in the list, i.e., the size of
- the list of occupations in enterprise with
- the list of occupations absent from enterprise with and
- the number of occupations in enterprise i.e., the size of
- the number of distinct occupations to be obtained in each enterprise,
- the number of failures before the occupations in enterprise are obtained by selecting the occupations using a given design.
The main objective is to estimate the average wage for an occupation in the total population. Let be the average wage for occupation in enterprise and let be the number of employees with occupation in enterprise The objective is to estimate the average wage for occupation given by
Assume that a sample of enterprises is selected from using some given design with inclusion probabilities In enterprise a sample of occupations is selected using one of the designs described above with inclusion probability If the design is with replacement, represents the expected number of times that occupation is selected in enterprise
can be estimated using a “ratio” type estimator (Hájek 1971):
Therefore, the probability that an occupation will be selected in an enterprise must be known. However, with an inverse type design, the probability is unknown and must therefore be estimated in order to estimate Since the inclusion probabilities appear in the denominator, it is preferable to estimate the inverses of In an enterprise, an occupation’s probability of being selected decreases as the number of occupations increases. In addition, the probability depends on the inverse sampling design used in each enterprise.
- Date modified: