3. Sampling
Piero Demetrio Falorsi and Paolo Righi
Previous | Next
Let
be a vector of auxiliary
variables available for all
A sampling design
is said to be balanced on the
auxiliary variables if and only if it satisfies the following balancing
equations
for each
sample
such that
(Deville and Tillé 2004). Depending on the auxiliary variables and the
inclusion probabilities, equation (3.1) can be exactly or approximately
satisfied in each possible sample; therefore, a balanced sampling design does
not always exist. By specifying
equations
(3.1) become
In this case, the
balancing equations state that the sample size achieved in each subpopulation
is equal to the expected size. In
different contexts, Ernst (1989) and Deville and Tillé (2004; page 905 Section
7.3), have proved that, (i) with the
specification (3.2) and (ii) if the vector of the expected sample
sizes, given by
includes only integer numbers,
then a balanced sampling design always exists. Specification (3.2) defines
sampling designs that guarantee equation (2.4), upon which we wish to focus on.
Deville and Tillé (2004, pages 895 and 905), Deville and Tillé (2005, page 577)
and Tillé (2006, page 168) have shown that several customary sampling designs
may be considered as special cases of balanced sampling, by properly defining
the vectors
and
of equation (3.2). These issues
are illustrated in Remark 4.2 and in Section 6. Balanced samples may be drawn by means of the
Cube method (Deville and Tillé 2004). This strongly facilitates the sample
selection of incomplete stratified sampling designs that overcome the computational
drawbacks of methods based on linear programming algorithms (Lu and Sitter
2002). The Cube method satisfies (3.1) exactly when (3.2) holds and
is a
vector of integers. In the cases of SRSWOR and SSRSWOR, the standard sample
selection methods can be used, as well as the Cube method. Deville and Tillé
(2005) propose as approximation of the variance for the HT estimator, in the
balanced sampling
where
denotes the sampling expectation
and
with
Recently, the
simulation results in Breidt and Chauvet (2011) confirm that equation (3.4)
represents a good approximation of the sampling variance when the balanced
equations are satisfied exactly. Variance estimation is studied in Deville and Tillé
(2005).
Previous | Next