5. Determination of the optimal inclusion probabilities
Piero Demetrio Falorsi and Paolo Righi
Previous | Next
The vector of
values is determined by solving
the following optimization problem:
where
is the cost for collecting
information from unit
and
is a fixed variance threshold
corresponding to
System (5.1) minimizes the
expected cost ensuring that the anticipated variances are bounded and that the
inclusion probabilities lie between 0 and 1. If all the
values are constants equal to 1,
then the problem (5.1) minimizes the sample size. We note that in problem (5.1)
the variances
in
are treated as known; in practice
they must be estimated. In Section 6, an empirical evaluation is conducted in
order to study the sensitivity of the overall sample size with different
estimated values of
To solve (5.1), we
rearrange the inequality constraints to obtain
By fixing the
values of
appropriately, the optimization
problem becomes a classical Linear Convex Separate Problem (LCSP; Boyd and
Vanderberg 2004). Figure 5.1 depicts the flow chart of the algorithm (A prototype software implementing the algorithm is available at
http://www.istat.it/it/strumenti/metodi-e-software/software.), which is
organized into two nested loops: the Outer Loop (OL) and the Inner Loop (IL). The two loops are
updated according to a fixed point algorithm scheme. The convergence under some approximations is shown in
Appendix A2.
Figure 5.1 Algorithm flowchart

Description for Figure 5.1
Initialization. At iteration
of the OL, set
with
A reasonable choice is
At iteration
of the Inner Loop, set
Fix the
vector,
of small
positive values.
Outer loop
- Fixing
the values for the Inner Loop. In accordance with expressions (A1.4), (A1.7) and (A1.8) given in
Appendix A1, the following real scalar values are computed
- Launch
of the Inner Loop. The Inner Loop is executed until convergence.
- Updating
or exiting. If the vector
is such that
then the Outer Loop is iterated
by updating the vector
with
If
then the Outer Loop closes and
represents the optimal values
solution to the problem of the system (5.1).
Inner Loop
-
Fixing
the values for the LCSP. The following values are computed:
in accordance with expression (A1.7) in
Appendix A1.
Solving
the LCSP. Considering the
values as fixed, the
is obtained by solving, by a
standard algorithm for a classical LCSP, the following optimization problem:
Updating
or exiting. If
the vector
is such that
then the Inner Loop is iterated by updating the vector
with
If
then the Inner Loop closes and the updated vector
for the Outer Loop is given by
Remark 5.1. The problem of the system (5.7) can be solved by the algorithm proposed
in Falorsi and Righi (2008, Section 3.1) which represents a slight modification
of Chromy’s algorithm (1987), originally developed for multivariate optimal
allocation in SSRSWOR designs and implemented in standard software tools (see for example the Mauss-R software
available at: http://www3.istat.it/strumenti/metodi/software/campione/mauss_r/). Alternatively, the LCSP can be dealt with by
the SAS procedure NLP as suggested by Choudhry et al. (2012).
Remark 5.2. The algorithm distinguishes the
(updated in the Outer loop) from
the
(updated in the Inner loop). The
innovation of the proposed algorithm lies precisely in this peculiarity. If
this distinction between the inclusion probabilities is not made, i.e.,
we have observed in several
experiments that the iterate solutions of the LCSP for each Outer Loop do not
converge to a stationary point.
Remark 5.3. After the optimization phase, in which the
vector is defined as solution to
problem of system (5.1), a calibration
phase is performed (Falorsi and Righi 2008) to obtain calibrated inclusion
probabilities,
which modifies the optimal
vector marginally in order to
satisfy
where
is a vector of integer numbers. The use of the
Generalized Iterative Proportional Fitting algorithm (Dykstra and Wollan 1987)
ensures that all resulting calibrated inclusion probabilities are in the
interval.
Previous | Next