Criteria for choosing between calibration weighting and survey weighting
Section 2. Estimator of a variable of interest total
for a
population size
from
which sample
of size
is
selected based on survey design
designates
a random variable such as
and
and
respectively designate the first and second
probabilities of inclusion in survey design
We are
interested in a variable of interest
with the
objective of estimating its total
To do
that, we consider the category of linear estimators
where
are the
weights that can depend on sample
and the
auxiliary variables available. The basic weights used are the sampling weights
generated by
They
correspond to the Horvitz-Thompson estimator
(1952).
It is assumed that
we have
auxiliary variables
for
which the values may be represented by vectors
and for
which the vector of their totals
is
known. The category of calibration estimators is defined by
where
referred
to as calibration weights, verify the calibration equation given by
Calibration
helps to reduce the variance of a total estimator, particularly for variables
of interest that are linked to the auxiliary variables used in calibration.
However, calibration results in an estimator with a bias other than zero. That
is why the calibration weights are determined so that they are as close as
possible to the sampling weights in order to manage bias.
2.1 Precision of a
linear total estimator
In order to
measure the precision of a linear total estimator, we will consider the design
and model-based approach. In addition to the design distribution, this approach
consists of assuming that values
for the
variable of interest
are the
product of a random vector
whose
joint probability distribution is given by the Superpopulation model
defined
by:
with
where
are
unknown parameters.
and
represent respectively the expectation,
variance and covariance for the model. Vector estimator
for the
regression coefficients is produced by
where
is the
matrix of
values
for
and
Under
the the design and model-based approach, the criterion used to measure the
precision of a linear total estimator is
which
corresponds to the mean square error (MSE) for the design and model, also
referred to as the anticipated mean
square error (AMSE). This is based on the assumption that the design is not
informative. We can then show that the AMSE for linear estimator
is
(Nedyalkova and Tillé, 2008):
where
with
(sampling weight) and
for
and
otherwise. Ratio
equals 1
when linear estimator
is
unbiased according to the design.
2.2 AMSE for the calibration estimator
For the
calibration estimator, verifying the calibration equation renders it unbiased
under the model:
Consequently,
the AMSE is expressed as:
where
and
Giving
Note that the
expression (2.5) of
makes it
possible to underscore the two criteria that determine the accuracy of
calibration estimator
The
first corresponds to Superpopulation model
through
its residual variance
which
decreases when the variable of interest and the calibration variables are
correlated (variance reduction
The
second criterion is represented by weight ratios
which
become important when the calibration weights are very different from the
sampling weights (bias increase
2.3 AMSE for the HT estimator
In order to
develop our criterion for choosing between calibration weighting and sample
weighting, we need to determine the expression of the AMSE for the HT estimator. Since the latter is unbiased under the
design
its AMSE is given by:
It should be
noted that the expression of the AMSE for
depends
on probabilities
which
are generally unknown and difficult to calculate for unequal probability
sampling designs. Several approximations for these probabilities have been
proposed in literature, enabling us to obtain several possible estimators for
the variance of the HT estimator. However, Matei and Tillé (2005) showed,
through a series of simulations, that these estimators are almost equivalent
and allow us to effectively estimate the exact expression of the variance under
design
An approximation
of
can be
obtained by considering the one proposed by Hájek (1981) for the variance of
the HT estimator, produced by:
where
and
The latter is obtained from the following
approximation of probabilities
(see
Deville and Tillé, 2005; Tirari, 2003):
Consequently,
the AMSE for
can be
approximated by:
It should be
noted that for simple designs, such as Poisson design or simple stratified
random design, joint probability can be calculated precisely without the need
for an approximation. In the next section, we will be basing calibration and HT
estimators on the AMSE to develop a
new measurement of the impact of
using calibration weights.
ISSN : 1492-0921
Editorial policy
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.
© Her Majesty the Queen in Right of Canada as represented by the Minister of Industry, 2018
Use of this publication is governed by the Statistics Canada Open Licence Agreement.
Catalogue No. 12-001-X
Frequency: Semi-annual
Ottawa