Statistical inference with non-probability survey samples
Section 5. Quota surveys and poststratification
Quota surveys are one of
the oldest non-probability survey sampling methods which are still used in
practice in present days. For a pre-specified overall sample size
quotas of
sample sizes are set for subpopulations which are defined by demographic
variables and social-economic status indicators or other characteristic
variables suitable for units of the target population. Data collection
processes continue until quotas for each of the subpopulations are filled.
Units from the population are typically approached using whatever convenient
ways available and there are little or no controls on how units are selected
for the final sample other than the pre-specified quotas.
The theory of the IPW
estimators for non-probability survey samples provides an opportunity to
examine scenarios where quota surveys may succeed or fail. For the convenience
of notation without loss of generality, let
be the quota
survey sample and
be the set of
categorical variables used for defining the subpopulations and setting the
quotas. The overall sample can be partitioned into
corresponding
to the cross-classification of sampled units using the combinations of levels
of the
variables. For
instance, if
with
having two
levels and
having three
levels, we have a total of
subpopulations
defined by
Let
be the
pre-specified size of
and
be the size of
the corresponding subpopulation. Under the assumption A1, the propensity scores
become a
constant for units in the same subpopulation and are given by
for the
subpopulation. The
IPW estimator
given in (4.8)
reduces to
where
is the size of
the
subpopulation
obtained or estimated from external sources, and
Under the
current setting with the availability of a reference probability sample
we form the
same partition as cross-classified by levels of
and obtain
We can then use
The estimator given in
(5.1) is the standard poststratified estimator of
It requires the
information on the “stratum weights”
which is not
available from the sample data itself. Quota surveys, combined with the use of
the poststratified estimator, can be successful in producing valid population
estimates for the study variable
if the following
conditions hold:
(i) The
categorical variables
used in
defining the subpopulations and setting the quotas provide characterizations of
the participation behavior of the units for voluntary surveys.
(ii) The
inclusion of units in the survey is relatively random within each subpopulation
and no specific groups are intentionally excluded from the survey.
(iii) The
information on the stratum weights corresponding to the cross-classifications
in setting the quotas can be reliably obtained from external sources.
(iv) The hardcore
nonrespondents in the population who never take any voluntary surveys possess
similar features to respondents in terms of the study variable
The IPW estimators
and
given in (4.8)
may be sensitive to small values of estimated propensity scores. The
poststratified estimator in the form of (5.1) serves as a robust alternative
under general scenarios where the dimension of
is not low
and/or some components of
are continuous.
The
strata are
formed based on homogeneous groups in terms of the propensity scores. Suppose
that
are computed
based on a parametric model,
Suppose also
that
with the chosen
where
is an integer.
Let
be the
estimated propensity scores in ascending order. Let
be the set of
the first
units in the
sequence,
be the second
units in the
sequence, and so on. The poststratified estimator of
is computed as
which has the
same form of the estimator given in (5.1). The estimates of the stratum
weights,
can be obtained
by using the reference probability sample
as follows. Let
Let
and
- Compute
- Define
- Calculate
It is apparent that
and
The estimated
stratum weights are given by
The choice of
needs to
reflect the balance between homogeneity of the units within each post-stratum
(in terms of the propensity scores) and the stability of the poststratified
estimator (in terms of the stratum sample sizes). When the sample size
is small or
moderate, a small number such as
should be used.
For scenarios where
is large, a
larger
should be used
such that units within the same poststratified sample
have similar
estimated propensity scores. A practical guidance for the choice of
is to ensure
that
for the
poststratified samples. For those who are old enough, do you remember the good
old days when “the sample size is large” means
ISSN : 1492-0921
Editorial policy
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.
© His Majesty the King in Right of Canada as represented by the Minister of Industry, 2022
Use of this publication is governed by the Statistics Canada Open Licence Agreement.
Catalogue No. 12-001-X
Frequency: Semi-annual
Ottawa