Sample empirical likelihood approach under complex survey design with scrambled responses
Section 2. Preliminaries
Suppose the finite
population
is generated
from some unknown super-population model, where
is a study
variable and
is a covariate.
For ease of presentation, given
a random sample
is assumed to
be selected from a single stage unstratified sampling design. Let
be the sampling
indicator for unit
such that
if unit
is selected and
0 otherwise. Denote the first-order and second-order inclusion probabilities as
and
for
Then, the
sampling weight can be written as
and sample size
is
Suppose the
parameter of interest is
Due to
confidentiality, we plan to use scrambled responses
of
such that
with
probability
and
with
probability
where
and
with
and
known. Bar-lev
et al. (2004) and Singh and Kim (2011) considered similar models. Instead
of observing
directly, we
only observe the scrambled responses
in the data
file. Hájek estimator discussed in Hájek (1971) and Fuller (2009) has been used
frequently in survey data analysis. Under certain regularity conditions, one
can show that the following Hájek (HJ) type estimator is consistent:
where
and
since
and
The asymptotic properties of
are described in the following Theorem 1,
and the sketched proof is contained in Appendix B.
Theorem 1. Under the regularity
conditions in Appendix A,
has the following
asymptotic expansion
and
as
with
where
and
Note that
is the design variability of Hájek estimator
for population mean
by using the true values and
is the additional variability generated by
using scrambled responses. According to Theorem 1, the consistent
estimator of
can be written as
When
the second term above can be safely ignored.
Therefore, we can use a traditional design consistent estimator with
transformed variable
In the next section, we will propose using the
pseudo empirical likelihood method to construct both point estimator and
confidence interval when we have aggregated auxiliary information.
ISSN : 1492-0921
Editorial policy
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.
© Her Majesty the Queen in Right of Canada as represented by the Minister of Industry, 2021
Use of this publication is governed by the Statistics Canada Open Licence Agreement.
Catalogue No. 12-001-X
Frequency: Semi-annual
Ottawa