A note on multiply robust predictive mean matching imputation with complex survey data
Section 3. Proposed method
The proposed method
allows the user to specify multiple outcome regression models for the survey
variable
This grants a
greater probability of selecting a model that performs well at replicating the
relationship between the response variable and the explanatory variables,
making the approach multiply robust without requiring the Lipschitz continuity
condition to hold. As long as one of the specified models is correctly
specified, the resulting estimator will be consistent.
We consider a class of
outcome regression models:
To impute the
missing values, we proceed as follows:
(Step1).
Obtain the estimators
of
by solving the following survey weighted
estimating equations:
(Step2).
For
obtain the
-vector of
predicted values
(Step3).
Fit a weighted linear regression model without intercept with
as the response variable and
as the vector of explanatory variables. Let
be the resulting predicted value attached to
unit
where
and
(Step4).
The imputed value for the missing
is
where
is the index of the nearest-neighbour of unit
which satisfies
for any
After applying (Step1)-(Step4), we construct the imputed estimator of
where
Using an approach similar to the one used by
Yang and Kim (2020), it can be shown that the estimator
is multiply robust in the sense that it is
consistent if all but one model are misspecified.
Estimating the variance
of
can be done
through replication variance estimation procedures; see e.g., Rust and Rao
(1996) and Wolter (2007). In the context of PMM for survey data, Yang and Kim
(2020) also considered replication procedures. Let
denote the
number of replicates and
be a
replication weight attached to unit
in the
replicate. A
replication variance estimator of
is given by
where
denote the estimator
in the
replicate with
denoting the imputed value attached to unit
in the
replicate, obtained from (Step1)-(Step4)
above, based on the replication weight
instead of the original weights
The factor
in (3.4) is determined by the
replication method. For instance, with the delete-one jackknife, we have
and
if
and
if
ISSN : 1492-0921
Editorial policy
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.
© Her Majesty the Queen in Right of Canada as represented by the Minister of Industry, 2021
Use of this publication is governed by the Statistics Canada Open Licence Agreement.
Catalogue No. 12-001-X
Frequency: Semi-annual
Ottawa