We examine the performance of the variance estimators in
two simulation studies. The first study generates finite populations with
response indicators
and then draws simple random samples from the
population. The second simulation uses data from the 2009-2013 5-year American Community Survey Public Use Microdata Samples (ACS PUMS) as a population and
then draws repeated cluster samples from this population under different
nonresponse mechanisms.
For the simulation involving simple random sampling, we
generated finite populations of 1,000,000 units. To study the
poststratification estimator we used
poststrata to generate nonresponse. The
experimental factors were:
sample size,
300 or 1,000.
population proportion
in each poststratum: (P1) (1/6, 1/6, 1/6, 1/6,
1/6, 1/6), (P2) (1/21, 2/21, 3/21, 4/21, 5/21, 6/21), and (P3) (6/21, 5/21,
4/21, 3/21, 2/21, 1/21).
response rates in poststrata: (R1) (0.2, 0.3, 0.5, 0.6, 0.8, 0.9), (R2) (0.3,
0.7, 0.3, 0.7, 0.3, 0.7), and (R3) (1, 1, 1, 1, 1, 1). Level (R3), with full
response, is included to explore the accuracy of the higher-order approximation
to the variance when
number of poststrata used in nonresponse adjustment: 1, 3 (collapse
adjacent pairs of poststrata), or 6. Only the settings with 6 poststrata are
guaranteed to correct for the nonresponse bias.
Within each poststratum, population values
were generated from a normal distribution with
the specified poststratum mean and variance 1. The response indicators
were generated as independent Bernoulli random
variables with mean
The simple random sampling simulations were
done in version 3.2.2 of R (R Core Team 2015), and 2,000 iterations were
performed for each of the 162 simulation settings, which results in a standard
error less than 0.005 for the Monte Carlo estimate of the rejection proportion
when the null hypothesis of
is true. Some of the generated samples had
fewer than two respondents in one or more poststrata, which would result in
some jackknife resamples having no respondents in those poststrata. For such
samples, the two poststrata with the smallest number of respondents were
combined iteratively until all poststrata had at least two respondents.
For each simulation setting, the Monte Carlo (MC)
variance of
was calculated as the sample variance of
for
The linearization and jackknife variance
estimates were calculated for each simulated sample, and the means of those
estimates over the 2,000 samples are denoted as
and
respectively.
Figures 4.1 and 4.2 display results for the simulation
settings in which
Figure 4.1 displays histograms of the ratios
of the mean linearization and jackknife variance estimates to
The scatterplot in Figure 4.2 displays the
percentage of the 2,000 iterations in which the null hypothesis
is rejected at the 5% significance level. Most
of the variance estimates are close to the MC variance and the rejection rate
for
is approximately 5% when
with higher power for larger values of
Four of the simulation runs with
however, have linearization and jackknife
variances that are approximately twice the MC variance, and rejection rates
that are between 0 and 1%. These results are from the simulations with
poststratum means (M3), response rates (R3), population proportions (P2) or
(P3), and three collapsed poststrata. Although the population means for the
collapsed poststrata differ, they do not differ greatly and a sample size of
1,000 is too small for the first-order asymptotic approximation to be accurate.
For these settings, a sample size of approximately 15,000 was needed to reduce
the variance ratios
and
to 1.2.
Description for Figure 4.1
This figures
shows two graphs to compare respectively the linearization and the Jackknife
variance to the MC variance. is on the
x-axis ranging from 0 to 800,000 and the ratio of the variance estimates over
the MC variance is on the y-axis ranging from 0.8 to 2.0. Simulations results from and are shown.
Description for Figure 4.2
This figures
shows two graphs to illustrate the percentage of null hypothesis rejected for
both the linearization and the Jackknife variances. is on the
x-axis ranging from 0 to 800,000 and the percentage of null hypothesis rejected
is on the y-axis ranging from 0 to 100. Simulations results from and are shown.
Figure 4.3 shows the behavior of and when the first-order term of the variance is but For all of those simulations, the true value
of was 0 and the second-order term was calculated using the SRS approximation in
Theorem 3. Even though the true first-order variance is zero for these settings, the estimated
first-order variances from linearization and jackknife are nonzero. For the
simulations with poststratum means (M1) and response rates (R3), for example,
all poststrata have the same population mean. The sample means for the
poststrata differ, however, and this causes the linearization and jackknife
variance estimators to be positive and, on average, about twice as large as the
MC variance. The same thing happens with poststratum means (M3), population
proportions (P1), and response rates (R3) when three poststrata are used: the
three collapsed poststrata each have population mean 1/2 but the sample means
vary.
Description for Figure 4.3
Figure 4.3 shows
the behavior of and when the
first-order term of the variance is but The Log MC
variance is on the x-axis ranging from 14.5 to 18.0 and the ratio of the
variance estimates over the MC variance is on the y-axis ranging from 0.8 to
2.0. The linearization and jackknife ratios are mostly around 2 while the ratio
from is around 1.
Only simulation settings with response rates (R3)
required the use of higher-order terms or large sample sizes for the
linearization and jackknife variance estimators to be accurate. It would be
easy to identify these situations in practice from the absence of nonresponse.
To study the properties of the estimators in Section 3,
we used a subset of the populations generated for the poststratification
simulation as well as populations generated with continuous covariate
giving factors:
Sample
size,
300 or 1,000.
Population
values and nonresponse generation.
Nonresponse is generated in 6 poststrata with
population proportions (P1) or (P2), and response rates (R1) or (R2). The
variable of interest
is generated with poststratum means (M1) and
(M2) plus a
error term.
Covariate
is generated from a
distribution. Then
is generated as (Y1)
(independent of
(Y2)
or (Y3)
The response propensities are generated as
(R1P)
for all units, (R2P) logit
and (R3P) logit
Response
propensity model used.
For poststratified populations, treat
as a continuous variable with values 1
6.
For populations with generated covariate
use linear logistic regression with covariate
This model is correctly specified for
response-generating mechanisms (R1P) and (R2P) but incorrectly specified for
mechanism (R3P).
To reduce the instability of the estimators, estimated
response propensities less than 0.05 were replaced by 0.05, corresponding to
trimming weight adjustments larger than 20. Figures 4.4 and 4.5 display the
variance ratios and empirical power for the propensity model simulations. All
settings in this simulation had
As in the poststratification simulation, the
linearization and jackknife variance estimators both perform well in general.
There are a few settings, however, in which the linearization variance is
substantially larger than the jackknife. This occurs because of the weight
trimming: the jackknife automatically accounts for the effect of weight
trimming on the variance because the jackknife replicates also trim the
weights. The linearization variance used in this simulation was from Theorem 5,
and the formula would need to be modified to include the effects of trimming.
We also ran simulations using the jackknife in which the mean was estimated
instead of the population total, and the jackknife performed well for that
parameter as well.
The second simulation study used a population of
6,019,599 household-level records from the ACS PUMS studied in Lohr, Hsu and
Montaquila (2015). There are 3,344 PSUs in the population defined by the public
use microdata areas. Eight poststrata were formed based on the cross-classification
of households by tenure (rent or own), presence of children in the household
(yes or no), and number of income earners (0-1 or 2+). The primary outcome
variable
was household income. Additionally, a less
skewed outcome variable
was studied, where
was set to 0 if
A
factorial design was used for this study with
factors
overall response rate: 50% or 80%.
number of PSUs for each sample: 25 or 100.
nonresponse generating mechanism: (N1) missing completely at random
(MCAR), with response propensity for all records equal to the response rate for
all households; (N2) missing at random (MAR), where a linear logistic model
with main effect terms for tenure, presence of children, and number of income
earners generates the response propensities; and (N3) missing not at random
(MNAR), where a linear logistic model with main effect terms for tenure,
presence of children, and household income generates the response propensities.
Description for Figure 4.4
This figures
shows two graphs to compare respectively the linearization and the Jackknife
variance to the MC variance for the propensity model simulation. is on the
x-axis ranging from 0 to 250,000 and the ratio of the variance estimates over
the MC variance is on the y-axis ranging from 0.9 to 1.6. Simulations results from and are shown.
Description for Figure 4.5
This figures
shows two graphs to illustrate the percentage of null hypothesis rejected for
both the linearization and the Jackknife variances for the propensity model
simulation. is on the
x-axis ranging from 0 to 250,000 and the percentage of null hypothesis rejected
is on the y-axis ranging from 0 to 100. Simulations results from and are shown.
For the first two nonresponse generating mechanisms,
For the first mechanism, there is no
nonresponse bias. Poststratification corrects for the bias in the second
mechanism because
for units in poststratum
Poststratification does not correct for the
bias in the third mechanism because the nonresponse depends on the
variable, household income.
For each simulation setting, response indicators were
generated independently for the population units using the calculated response
propensities. One thousand samples were drawn for each setting, in which PSUs
were selected with probability proportional to size and a simple random sample
of 100 households was selected from each sampled PSU. The standard error for
the rejection proportion when
is less than 0.007.
Calculations for the ACS simulation were done in SAS® software (SAS Institute, Inc. 2011). We first calculated the weights and
jackknife weights for the selected sample, and then calculated the poststratified
and jackknife poststratified weights for the respondents. The two sets of
jackknife weights used the same replication structure, so that replicate weight
for the respondents deleted the same PSU as
replicate weight
for the selected sample. To simplify
computation of
in (2.10), we concatenated the selected sample
and respondents, with their respective weights, into one data set and set
for records in the respondent data set and
for records in the selected sample data set.
The linear model
was fit to the concatenated data using the
SURVEYREG procedure, and
from the regression model.
Table 4.1 gives the results from the simulation. For all
but one of the simulation settings, the mean of the jackknife variance
estimates is larger than the Monte Carlo variance of
but the bias of the jackknife variance is
reduced when more PSUs are sampled or the response rate is higher. The outcome
variable
household income, is highly skewed, and the
rejection rate when
is closer to the nominal
of 0.05 when the log-transformed variable is
used.
Table 4.1
Simulation results from ACS population Table summary
This table displays the results of Simulation results from ACS population. The information is grouped by Nonresponse Mechanism (appearing as row headers), Response Rate (%), Number of PSUs and Outcome variable XXXX (appearing as column headers).
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.