2 Consistency and asymptotic normality
Jun Shao, Eric Slud, Yang Cheng, Sheng Wang, and Carma Hogue
Previous | Next
To
consider asymptotics, we view the population as one of a
sequence of populations where the number
of units in increases to
infinity as This paper
treats only the case of strata in which a large sample is drawn; that
is, we assume that for each stratum the sample size depends on and increases to
infinity as but we omit the
index for simplicity.
All limiting processes are considered as Following
authors such as Isaki and Fuller (1982) and Deville and Särndal (1992), we term
this a superpopulation asymptotic framework. Under the design-based
framework considered in Section 2.1, the attribute vectors in the underlying
populations need not be viewed as random vectors. However, under the
model-assisted framework considered in Section 2.2, regression models are
assumed for attribute vectors.
Since
each estimator is a sum of independent estimators constructed within each
stratum, for simplicity we present asymptotic results for the case of The results and
conclusions immediately apply to the case of a fixed and can also be
extended to the situation where increases to
infinity. (It is typical for large-scale surveys to have many strata, although
the number of ASPEP government-by-type strata that were split into substrata
was somewhat less than 100.) Since we only consider we omit the
index for stratum in
this section, e.g., and Also, for the estimators and are defined by
the displayed formulas following equations (1.2) and (1.3), with subscript suppressed,
together with
Furthermore, for simplicity we consider asymptotics
only under with-replacement sampling. The results can be applied to the case of
without replacement sampling if the sampling fraction is negligible.
2.1 Design-based asymptotic framework
First,
we establish the asymptotic normality of and under repeated
sampling, that is, when and are fixed for and is a random PPS
sample.
Theorem 1 Suppose that and are independent PPS samples with replacement
from and respectively, where unit has probability of being selected, and sampling weight for and that the following four conditions hold,
as the population sequence index goes to
(C1) There exist constants and such that and
(C2) For
there exist constants and such that
exist, as do the limits
and in addition,
(C3) The limits
exist, where for
denotes the vector transpose, and is positive definite. The limit also exists, for
(C4) The elements of
form a bounded sequence, where for
Then, as the following conclusions hold.
(a) For
and where denotes convergence in probability.
(b) The combined-stratum estimator
has the exact expression
and the in-probability
limit
(c)
where denotes convergence in distribution, and
(d) For
where and
and is given in condition (C3).
The
conditions (C1)-(C4) of Theorem 1 provide a general
formulation of the superpopulation framework for large-sample design-based
statistical inference, within which the survey regression coefficients estimate
well-defined frame-population descriptive parameters. The results in parts
(a)-(b) show that the in-probability limits of have the
standard interpretation as superpopulation least-squares slopes and intercepts.
(These slope and intercept parameters also keep their usual model-based
interpretations under the model (2.7) introduced in Section 2.2.) The
asymptotic distribution theory for in conclusion
(c) allows us to deduce the large-sample behavior of from that
provided in (d) for
Under
the further conditions
it is clear from Theorem 1(b) that and in (2.2), so that and and are all asymptotically the same up to
remainders of smaller order than as we now show. Also, if then continues to be and the test of equality of slopes rejects, i.e., and therefore has the same asymptotic distribution as which is more efficient than according to the result in Section 2.2.
Theorem 2 Assume the same hypotheses (C1)-(C4) as in Theorem 1.
(a) When (2.3) holds, then as
and the estimators are all asymptotically normally distributed
and equivalent in the sense that
(b) When and
A
more refined study of the asymptotic behavior of the estimators can be
undertaken in the spirit of Saleh (2006), as with contiguous or Pitman
alternatives for non-survey statistical models, by assuming that for a constant Under this
assumption, it can be shown that and, therefore,
the three centered and scaled estimators and all have the
same asymptotic normal distribution with mean 0. Furthermore,
where is given in (2.4), and and are respectively the standard normal
percentage point and distribution function. Thus, has a limit different from 1. In particular,
the limit in (2.6) equals when (i.e.,
when ).
2.2 Model-assisted asymptotic setting
We
elaborate in this section the behavior of estimators under the
assumed probabilistic model that the triples in the finite
population, are independent
and identically distributed (iid), where the size-variables are used in
defining PPS with-replacement draw probabilities and where and follow the model
with and as unknown intercept and slope parameters for
the regression within stratum The errors are assumed to be iid with mean 0 and finite
variance and to be independent of and the variables for are assumed to have finite variance. Also, to
enable PPS sampling, we assume that with probability approaching 1 for large i.e.,
for large
In
this section, asymptotic properties of estimators are considered
with respect to the regression model and repeated sampling. By Theorem 1, the
model-assisted estimators and are still
consistent and asymptotically normal for triples iid within
strata, since the conditions (C1)-(C4) are satisfied under moment
assumptions on even if model
(2.7) is incorrect. However, the estimators are efficient
when model (2.7) is correct.
Theorem 3 Assume model (2.7) along with (C1), with and Then all conclusions in Theorem 1 and Theorem
2 still hold. In particular, when the asymptotic variance of is larger than the asymptotic variance of Furthermore,
where is the limit of
Note
that in (2.8) is
equal to 1 when and equal to when
According
to Theorem 3, under model (2.7), all three estimators defined in (1.2)-(1.4)
have the same asymptotic efficiency when and (condition
(2.3)). Furthermore, is asymptotically
worse than when Thus, why would
we not always use
The
assertions in Theorem 3 are first-order asymptotic results. A more refined,
second-order asymptotic result under the conditions in Theorem 3 and condition
(2.3) when the sizes are all equal is
that, up to a term of order
where mse is the mean squared error conditional on and
Result (2.9) indicates that, when weights are equal
and and the finite sample performance of may be better than that of for moderate and . See the simulation results in
Section 4. The proof of (2.9) is a special case of a more general result in
Slud (2012) and, thus, is omitted.
In
applications, we do not know whether Hence, the
decision-based estimator is an adaptive
procedure to select a good estimator. In view of (2.8), the performance of is close to
(slightly worse than) that of when and is close to
(slightly worse than) that of when and This is also
supported by the simulation results in Section 4.
Previous | Next