3. Reasons for a large interval
Paul Knottnerus
Previous | Next
In order to get
more insight into the difference between
and
we assume
and
hence,
Then subtracting (2.4) from (2.2)
yields
In other words,
is smaller than
when
provided
Assuming
Qualité and Tillé (2008) derive a
similar result for the parameter of absolute change when
An anonymous referee pointed out
that
is a sufficient condition for
because (3.1) can be rewritten as
provided that
If
is sufficiently large, a weaker
condition can be derived under some standard model assumptions. Suppose that
the data satisfy the model
with
and
recall
is not random in this context.
Under this model, we make the (weak) assumptions (i)
and (ii)
To justify these assumptions,
recall from regression theory that
can be seen as the unbiased,
consistent estimator for
from an ordinary least squares
(OLS) regression of
on
and a constant
Furthermore, the corresponding
OLS estimator
for the constant has zero expectation under the above model while its
variance is of order
Hence,
as
and provided
for all
we get the somewhat
counterintuitive result
In fact, it can be shown that
as
This justifies assumption (i);
for further details, see the end of this section. Furthermore,
can be seen as the (unexplained)
variance of the residuals from the OLS regression. However, under the above
model assumptions, these residuals are asymptotically equal to
from which the approximate validity of (ii) follows. In
addition, noting that
is the so-called explained variance of the above OLS
regression, it follows from assumption (i) that
Combining this with assumptions
(i) and (ii), we can rewrite (3.1) as
Hence,
is larger than
when
Thus for say
is under the above model for
sufficiently large
larger than
when
and for say
when
In addition, applying (3.2) to
the data in Example 2.1 with
and
yields as approximation for the
difference between both variances 0.0017 which is not very different from the
actual difference of 0.0016 (=0.00324-0.00166) in the example. For Example 2.2,
taking
and
applying (3.2) yields 0.00226
instead of 0.00212 (=0.00251-0.00039) in the example.
Under the above
assumptions, it can also be shown that the ratio, say
of
and
can be approximated by
irrespective of the values of
and
stands for
For a proof of (3.4), see Appendix
A.1. From (3.4) it can be seen that
and
tend to zero as
tends to unity, provided
is sufficiently large and
It should be noted
that in practice the correlations
often are rather high by the very
nature of the data
That is, a large (small)
enterprise in period
is in most cases still large
(small) after 12 months; Knottnerus and Van Delden (2012, page 47) found for
various strata an overall mean correlation of 0.90 and a variance of 0.0074. So
it appears that
is more affected by a decrease of
than
unless
is extremely low because (i)
when
and (ii)
is large when
is large. For example, when
and
a decrease of
from 0.9 to 0.5 leads to a
decrease of
from 0.58 to 0.37; recall
when
This emphasizes once more the
importance of avoiding panel attrition when using estimator
while
is large.
A natural question
that remains to be answered is when is
sufficiently large. To answer
this question, consider the difference
and its variance, say
The difference
can be written as
In the second line we assumed
and in the last line we used the
model assumption
Next, assuming
we get
This variance can be estimated by
where
and
is an estimate from the OLS
regression
units with
are omitted. Based on
one may call
sufficiently large if the outcome
of (3.1) will not severely be affected by replacing
by
In addition, it should be borne
in mind that relationships for very large
are probably still a reasonably
appropriate indication for what may occur when
is not very large.
Previous | Next