Browse by

3. Reasons for a large interval $I_{S T N}^{95}$

Paul Knottnerus

In order to get more insight into the difference between $var ({\hat{g}}_{O L P})$ and $var ({\hat{g}}_{S T N}),$ we assume $n_{12} = n_{23} = n$ and $G, S_{x y} > 0;$ hence, $λ = μ = n_{2} / n .$ Then subtracting (2.4) from (2.2) yields

$\begin{array}{l} var ({\hat{g}}_{S T N}) - var ({\hat{g}}_{O L P}) & \approx \frac{1}{{\bar{X}}^{2}} {2 G (\frac{1}{n_{2}} - \frac{λ}{n}) S_{x y} - (\frac{1}{n_{2}} - \frac{1}{n}) (S_{y}^{2} + G^{2} S_{x}^{2})} \\ = \frac{1}{λ n {\bar{X}}^{2}} {2 G (1 - λ^{2}) S_{x y} - (1 - λ) (S_{y}^{2} + G^{2} S_{x}^{2})} (3.1) \\ = \frac{1 - λ}{λ n {\bar{X}}^{2}} (2 G λ S_{x y} - S_{y - G x}^{2}) . \end{array}$

In other words, $var ({\hat{g}}_{O L P})$ is smaller than $var ({\hat{g}}_{S T N})$ when $λ > S_{y - G x}^{2} / 2 G S_{x y}$ provided $S_{x y} > 0.$ Assuming $S_{y}^{2} = S_{x}^{2},$ Qualité and Tillé (2008) derive a similar result for the parameter of absolute change when $λ > (1 - ρ_{x y}) / ρ_{x y} .$ An anonymous referee pointed out that $λ < (1 - ρ_{x y}) / ρ_{x y}$ is a sufficient condition for $var ({\hat{g}}_{O L P}) > var ({\hat{g}}_{S T N})$ because (3.1) can be rewritten as

$\frac{(1 - λ) G S_{x} S_{y}}{λ n {\bar{X}}^{2}} (2 λ ρ_{x y} + 2 ρ_{x y} - \frac{S_{y}^{2} + G^{2} S_{x}^{2}}{G S_{x} S_{y}}) \leq \frac{(1 - λ) G S_{x} S_{y}}{λ n {\bar{X}}^{2}} (2 λ ρ_{x y} + 2 ρ_{x y} - 2) < 0,$

provided that $λ < (1 - ρ_{x y}) / ρ_{x y} .$

If $N$ is sufficiently large, a weaker condition can be derived under some standard model assumptions. Suppose that the data satisfy the model $Y_{i} = B X_{i} + u_{i}$ with $E (u_{i}) = 0,$ $E (u_{i}^{2}) = σ_{}^{2} X_{i}^{δ}$ and $E (u_{i} u_{j}) = 0$ $(i \neq j);$ recall $X_{i}$ is not random in this context. Under this model, we make the (weak) assumptions (i) $G = S_{y x} / S_{x}^{2}$ and (ii) $S_{y - G x}^{2} = S_{y}^{2} (1 - ρ_{x y}^{2}) .$ To justify these assumptions, recall from regression theory that $\hat{B} = S_{y x} / S_{x}^{2}$ can be seen as the unbiased, consistent estimator for $B$ from an ordinary least squares (OLS) regression of $Y_{i}$ on $X_{i}$ and a constant $(i = 1, ..., N) .$ Furthermore, the corresponding OLS estimator $(\bar{Y} - \hat{B} \bar{X})$ for the constant has zero expectation under the above model while its variance is of order $1 / N .$ Hence, $0 = plim (\bar{Y} - \hat{B} \bar{X}) = plim {\bar{X} (G - \hat{B})}$ as $N \to \infty$ and provided $\bar{X} > c > 0$ for all $N,$ we get the somewhat counterintuitive result $plim (G - \hat{B}) = 0.$ In fact, it can be shown that

$G = \bar{Y} / \bar{X} = \hat{B} [1 + O_{p} (1 / \sqrt{N})] = (S_{y x} / S_{x}^{2}) [1 + O_{p} (1 / \sqrt{N})]$

as $N \to \infty .$ This justifies assumption (i); for further details, see the end of this section. Furthermore, $S_{y}^{2} (1 - ρ_{x y}^{2})$ can be seen as the (unexplained) variance of the residuals from the OLS regression. However, under the above model assumptions, these residuals are asymptotically equal to $Y_{i} - G X_{i}$ from which the approximate validity of (ii) follows. In addition, noting that $S_{y}^{2} ρ_{x y}^{2}$ is the so-called explained variance of the above OLS regression, it follows from assumption (i) that $S_{y}^{2} ρ_{x y}^{2} = {\hat{B}}^{2} S_{x}^{2} \approx G^{2} S_{x}^{2} .$ Combining this with assumptions (i) and (ii), we can rewrite (3.1) as

$\begin{matrix} var ({\hat{g}}_{S T N}) - var ({\hat{g}}_{O L P}) & \approx \frac{1 - λ}{λ n {\bar{X}}^{2}} {2 G^{2} λ S_{x}^{2} - (1 - ρ_{x y}^{2}) S_{y}^{2}} \\ \approx \frac{(1 - λ) S_{y}^{2}}{λ n {\bar{X}}^{2}} (2 λ ρ_{x y}^{2} - 1 + ρ_{x y}^{2}) (3.2) \\ = \frac{(1 - λ) S_{y}^{2}}{λ n {\bar{X}}^{2}} {ρ_{x y}^{2} (1 + 2 λ) - 1} . \end{matrix}$

Hence, $var ({\hat{g}}_{O L P})$ is larger than $var ({\hat{g}}_{S T N})$ when

$λ < (1 - ρ_{x y}^{2}) / 2 ρ_{x y}^{2} [> (1 - ρ_{x y}) / ρ_{x y}] . (3.3)$

Thus for say $ρ_{x y} = 0.9,$ $var ({\hat{g}}_{O L P})$ is under the above model for sufficiently large $N$ larger than $var ({\hat{g}}_{S T N})$ when $λ < 0.117,$ and for say $ρ_{x y} = 0.75$ when $λ < 0.389.$ In addition, applying (3.2) to the data in Example 2.1 with $λ \approx 57 / 73 = 0.78$ and $ρ_{x y} = 0.876$ yields as approximation for the difference between both variances 0.0017 which is not very different from the actual difference of 0.0016 (=0.00324-0.00166) in the example. For Example 2.2, taking $λ = 54 / 70 = 0.77$ and $ρ_{x y} = 0.970,$ applying (3.2) yields 0.00226 instead of 0.00212 (=0.00251-0.00039) in the example.

Under the above assumptions, it can also be shown that the ratio, say $Q,$ of $var ({\hat{g}}_{O L P})$ and $var ({\hat{g}}_{S T N})$ can be approximated by

$Q = \frac{var ({\hat{g}}_{O L P})}{var ({\hat{g}}_{S T N})} \approx (λ^{- 1} - f) {(1 - f + 2 (1 - λ) \frac{ρ_{x y}^{2}}{1 - ρ_{x y}^{2}})}^{- 1}, (3.4)$

irrespective of the values of $S_{y}^{2}$ and $S_{x}^{2};$ $f$ stands for $n / N .$ For a proof of (3.4), see Appendix A.1. From (3.4) it can be seen that $Q$ and $var ({\hat{g}}_{O L P})$ tend to zero as $ρ_{x y}^{2}$ tends to unity, provided $N$ is sufficiently large and $λ < 1.$

It should be noted that in practice the correlations $ρ_{x y}$ often are rather high by the very nature of the data $(Y_{i}, X_{i}) .$ That is, a large (small) enterprise in period $(t - 12)$ is in most cases still large (small) after 12 months; Knottnerus and Van Delden (2012, page 47) found for various strata an overall mean correlation of 0.90 and a variance of 0.0074. So it appears that $var ({\hat{g}}_{S T N})$ is more affected by a decrease of $λ$ than $var ({\hat{g}}_{O L P})$ unless $λ$ is extremely low because (i) $var ({\hat{g}}_{O L P}) = var ({\hat{g}}_{S T N})$ when $λ = 1$ and (ii) $Q$ is large when $ρ_{x y}^{2}$ is large. For example, when $ρ_{x y} = 0.9$ and $f = 0.1$ a decrease of $λ$ from 0.9 to 0.5 leads to a decrease of $Q$ from 0.58 to 0.37; recall $Q = 1$ when $λ = 1.$ This emphasizes once more the importance of avoiding panel attrition when using estimator ${\hat{g}}_{S T N}$ while $N$ is large.

A natural question that remains to be answered is when is $N$ sufficiently large. To answer this question, consider the difference $Δ \equiv \hat{B} - G$ and its variance, say $σ_{Δ}^{2} .$ The difference $Δ$ can be written as

$\begin{matrix} Δ & = \frac{S_{x y}^{}}{S_{x}^{2}} - \frac{\bar{Y}}{\bar{X}} = \frac{1}{N - 1} \sum_{i \in U} \frac{X_{i} - \bar{X}}{S_{x}^{2}} Y_{i} - \frac{1}{N} \sum_{i \in U} \frac{Y_{i}}{\bar{X}} \\ \approx \frac{1}{N} \sum_{i \in U} (\frac{X_{i} - \bar{X}}{S_{x}^{2}} - \frac{1}{\bar{X}}) Y_{i} \\ = \frac{1}{N} \sum_{i \in U} M_{i} U_{i} (M_{i} = \frac{X_{i} - \bar{X}}{S_{x}^{2}} - \frac{1}{\bar{X}}) . \end{matrix}$

In the second line we assumed $N > > 1$ and in the last line we used the model assumption $Y_{i} = B X_{i} + U_{i} .$ Next, assuming $var (U_{i}) = σ^{2} X_{i}^{δ},$ we get

$σ_{Δ}^{2} \equiv var (\hat{B} - G) = \frac{σ^{2}}{N^{2}} \sum_{i \in U} M_{i}^{2} X_{i}^{δ} .$

This variance can be estimated by

${\hat{σ}}_{Δ}^{2} = \frac{{\hat{σ}}^{2}}{N n_{2}} \sum_{i \in s_{2}} {\hat{m}}_{i}^{2} X_{i}^{\hat{δ}},$

where

${\hat{m}}_{i} = \frac{X_{i} - {\bar{x}}_{2}}{s_{x 2}^{2}} - \frac{1}{{\bar{x}}_{2}}, {\hat{σ}}^{2} = \frac{1}{n_{2} - 1} \sum_{i \in s_{2}} {(Y_{i} - \frac{{\bar{y}}_{2}}{{\bar{x}}_{2}} X_{i})}^{2} / X_{i}^{\hat{δ}}$

and $\hat{δ}$ is an estimate from the OLS regression

$\ln {(Y_{i} - \frac{{\bar{y}}_{2}}{{\bar{x}}_{2}} X_{i})}^{2} = α + δ \ln X_{i} + w_{i} (i = 1, ..., n_{2});$

units with $Y_{i} = {\bar{y}}_{2} X_{i} / {\bar{x}}_{2}$ are omitted. Based on ${\hat{σ}}_{Δ}^{2},$ one may call $N$ sufficiently large if the outcome of (3.1) will not severely be affected by replacing $G$ by $G + {\hat{σ}}_{Δ} .$ In addition, it should be borne in mind that relationships for very large $N$ are probably still a reasonably appropriate indication for what may occur when $N$ is not very large.

Previous | Next

Date modified:: 2017-09-20

Language selection

Search and menus

Search

Publications

Survey Methodology

Browse by

3. Reasons for a large interval $I_{S T N}^{95}$