2. Basic theory

Jae-kwang Kim, Seunghwan Park and Seo-young Kim

In this section, we introduce the basic theory for combining information in small area estimation, starting with the simple case of two surveys. Assume that there are two surveys, survey $A$ and survey $B,$ obtained from separate probability sampling designs. The two surveys are not necessarily independent. From survey $A,$ we obtain a design unbiased estimator ${\hat{X}}_{h} = \sum_{i \in A_{h}} w_{i a} x_{i}$ of $X_{h} = \sum_{i \in U_{h}} x_{i}$ and its variance estimator $\hat{V} ({\hat{X}}_{h}) .$ From survey $B,$ we obtain a design unbiased estimator ${\hat{Y}}_{1 h} = \sum_{i \in B_{h}} w_{i b} y_{1 i}$ of $Y_{1 h} = \sum_{i \in U_{h}} y_{1 i} .$ The sampling errors of $({\hat{X}}_{h}, {\hat{Y}}_{1 h})$ can be expressed by the sampling error model

$\begin{pmatrix} {\hat{X}}_{h} \\ {\hat{Y}}_{1 h} \end{pmatrix} = \begin{pmatrix} X_{h} \\ Y_{1 h} \end{pmatrix} + \begin{pmatrix} N_{h} a_{h} \\ N_{h} b_{h} \end{pmatrix}, \qquad (2.1)$

where $a_{h}$ and $b_{h}$ represent the sampling errors associated with ${\hat{X}}_{h} / N_{h}$ and ${\hat{Y}}_{1 h} / N_{h}$ such that

$\begin{pmatrix} a_{h} \\ b_{h} \end{pmatrix} \sim \left[ \begin{pmatrix} 0 \\ 0 \end{pmatrix}, \begin{pmatrix} V (a_{h}) & \mathrm{Cov} (a_{h}, b_{h}) \\ \mathrm{Cov} (a_{h}, b_{h}) & V (b_{h}) \end{pmatrix} \right] .$

Our parameter of interest is the population total $X_{h}$ of $x$ in area $h .$

From (1.1), we obtain the following area level model:

$Y_{1 h} = N_{h} β_{0} + β_{1} X_{h} + {\tilde{e}}_{1 h}, \qquad (2.2)$

where $(N_{h}, X_{h}, Y_{1 h}, {\tilde{e}}_{1 h}) = \sum_{i \in U_{h}} (1, x_{i}, y_{1 i}, e_{1 i}) .$ We can express (2.2) in terms of population means as

${\bar{Y}}_{1 h} = β_{0} + {\bar{X}}_{h} β_{1} + {\bar{e}}_{1 h}, \qquad (2.3)$

where $({\bar{X}}_{h}, {\bar{Y}}_{1 h}, {\bar{e}}_{1 h}) = N_{h}^{- 1} \sum_{i \in U_{h}} (x_{i}, y_{1 i}, e_{1 i}) .$ If we use a nested error model

$e_{1 h i} = ε_{h} + u_{h i}, \qquad (2.4)$

where $ε_{h} \sim (0, σ_{e}^{2})$ and $u_{h i} \sim (0, σ_{u}^{2}),$ then ${\bar{e}}_{1 h} \sim (0, σ_{e, h}^{2})$ with $σ_{e, h}^{2} = σ_{e}^{2} + σ_{u}^{2} / N_{h} .$ The nested error model is widely used in small area estimation (e.g., Battese, Harter and Fuller 1988), and it implies that $\mathrm{Cov} (e_{1 h i}, e_{1 h j}) = σ_{e}^{2}$ for $i \neq j .$ Because $N_{h}$ is often quite large, the term $σ_{u}^{2} / N_{h}$ is negligible and we can safely take $σ_{e, h}^{2} = σ_{e}^{2},$ that is, ${\bar{e}}_{1 h} \sim (0, σ_{e}^{2}) .$

Model (2.2) is called the structural error model because it describes the structural relationship between the two latent variables $Y_{1 h}$ and $X_{h} .$ The two models, (2.1) and (2.2), are often encountered in the measurement error model literature (Fuller 1987). Thus, the model for small area estimation can be viewed as a measurement error model, as suggested by Fuller (1991), who originally used the measurement error model approach in unit-level modeling for small area estimation.
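The variance of ${\bar{e}}_{1 h}$ stated above follows in one step. The sketch below assumes, as is usual for the nested error model although it is not spelled out here, that $ε_{h}$ and the $u_{h i}$ are mutually uncorrelated:

${\bar{e}}_{1 h} = N_{h}^{- 1} \sum_{i \in U_{h}} (ε_{h} + u_{h i}) = ε_{h} + N_{h}^{- 1} \sum_{i \in U_{h}} u_{h i}, \qquad V ({\bar{e}}_{1 h}) = σ_{e}^{2} + N_{h}^{- 2} N_{h} σ_{u}^{2} = σ_{e}^{2} + σ_{u}^{2} / N_{h} .$

For instance, with the purely illustrative values $σ_{e}^{2} = 1,$ $σ_{u}^{2} = 4$ and $N_{h} = 10,000,$ the second term equals $0.0004,$ which shows why it can be dropped when $N_{h}$ is large.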

Now, if we define $({\bar{y}}_{1 h}, {\bar{x}}_{h}) = N_{h}^{- 1} ({\hat{Y}}_{1 h}, {\hat{X}}_{h}),$ combining (2.1) and (2.3), we have

$\begin{pmatrix} {\bar{y}}_{1 h} \\ {\bar{x}}_{h} \end{pmatrix} = \begin{pmatrix} β_{0} & β_{1} \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 1 \\ {\bar{X}}_{h} \end{pmatrix} + \begin{pmatrix} b_{h} + {\bar{e}}_{1 h} \\ a_{h} \end{pmatrix},$

which can also be written as

$\begin{pmatrix} {\bar{y}}_{1 h} - β_{0} \\ {\bar{x}}_{h} \end{pmatrix} = \begin{pmatrix} β_{1} \\ 1 \end{pmatrix} {\bar{X}}_{h} + \begin{pmatrix} b_{h} + {\bar{e}}_{1 h} \\ a_{h} \end{pmatrix} . \qquad (2.5)$

Thus, when all the model parameters in (2.5) are known, the best estimator of ${\bar{X}}_{h}$ can be computed by

${\hat{\bar{X}}}_{h} = \{ (β_{1}, 1) V_{h}^{- 1} (β_{1}, 1)^{\prime} \}^{- 1} (β_{1}, 1) V_{h}^{- 1} ({\bar{y}}_{1 h} - β_{0}, {\bar{x}}_{h})^{\prime}, \qquad (2.6)$

where $V_{h}$ is the variance-covariance matrix of $(b_{h} + {\bar{e}}_{1 h}, a_{h})^{\prime} .$ The variance of ${\hat{\bar{X}}}_{h}$ is given by $\{ (β_{1}, 1) V_{h}^{- 1} (β_{1}, 1)^{\prime} \}^{- 1} .$ The estimator in (2.6) can be called the generalized least squares (GLS) estimator because it applies the generalized least squares method of linear model theory to (2.5). The GLS method is useful because it gives the best linear unbiased estimator under model (2.5) and it naturally incorporates additional sources of information. For example, if another estimator ${\bar{y}}_{2 h}$ of ${\bar{Y}}_{2 h}$ is also available and satisfies

${\bar{Y}}_{2 h} = γ_{0} + γ_{1} {\bar{X}}_{h} + {\bar{e}}_{2 h}$

and

${\bar{y}}_{2 h} = {\bar{Y}}_{2 h} + c_{h},$

then the extended GLS model is written as

$\begin{pmatrix} {\bar{y}}_{2 h} - γ_{0} \\ {\bar{y}}_{1 h} - β_{0} \\ {\bar{x}}_{h} \end{pmatrix} = \begin{pmatrix} γ_{1} \\ β_{1} \\ 1 \end{pmatrix} {\bar{X}}_{h} + \begin{pmatrix} c_{h} + {\bar{e}}_{2 h} \\ b_{h} + {\bar{e}}_{1 h} \\ a_{h} \end{pmatrix} \qquad (2.7)$

and the GLS estimator can be obtained by

${\hat{\bar{X}}}_{h 2} = \{ (γ_{1}, β_{1}, 1) V_{h 2}^{- 1} (γ_{1}, β_{1}, 1)^{\prime} \}^{- 1} (γ_{1}, β_{1}, 1) V_{h 2}^{- 1} ({\bar{y}}_{2 h} - γ_{0}, {\bar{y}}_{1 h} - β_{0}, {\bar{x}}_{h})^{\prime},$

where $V_{h 2}$ is the variance-covariance matrix of $(c_{h} + {\bar{e}}_{2 h}, b_{h} + {\bar{e}}_{1 h}, a_{h})^{\prime} .$ The GLS estimator has variance $\{ (γ_{1}, β_{1}, 1) V_{h 2}^{- 1} (γ_{1}, β_{1}, 1)^{\prime} \}^{- 1} .$ If ${\bar{y}}_{2 h}$ is independent of $({\bar{x}}_{h}, {\bar{y}}_{1 h}),$ the efficiency gain from incorporating ${\bar{y}}_{2 h}$ into the GLS estimator, measured as the relative change in variance, can be expressed as

$\frac{V ({\hat{\bar{X}}}_{h 2}) - V ({\hat{\bar{X}}}_{h})}{V ({\hat{\bar{X}}}_{h})} = - \frac{\{ V ({\bar{y}}_{2 h} / γ_{1}) \}^{- 1}}{\{ V ({\hat{\bar{X}}}_{h}) \}^{- 1} + \{ V ({\bar{y}}_{2 h} / γ_{1}) \}^{- 1}},$

where $V ({\bar{y}}_{2 h} / γ_{1}) = V (c_{h} + {\bar{e}}_{2 h}) / γ_{1}^{2} .$ The gain is high if both the sampling variance of ${\bar{y}}_{2 h}$ and the model variance $V ({\bar{e}}_{2 h})$ are small. If $γ_{1} = 0,$ then there is no gain.
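The GLS formulas above can be checked numerically. The following is a minimal sketch rather than the authors' implementation: the helper name `gls_estimate` is hypothetical, all parameter values and observations are made up for illustration, and the error of ${\bar{y}}_{2 h}$ is taken to be uncorrelated with the other errors so that $V_{h 2}$ is block diagonal.

```python
import numpy as np


def gls_estimate(z, d, V):
    """GLS estimate of a scalar mean X from z = d * X + error, error ~ (0, V).

    Returns the estimate {d' V^-1 d}^-1 d' V^-1 z and its variance {d' V^-1 d}^-1.
    """
    z = np.asarray(z, dtype=float)
    d = np.asarray(d, dtype=float)
    V = np.asarray(V, dtype=float)
    Vinv_d = np.linalg.solve(V, d)   # V^-1 d; V is symmetric, so d' V^-1 = (V^-1 d)'
    variance = 1.0 / (d @ Vinv_d)
    estimate = variance * (Vinv_d @ z)
    return estimate, variance


# Illustrative (made-up) parameters and area-level observations.
beta0, beta1 = 1.0, 2.0
gamma0, gamma1 = 0.5, 1.5
ybar1, xbar, ybar2 = 9.6, 4.1, 6.8

# V_h: variance-covariance matrix of (b_h + ebar_1h, a_h)'.
V_h = np.array([[0.5, 0.1],
                [0.1, 0.2]])
# Variance of c_h + ebar_2h, assumed uncorrelated with the other errors.
v_2 = 0.9

# Two-source GLS estimator, as in (2.6).
est_h, var_h = gls_estimate([ybar1 - beta0, xbar], [beta1, 1.0], V_h)

# Three-source GLS estimator from (2.7), with a block-diagonal V_h2.
V_h2 = np.zeros((3, 3))
V_h2[0, 0] = v_2
V_h2[1:, 1:] = V_h
est_h2, var_h2 = gls_estimate(
    [ybar2 - gamma0, ybar1 - beta0, xbar], [gamma1, beta1, 1.0], V_h2
)

# Relative change in variance: computed directly and from the closed form.
rel_change = (var_h2 - var_h) / var_h
v_y2 = v_2 / gamma1**2               # V(ybar_2h / gamma_1)
closed_form = -(1.0 / v_y2) / (1.0 / var_h + 1.0 / v_y2)

print(est_h, var_h, est_h2, var_h2, rel_change, closed_form)
```

Under these assumptions the directly computed relative change and the closed-form expression agree, and the three-source variance is smaller than the two-source variance. Solving the linear system with `np.linalg.solve` avoids forming $V^{- 1}$ explicitly, which is a numerical convenience and not part of the theory.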

Remark 1. Note that model (2.5) can also be written as

$\begin{pmatrix} β_{1}^{- 1} ({\bar{y}}_{1 h} - β_{0}) \\ {\bar{x}}_{h} \end{pmatrix} = \begin{pmatrix} 1 \\ 1 \end{pmatrix} {\bar{X}}_{h} + \begin{pmatrix} (b_{h} + {\bar{e}}_{1 h}) / β_{1} \\ a_{h} \end{pmatrix} . \qquad (2.8)$

The GLS estimator obtained from (2.8), which is the same as the GLS estimator obtained from (2.5), can be expressed as

${\hat{\bar{X}}}_{h} = α_{h} {\bar{x}}_{h} + (1 - α_{h}) {\tilde{x}}_{h}, \qquad (2.9)$

where ${\tilde{x}}_{h} = β_{1}^{- 1} ({\bar{y}}_{1 h} - β_{0})$ and

$\begin{aligned} α_{h} & = \frac{V ({\tilde{x}}_{h}) - \mathrm{Cov} ({\bar{x}}_{h}, {\tilde{x}}_{h})}{V ({\bar{x}}_{h}) + V ({\tilde{x}}_{h}) - 2 \mathrm{Cov} ({\bar{x}}_{h}, {\tilde{x}}_{h})} \\ & = \frac{σ_{e, h}^{2} + V (b_{h}) - β_{1} \mathrm{Cov} (a_{h}, b_{h})}{σ_{e, h}^{2} + V (b_{h}) + β_{1}^{2} V (a_{h}) - 2 β_{1} \mathrm{Cov} (a_{h}, b_{h})} . \end{aligned}$

The estimator ${\tilde{x}}_{h},$ when computed with the estimated parameter $\hat{β} = ({\hat{β}}_{0}, {\hat{β}}_{1}),$ is called the synthetic estimator, and the optimal estimator in (2.9) is often called the composite estimator. It can be shown that, ignoring the effect of estimating $β,$ the variance of the composite estimator is equal to

$V ({\hat{\bar{X}}}_{h} - {\bar{X}}_{h}) = α_{h} V ({\bar{x}}_{h}) + (1 - α_{h}) \mathrm{Cov} ({\bar{x}}_{h}, {\tilde{x}}_{h}) \qquad (2.10)$

and, as $α_{h} < 1,$ the composite estimator is more efficient than the direct estimator.
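The equivalence stated in Remark 1 can be illustrated numerically. This is a minimal sketch with made-up values, not the authors' code. It assumes that ${\bar{e}}_{1 h}$ is uncorrelated with the sampling errors $(a_{h}, b_{h}),$ which is what the second expression for $α_{h}$ implies, and that $β_{0}$ and $β_{1}$ are known.

```python
import numpy as np

# Illustrative (made-up) parameters, variance components and observations.
beta0, beta1 = 1.0, 2.0
var_a, var_b, cov_ab = 0.2, 0.3, 0.1   # V(a_h), V(b_h), Cov(a_h, b_h)
sigma2_eh = 0.2                        # sigma^2_{e,h}, model variance of ebar_1h
ybar1, xbar = 9.6, 4.1

# Synthetic-type estimator x_tilde = (ybar_1h - beta0) / beta1, with beta known.
x_tilde = (ybar1 - beta0) / beta1

# Moments of the direct estimator xbar and of x_tilde.
var_x = var_a
var_xt = (sigma2_eh + var_b) / beta1**2
cov_x_xt = cov_ab / beta1

# Weight alpha_h in its two equivalent forms.
alpha = (var_xt - cov_x_xt) / (var_x + var_xt - 2.0 * cov_x_xt)
alpha_alt = (sigma2_eh + var_b - beta1 * cov_ab) / (
    sigma2_eh + var_b + beta1**2 * var_a - 2.0 * beta1 * cov_ab
)

# Composite estimator (2.9) and its variance (2.10).
composite = alpha * xbar + (1.0 - alpha) * x_tilde
var_composite = alpha * var_x + (1.0 - alpha) * cov_x_xt

# GLS estimator computed directly from model (2.5), as in (2.6).
d = np.array([beta1, 1.0])
V = np.array([[sigma2_eh + var_b, cov_ab],
              [cov_ab, var_a]])
z = np.array([ybar1 - beta0, xbar])
Vinv_d = np.linalg.solve(V, d)
var_gls = 1.0 / (d @ Vinv_d)
gls = var_gls * (Vinv_d @ z)

print(alpha, composite, gls, var_composite, var_gls)
```

With these inputs the two forms of $α_{h}$ coincide, the composite estimator matches the GLS estimator, and the variance from (2.10) matches the GLS variance and is smaller than $V ({\bar{x}}_{h}) .$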

Date modified: 2015-11-27

Survey Methodology
