Small area benchmarked estimation under the basic unit level model when the sampling rates are non‑negligible
Section 3. Benchmarked estimators
We now proceed to develop benchmarked estimators of the small area means $\bar{Y}_i$, $i = 1, \dots, m$, using the unit level model (2.2) or augmented versions of it. We assume that a reliable direct estimator $\hat{Y}_{GR}$ of the population total $Y$ is available, where $Y = \sum_{i=1}^{m} Y_i$ and $Y_i = N_i \bar{Y}_i$ is the total of small area $i$. Let $\hat{\bar{Y}}_i$ be the model-based small area estimator of $\bar{Y}_i$. It is desirable to ensure that the aggregated values of the $\hat{\bar{Y}}_i$ agree with the reliable estimator $\hat{Y}_{GR}$. The small area mean estimators $\hat{\bar{Y}}_i$ are said to be benchmarked to $\hat{Y}_{GR}$ if

$\sum_{i=1}^{m} N_i \hat{\bar{Y}}_i = \hat{Y}_{GR}$.  (3.1)
Let $\hat{Y}_{GR}$ be a GREG estimator with weights calibrated at the population level on a vector of auxiliary variables $\mathbf{z}$. This estimator is analogous to the combined regression estimator if one views the small areas as strata. The vector of auxiliary variables $\mathbf{z}$ may or may not be the same as the vector $\mathbf{x}$ of model (2.2). We distinguish two cases in this context: $\mathbf{x} \subseteq \mathbf{z}$ and $\mathbf{x} \not\subseteq \mathbf{z}$. The first case, $\mathbf{x} \subseteq \mathbf{z}$, implies that all the components of $\mathbf{x}$ also belong to $\mathbf{z}$, and that $\mathbf{z}$ may or may not have additional components that are different from those contained in $\mathbf{x}$. The second case, $\mathbf{x} \not\subseteq \mathbf{z}$, implies that some of the components of $\mathbf{x}$ do not appear in $\mathbf{z}$. We assume that the first component of both vectors $\mathbf{x}$ and $\mathbf{z}$ is equal to one, as it represents an intercept term.
For a given sample $s = \cup_{i=1}^{m} s_i$, auxiliary data $\mathbf{z}_{ij}$ and basic design weights $d_{ij}$, the GREG estimator of the population total $Y$ is given by $\hat{Y}_{GR} = \sum_{i=1}^{m} \sum_{j \in s_i} w_{ij} y_{ij}$, where the GREG weights $w_{ij}$ are given by

$w_{ij} = d_{ij} \{ 1 + \mathbf{z}_{ij}^{T} \hat{\mathbf{T}}^{-1} (\mathbf{Z} - \hat{\mathbf{Z}}) \}$, with $\hat{\mathbf{T}} = \sum_{i=1}^{m} \sum_{j \in s_i} d_{ij} \mathbf{z}_{ij} \mathbf{z}_{ij}^{T}$.  (3.2)

In equation (3.2), $\mathbf{Z} = \sum_{i=1}^{m} \mathbf{Z}_i$, where $\mathbf{Z}_i$ represents the known small area total of the auxiliary variables, whereas $\hat{\mathbf{Z}}$ and $\hat{Y}$ represent respectively the direct design-based Horvitz-Thompson estimators of $\mathbf{Z}$ and $Y$. Note that

$\sum_{i=1}^{m} \sum_{j \in s_i} w_{ij} \mathbf{z}_{ij} = \mathbf{Z}$.  (3.3)
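For illustration, the calibration property (3.3) can be checked numerically. The sketch below uses the textbook GREG weight form $w_{ij} = d_{ij}\{1 + \mathbf{z}_{ij}^{T} \hat{\mathbf{T}}^{-1}(\mathbf{Z} - \hat{\mathbf{Z}})\}$, which may differ in detail from the paper's equation (3.2); the sample size, design weights and population totals are simulated stand-ins.

```python
import numpy as np

rng = np.random.default_rng(7)

# Simulated stand-ins: z has an intercept column, d are basic design weights,
# and Z holds the (assumed known) population totals of the z-variables.
n = 50
z = np.column_stack([np.ones(n), rng.uniform(1.0, 10.0, n)])
d = np.full(n, 20.0)
y = 3.0 + 2.0 * z[:, 1] + rng.normal(0.0, 1.0, n)
Z = np.array([1000.0, 5600.0])

Z_ht = z.T @ d                         # Horvitz-Thompson estimate of Z
T = (z * d[:, None]).T @ z             # sum over j of d_j z_j z_j'
w = d * (1.0 + z @ np.linalg.solve(T, Z - Z_ht))   # GREG weights

# Calibration: the GREG weights reproduce the known totals exactly
print(w @ z)                           # equals Z up to rounding
y_greg = w @ y                         # GREG estimator of the total of y
```

Whatever the sampled values, the weights reproduce the known totals of the calibration variables exactly, which is the property used repeatedly in the benchmarking results below.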
Using the GREG weights $w_{ij}$, estimators of the small area means $\bar{Y}_i$ and of the auxiliary means $\bar{\mathbf{X}}_i$ are given by (3.4).
The small area estimates $\hat{\bar{Y}}_i^{\mathrm{EBLUP}}$ and $\hat{\bar{Y}}_i^{\mathrm{YR}}$, given respectively by (2.11) and (2.13), do not satisfy the benchmarking equation (3.1); that is, the total estimates $\sum_{i=1}^{m} N_i \hat{\bar{Y}}_i^{\mathrm{EBLUP}}$ and $\sum_{i=1}^{m} N_i \hat{\bar{Y}}_i^{\mathrm{YR}}$ do not match the GREG estimator $\hat{Y}_{GR}$. We need to adjust $\hat{\bar{Y}}_i^{\mathrm{EBLUP}}$ and $\hat{\bar{Y}}_i^{\mathrm{YR}}$ so that the modified small area estimators add up to $\hat{Y}_{GR}$ when they are summed over all the small areas.
A very simple modification of the $\hat{\bar{Y}}_i^{\mathrm{EBLUP}}$ and $\hat{\bar{Y}}_i^{\mathrm{YR}}$ is called ratio benchmarking. It consists of multiplying each $\hat{\bar{Y}}_i^{\mathrm{EBLUP}}$ and $\hat{\bar{Y}}_i^{\mathrm{YR}}$ by the common adjustment factors $\hat{Y}_{GR} / \sum_{i=1}^{m} N_i \hat{\bar{Y}}_i^{\mathrm{EBLUP}}$ and $\hat{Y}_{GR} / \sum_{i=1}^{m} N_i \hat{\bar{Y}}_i^{\mathrm{YR}}$ respectively, leading to the ratio benchmarked estimators $\hat{\bar{Y}}_{i,b}^{\mathrm{EBLUP}}$ and $\hat{\bar{Y}}_{i,b}^{\mathrm{YR}}$ of (3.5). It readily follows that both $\hat{\bar{Y}}_{i,b}^{\mathrm{EBLUP}}$ and $\hat{\bar{Y}}_{i,b}^{\mathrm{YR}}$ satisfy equation (3.1). In equation (3.5) and hereafter, the subscript $b$ denotes that the estimators are benchmarked to $\hat{Y}_{GR}$. Note that the $\hat{\bar{Y}}_i^{\mathrm{EBLUP}}$ and $\hat{\bar{Y}}_i^{\mathrm{YR}}$ in equation (3.5) are multiplied by the same factor regardless of their precision, ignoring particular small area characteristics such as the variability of the units within a small area or the small area sample size. Consequently, the resulting benchmarked estimators $\hat{\bar{Y}}_{i,b}^{\mathrm{EBLUP}}$ and $\hat{\bar{Y}}_{i,b}^{\mathrm{YR}}$ based on this simple procedure are just proportional modifications of the estimators $\hat{\bar{Y}}_i^{\mathrm{EBLUP}}$ and $\hat{\bar{Y}}_i^{\mathrm{YR}}$, respectively, made to obtain the desired concordance. This limitation can be avoided by using the small area model (2.2) to construct the benchmarked estimators.
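Ratio benchmarking amounts to a single rescaling of all area means. A minimal sketch, with all area sizes, mean estimates and the benchmark total simulated for illustration:

```python
import numpy as np

# Hypothetical inputs: model-based area mean estimates, area population
# sizes, and a reliable GREG estimate of the overall total.
ybar_hat = np.array([4.2, 5.1, 3.8, 6.0])   # model-based small area means
N = np.array([120, 80, 150, 100])           # small area population sizes
y_greg = 2300.0                             # benchmark total (GREG)

# Common ratio adjustment factor applied to every area mean
factor = y_greg / np.sum(N * ybar_hat)
ybar_bench = factor * ybar_hat

# The benchmarked estimates now satisfy equation (3.1)
print(np.sum(N * ybar_bench))               # equals y_greg
```

Because every area receives the same factor, the adjustment ignores area-specific precision, which is exactly the limitation noted above.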
We now proceed to show how model (2.2) can be used to obtain estimators benchmarked to $\hat{Y}_{GR}$. In Sections 3.1 and 3.2 we adapt the procedures in Stefan and Hidiroglou (2020) for obtaining benchmarked estimators to the case of non-negligible sampling rates. In Sections 3.3 and 3.4 we introduce two restricted benchmarked estimators based on the procedure proposed by Ugarte et al. (2009). The benchmarked estimators of Sections 3.1 and 3.2 rely on the assumption that $\mathbf{x} \subseteq \mathbf{z}$, whereas the estimators of Sections 3.3 and 3.4 can be computed for any vectors $\mathbf{x}$ or $\mathbf{z}$.
3.1 Augmented EBLUP benchmarked estimators
The GREG weights $w_{ij}$ should be used in the estimation to achieve benchmarking to $\hat{Y}_{GR}$. A possible way that $w_{ij}$ can be incorporated in the estimation is by augmenting the small area model (2.2) with a suitable auxiliary variable that is a function of $w_{ij}$. This procedure is based on the augmented model approach used by Wang et al. (2008), whereby estimates obtained using the FH area-level model could be forced to add up to specified totals. Stefan and Hidiroglou (2020) adapted the Wang et al. (2008) approach under the basic unit-level model and for negligible sampling rates. They showed that benchmarking to $\hat{Y}_{GR}$ could be obtained by augmenting model (2.2) with the GREG weights $w_{ij}$. We extend Stefan and Hidiroglou (2020) to the case when the sampling rates are non-negligible. For this case, benchmarking to $\hat{Y}_{GR}$ is achieved by augmenting model (2.2) with an auxiliary variable built from the GREG weights $w_{ij}$ that accounts for the non-negligible sampling fractions. This leads to the augmented model given by (3.6).
The random effects $v_i$ are assumed to be i.i.d. $N(0, \sigma_v^2)$ and independent of the unit errors $e_{ij}$, and the $e_{ij}$ are assumed to be i.i.d. $N(0, \sigma_e^2)$. The EBLUP estimators of $\boldsymbol{\beta}$ and $v_i$ in (3.6) are respectively denoted by $\hat{\boldsymbol{\beta}}_a$ and $\hat{v}_{i,a}$. We can now state Result 1 for $\hat{\boldsymbol{\beta}}_a$ and $\hat{v}_{i,a}$.
Result 1. The EBLUP estimators $\hat{\boldsymbol{\beta}}_a$ and $\hat{v}_{i,a}$ based on model (3.6) obey equation (3.7).

Proof: See Appendix A.
It follows from equation (3.7) that small area estimators benchmarked to $\hat{Y}_{GR}$ are given by (3.8). The subscript $a$ indicates that the estimator is based on an augmented small area model.
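The mechanics of fitting a random-intercept model whose fixed part has been augmented with an extra covariate can be sketched with Henderson's mixed model equations. Everything below (the design, the augmenting column `w_aug`, and the variance components, taken as known) is a simulated stand-in, not the paper's specification of model (3.6) or of equations (3.7) and (3.8).

```python
import numpy as np

rng = np.random.default_rng(1)

# Simulated data: m areas with n_i units each, one covariate plus a
# hypothetical weight-based augmenting variable in the fixed part.
m, n_i = 5, 8
n = m * n_i
groups = np.repeat(np.arange(m), n_i)
x1 = rng.uniform(0.0, 5.0, n)
w_aug = rng.uniform(0.5, 2.0, n)                  # stand-in augmenting variable
X = np.column_stack([np.ones(n), x1, w_aug])      # augmented fixed-effects design
Zmat = (groups[:, None] == np.arange(m)).astype(float)
sigma_v2, sigma_e2 = 1.0, 0.5                     # known variance components

v_true = rng.normal(0.0, np.sqrt(sigma_v2), m)
y = X @ np.array([2.0, 1.5, 0.3]) + Zmat @ v_true + rng.normal(0.0, np.sqrt(sigma_e2), n)

# Henderson's mixed model equations give the BLUE of beta and BLUP of v
k = sigma_e2 / sigma_v2
top = np.hstack([X.T @ X, X.T @ Zmat])
bot = np.hstack([Zmat.T @ X, Zmat.T @ Zmat + k * np.eye(m)])
rhs = np.concatenate([X.T @ y, Zmat.T @ y])
sol = np.linalg.solve(np.vstack([top, bot]), rhs)
beta_hat, v_hat = sol[:3], sol[3:]                # estimates of beta and v_i
```

In practice the variance components are unknown and must be estimated, which is where the EBLUP (and the positivity issue discussed in Sections 3.3 and 3.4) enters.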
3.2 You-Rao benchmarked estimators
The procedure proposed by You and Rao (2002) can be used with any survey weights. However, there is no guarantee that the resulting YR estimator will be benchmarked to $\hat{Y}_{GR}$. When the sampling rates are negligible, Stefan and Hidiroglou (2020) obtained benchmarked estimators with the You and Rao (2002) procedure based on the weights $w_{ij}$ of the GREG estimator. When the sampling rates are non-negligible, we now show that the weights $w_{ij}$ lead to YR benchmarked estimators.

Let $\hat{\boldsymbol{\beta}}_w$ and $\hat{v}_{i,w}$ be YR estimators of $\boldsymbol{\beta}$ and $v_i$, respectively, with the design weights $d_{ij}$ replaced by $w_{ij}$. Using $\hat{\boldsymbol{\beta}}_w$ and the estimates $\hat{v}_{i,w}$ for $v_i$, a YR estimator, denoted $\hat{\bar{Y}}_{i,w}^{\mathrm{YR}}$, can be computed with equation (2.13). However, $\hat{\bar{Y}}_{i,w}^{\mathrm{YR}}$ is not benchmarked to $\hat{Y}_{GR}$ even though it uses the weights $w_{ij}$: the original YR procedure leads to a self-benchmarked estimator in a limited number of cases only. To achieve the benchmark to $\hat{Y}_{GR}$, a modified YR estimator, denoted $\hat{\bar{Y}}_{i,bw}^{\mathrm{YR}}$, is defined by (3.9).
The following result proves that the estimator $\hat{\bar{Y}}_{i,bw}^{\mathrm{YR}}$ defined by (3.9) benchmarks to $\hat{Y}_{GR}$.

Result 2. Let $\hat{\boldsymbol{\beta}}_w$ and $\hat{v}_{i,w}$ be respectively the YR estimators of $\boldsymbol{\beta}$ and $v_i$ constructed with the weights $w_{ij}$. Then, the estimators $\hat{\bar{Y}}_{i,bw}^{\mathrm{YR}}$ satisfy the benchmarking equation (3.1).

Proof: See Appendix A.
The weights $w_{ij}$ are calibrated on $\mathbf{z}$ at the small area level if they satisfy the following equations:

$\sum_{j \in s_i} w_{ij} \mathbf{z}_{ij} = \mathbf{Z}_i$, $i = 1, \dots, m$.  (3.10)

Equation (3.10) implies equation (3.3); however, the reverse is not true. If the weights $w_{ij}$ satisfy (3.10), and since $\mathbf{x} \subseteq \mathbf{z}$, it follows that the weights $w_{ij}$ are also calibrated on $\mathbf{x}$ at the small area level. In turn, this implies that $\sum_{j \in s_i} w_{ij} = N_i$, as we assume that the vector $\mathbf{z}$ contains the constant regressor equal to 1. It follows that the YR estimator constructed with the weights $w_{ij}$ is self-benchmarked to $\hat{Y}_{GR}$ in the special case when the GREG weights are calibrated at the small area level (see You and Rao, 2002).
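The implication from area-level calibration (3.10) to the population-level property (3.3), and the recovery of the area counts from the intercept component, can be illustrated numerically. The sketch calibrates within each of a few simulated areas using the textbook GREG weight form; all sizes and totals are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(11)

m, n_i = 4, 30
N = np.array([300.0, 320.0, 340.0, 360.0])   # hypothetical area sizes
pop_Z = np.zeros(2)                          # accumulates the known totals Z
weighted_sums = np.zeros(2)                  # accumulates sum_j w_ij z_ij
ok_area, ok_count = [], []

for i in range(m):
    z = np.column_stack([np.ones(n_i), rng.uniform(1.0, 10.0, n_i)])
    d = np.full(n_i, N[i] / n_i)             # equal design weights in area i
    Z_i = np.array([N[i], 5.5 * N[i]])       # known area totals; first entry = N_i
    T = (z * d[:, None]).T @ z
    w = d * (1.0 + z @ np.linalg.solve(T, Z_i - z.T @ d))
    ok_area.append(np.allclose(w @ z, Z_i))  # equation (3.10) holds in area i
    ok_count.append(np.isclose(w.sum(), N[i]))  # intercept column gives N_i
    weighted_sums += w @ z
    pop_Z += Z_i

# Summing (3.10) over areas gives the population-level calibration (3.3)
print(all(ok_area), all(ok_count), np.allclose(weighted_sums, pop_Z))
```

The converse fails: weights calibrated only at the population level need not satisfy (3.10) in any individual area, which is why self-benchmarking of the YR estimator is a special case.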
3.3 Restricted EBLUP benchmarked estimator
In Section 2 we showed that the EBLUP estimators of $\boldsymbol{\beta}$ and $v_i$ can be obtained if the function $Q$ defined in (2.5) is minimized with respect to $\boldsymbol{\beta}$ and $v_1, \dots, v_m$. It therefore follows that an EBLUP estimator can be viewed as the solution to an unrestricted minimization problem. The idea of restricted EBLUP estimators is to obtain new estimators of $\boldsymbol{\beta}$ and $v_i$ by minimizing $Q$ subject to the restriction given by the benchmark condition. The procedure was used by Pfeffermann and Barnard (1991) under the FH area-level model. More recently, Ugarte et al. (2009) applied the procedure under the BHF unit-level model to obtain benchmarking to a synthetic estimator. Ugarte et al. (2009) described the restricted estimator as a generalized least squares estimator subject to a restriction, noting that the minimization can be conducted as in the econometric theory of regression estimation under linear constraints. We now describe the procedure of Ugarte et al. (2009).
We denote by $\tilde{\boldsymbol{\beta}}$ and $\tilde{v}_i$ the new restricted EBLUP estimators of $\boldsymbol{\beta}$ and $v_i$. Then, the restricted EBLUP estimator of $\bar{Y}_i$, denoted $\hat{\bar{Y}}_{i,r}^{\mathrm{EBLUP}}$, is given by equation (2.4), where $\hat{\boldsymbol{\beta}}$ and $\hat{v}_i$ are replaced by $\tilde{\boldsymbol{\beta}}$ and $\tilde{v}_i$, for $i = 1, \dots, m$. We impose that the estimators $\hat{\bar{Y}}_{i,r}^{\mathrm{EBLUP}}$ be benchmarked to $\hat{Y}_{GR}$; that is, they must satisfy equation (3.1). After carrying out some algebra, it can be shown that the benchmark to $\hat{Y}_{GR}$ of the estimators $\hat{\bar{Y}}_{i,r}^{\mathrm{EBLUP}}$ is equivalent to the linear constraint equation (3.11), where $Y_{i(r)}$ is the total of the non-observed values $y_{ij}$, $j \notin s_i$, and $\hat{Y}_{i(r)}$ is an estimator of $Y_{i(r)}$ based on $\tilde{\boldsymbol{\beta}}$ and $\tilde{v}_i$. The restricted EBLUP estimators $\tilde{\boldsymbol{\beta}}$ and $\tilde{v}_i$ are therefore obtained as the solution to the minimization of the function $Q$ given by (2.5) subject to the linear constraint (3.11).
The Lagrange multiplier method can be used to solve the constrained minimization of $Q$. After straightforward algebra, it can be shown that the estimators $\tilde{\boldsymbol{\beta}}$ and $\tilde{v}_i$ are given by (3.12), where $\hat{\boldsymbol{\beta}}$ and $\hat{v}_i$ are the (unconstrained) EBLUP estimators of $\boldsymbol{\beta}$ and $v_i$, and the covariance matrix appearing in (3.12) is the empirical version of the matrix defined in (2.7). Then, using $\tilde{\boldsymbol{\beta}}$ and $\tilde{v}_i$ in (2.4), the estimator $\hat{\bar{Y}}_{i,r}^{\mathrm{EBLUP}}$ can be rewritten as (3.13).
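The structure of such a Lagrange solution can be sketched in the standard constrained generalized least squares form used by Pfeffermann and Barnard (1991): the restricted estimator equals the unconstrained one plus a correction proportional to the constraint discrepancy. The names, dimensions and matrices below are hypothetical illustrations, not the paper's equation (3.12).

```python
import numpy as np

rng = np.random.default_rng(3)

p = 5
theta_hat = rng.normal(size=p)          # unconstrained estimates (hypothetical)
A = rng.normal(size=(p, p))
Omega = A @ A.T + np.eye(p)             # SPD matrix playing the role of a covariance
a = rng.normal(size=p)                  # constraint vector
c = 2.0                                 # benchmark value

# Lagrange-multiplier solution of:
#   min (theta - theta_hat)' Omega^{-1} (theta - theta_hat)  s.t.  a' theta = c
adj = Omega @ a / (a @ Omega @ a)
theta_tilde = theta_hat + adj * (c - a @ theta_hat)

print(a @ theta_tilde)                  # equals c: the constraint is satisfied
```

If the unconstrained estimates already satisfy the benchmark, the correction term vanishes and the restricted and unrestricted estimators coincide.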
Remark 2. The matrix inverse required in (3.12) does not exist for samples for which $\hat{\sigma}_v^2 = 0$. In such cases, we noticed that equation (2.8) cannot be used to compute the unconstrained estimators $\hat{\boldsymbol{\beta}}$ and $\hat{v}_i$. However, they can still be computed when $\hat{\sigma}_v^2 = 0$ because the alternative equation (2.9) can be used in that case. Equation (3.12) clearly shows that the constrained estimators $\tilde{\boldsymbol{\beta}}$ and $\tilde{v}_i$ cannot be computed for samples for which the estimator $\hat{\sigma}_v^2$ is truncated to zero, and no alternative equation exists in these cases.

It therefore follows that the methods of estimation for the variance components commonly used in SAE cannot be used to compute the restricted EBLUP estimator. In Section 3.4 and Appendix B we describe an alternative method that produces a strictly positive estimate of $\sigma_v^2$ and that can be applied in conjunction with the restricted procedure, such that a restricted benchmarked estimator of $\bar{Y}_i$ always exists.
3.4 Restricted You-Rao benchmarked estimator
We showed in Section 2.2 that YR estimators of $\boldsymbol{\beta}$ and $v_i$ can be obtained as the solution to mixed model equations derived by minimizing the sample weighted function $Q_w$ given by (2.14). That is, we showed that, by defining a function $Q_w$ with suitable weights and then minimizing it, we obtain the same estimators as those given by the You and Rao (2002) procedure. We now minimize the function $Q_w$ under the benchmark constraint given by (3.11). The result is a restricted YR estimator that is benchmarked to $\hat{Y}_{GR}$. Minimization of $Q_w$ given the benchmark restriction (3.11) results in estimators of $\boldsymbol{\beta}$ and $v_i$ that are guaranteed to be benchmarked for any weights that define the function $Q_w$. Thus, one may choose any set of weights in $Q_w$. In a limited design-based simulation study, we compared three restricted YR estimators based on three different choices of weights. We found no significant difference between these three estimators in terms of design mean squared error. Given this last point, and since the unrestricted benchmarked YR estimators described in Section 3.2 were based on the GREG weights $w_{ij}$, we chose to define the restricted YR estimator based on these weights.
Let $Q_w$ be defined in terms of the GREG weights $w_{ij}$. Minimization of $Q_w$ with respect to $\boldsymbol{\beta}$ and $v_i$, subject to the benchmark constraint (3.11), results in the restricted YR estimators of $\boldsymbol{\beta}$ and $v_i$, denoted $\tilde{\boldsymbol{\beta}}_w$ and $\tilde{v}_{i,w}$. They are given by (3.14), where the estimators $\hat{\boldsymbol{\beta}}_w$ and $\hat{v}_{i,w}$ are given by (2.15), and the covariance matrix appearing in (3.14) is the empirical version of the matrix given by (2.16). Using $\tilde{\boldsymbol{\beta}}_w$ and $\tilde{v}_{i,w}$, restricted YR estimates of the unobserved values $y_{ij}$, $j \notin s_i$, are then used to compute a benchmarked restricted YR estimator, denoted $\hat{\bar{Y}}_{i,rw}^{\mathrm{YR}}$.
As in the case of the restricted EBLUP estimator, the estimators given by (3.14) do not exist if FC, ML or REML estimation results in a truncated estimate of $\sigma_v^2$. Consequently, $\bar{Y}_i$ can only be estimated by the restricted estimators when the method of estimation for the variance components always leads to strictly positive estimates of $\sigma_v^2$. A null estimate of $\sigma_v^2$ poses no problem in computing EBLUP and YR estimators. However, we noticed that the restricted EBLUP and the restricted YR estimators cannot be computed if $\hat{\sigma}_v^2 = 0$. In order to get around this problem, we use a method proposed by Moghtased-Azar, Tehranchi and Amiri-Simkooei (2014) that guarantees that the estimator of $\sigma_v^2$ will be strictly positive. This method is based on the concept of re-parameterized restricted maximum likelihood estimation (reREML). Their idea is to use functions whose range is the set of all positive real numbers, namely positive-valued functions (PVFs), for the unknown variance components in the stochastic model, instead of using the variance components themselves. Their numerical results showed successful estimation of variance components as strictly positive values, as well as of covariance components (as negative or positive values).
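The PVF idea can be seen on a one-parameter toy problem: writing $\sigma^2 = \exp(\lambda)$ and Fisher-scoring on $\lambda$ yields an estimate that is strictly positive by construction. This is only an illustration of the re-parameterization, not the Appendix B algorithm for model (2.2).

```python
import numpy as np

rng = np.random.default_rng(5)

# Toy problem: estimate sigma^2 for x_i ~ N(0, sigma^2) via lambda = log(sigma^2),
# so exp(lambda) is strictly positive whatever value the iterations reach.
x = rng.normal(0.0, 1.5, size=200)
n, S = x.size, float(np.sum(x**2))

lam = 0.0                                       # starting value
for it in range(50):
    score = -n / 2 + 0.5 * np.exp(-lam) * S     # d log-likelihood / d lambda
    fisher = n / 2                              # expected information in lambda
    step = score / fisher
    lam += step                                 # Fisher-scoring update
    if abs(step) < 1e-10:
        break

sigma2 = np.exp(lam)                            # strictly positive estimate
print(sigma2, S / n)                            # scoring recovers the ML solution S/n
```

The estimate coincides with the usual ML solution whenever the latter is interior, but the exponential map can never return a null or negative variance, which is the property needed for the restricted estimators above.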
We used a Fisher-scoring algorithm to obtain iteratively the reREML estimates of the variance components of the basic unit-level model given by (2.2) (see Appendix B for details). We also carried out a small simulation and found that, for area sample sizes equal to or larger than 3, the Fisher-scoring algorithm converged in fewer than 15 iterations. When we considered only the samples that produced a null estimate $\hat{\sigma}_v^2 = 0$, we observed that the algorithm converged even faster (see Figure 4.1 in Section 4).