Survey Methodology

2 Analysis of embedded K x L factorial experiments

Jan A. van den Brakel

2.1 Experimental designs embedded in probability samples

In a $K \times L$ factorial design, the effects of two factors are tested simultaneously. The first factor, denoted $A$, contains $K \geq 2$ levels. The second factor, denoted $B$, contains $L \geq 2$ levels. The purpose of the experiment is to test the main effects of the two factors and the interactions between both factors on the main parameter estimates of the ongoing survey. To this end a probability sample $s$ of size $n$ is drawn from a finite target population $U$ of size $N$ according to the sample design of the regular survey. This sample design can be generally complex, and is described by its first order inclusion probabilities $π_{i}$ for unit $i$ and second order inclusion probabilities $π_{i i'}$ for units $i$ and $i'$.

Subsequently, this sample is randomly divided into $K L$ subsamples according to a randomized experiment. In the case of a CRD, the sample $s$ of size $n$ is randomly divided into $K L$ subsamples $s_{k l}$ , each with a size of $n_{k l}$ sampling units. The sampling units of each subsample are assigned to one of the $K L$ treatment combinations. Under a CRD, $n_{+ +} = \sum_{k = 1}^{K} \sum_{l = 1}^{L} n_{k l}$ denotes the total number of sampling units in the sample $s$ . The probability that sampling unit i is assigned to subsample $s_{k l}$ , conditionally on the realization of $s$ , equals $n_{k l} / n_{+ +}$ . The unconditional probability that sampling unit i is selected in subsample $s_{k l}$ equals $π_{i}^{*} = π_{i} (n_{k l} / n_{+ +}) .$
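The CRD assignment and the resulting probabilities $π_{i}^{*}$ can be sketched in a few lines. This is a minimal illustration, not the author's implementation: the function name `assign_crd`, the sample of 12 units, the inclusion probability 0.01 and the subsample sizes are all hypothetical.

```python
import numpy as np

def assign_crd(pi, sizes, rng):
    """Randomly split a sample into K*L subsamples of fixed sizes (CRD).

    pi    : first-order inclusion probabilities pi_i of the n sampled units
    sizes : K x L array with the subsample sizes n_kl, summing to n
    Returns the flat cell index (k*L + l) of every unit and pi_i^*.
    """
    pi = np.asarray(pi, dtype=float)
    sizes = np.asarray(sizes, dtype=int).ravel()
    n = pi.size
    if sizes.sum() != n:
        raise ValueError("subsample sizes must sum to the sample size")
    # Build a label vector with n_kl copies of each cell index and shuffle it.
    cells = rng.permutation(np.repeat(np.arange(sizes.size), sizes))
    # Unconditional probability of being selected in the assigned subsample.
    pi_star = pi * sizes[cells] / n
    return cells, pi_star

rng = np.random.default_rng(0)        # fixed seed, for illustration only
pi = np.full(12, 0.01)                # hypothetical self-weighted sample
sizes = np.array([[6, 2], [2, 2]])    # larger control group in the first cell
cells, pi_star = assign_crd(pi, sizes, rng)
```

Units in the control cell get $π_{i}^{*} = 0.01 \times 6 / 12$, the others $0.01 \times 2 / 12$.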

The power of an experiment might be improved by using sampling structures such as strata, clusters or interviewers as block variables in an RBD, since restricted randomization removes the variance between the blocks from the analysis of the experiment (Fienberg and Tanur 1987, 1988). In the case of an RBD, the sampling units are deterministically grouped in $B$ more or less homogeneous blocks $s_{b}$ . Within each block, the sampling units are randomly assigned to one of the $K L$ treatment combinations. Let $n_{b k l}$ denote the number of sampling units in block $b$ assigned to treatment combination $k l$ , and $n_{b + +} = \sum_{k = 1}^{K} \sum_{l = 1}^{L} n_{b k l}$ the number of sampling units in block $b$ . The probability that sampling unit $i$ is assigned to subsample $s_{k l}$ , conditionally on the realization of $s$ and $i \in s_{b}$ , equals $n_{b k l} / n_{b + +}$ . The unconditional probability that sampling unit $i \in s_{b}$ is selected in subsample $s_{k l}$ equals $π_{i}^{*} = π_{i} (n_{b k l} / n_{b + +}) .$
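The same idea applies within blocks. The sketch below randomizes separately in each block and computes $π_{i}^{*} = π_{i} (n_{b k l} / n_{b + +})$; the function name `assign_rbd`, the block labels and all sizes are hypothetical.

```python
import numpy as np

def assign_rbd(pi, block, sizes_by_block, rng):
    """Randomize units to K*L treatment combinations within each block (RBD).

    pi             : first-order inclusion probabilities of the sampled units
    block          : block label of every sampled unit
    sizes_by_block : dict mapping block label -> K x L array of n_bkl
    """
    pi = np.asarray(pi, dtype=float)
    block = np.asarray(block)
    cells = np.empty(pi.size, dtype=int)
    pi_star = np.empty(pi.size)
    for b, sizes in sizes_by_block.items():
        idx = np.flatnonzero(block == b)          # units in block b
        sizes = np.asarray(sizes, dtype=int).ravel()
        if sizes.sum() != idx.size:
            raise ValueError(f"sizes for block {b} must sum to n_b++")
        labels = rng.permutation(np.repeat(np.arange(sizes.size), sizes))
        cells[idx] = labels
        # pi_i^* = pi_i * n_bkl / n_b++ for the assigned combination.
        pi_star[idx] = pi[idx] * sizes[labels] / idx.size
    return cells, pi_star

rng = np.random.default_rng(1)                    # fixed seed, illustration only
block = np.array([0] * 8 + [1] * 4)               # two hypothetical blocks
pi = np.full(12, 0.02)
sizes_by_block = {0: [[2, 2], [2, 2]], 1: [[1, 1], [1, 1]]}
cells, pi_star = assign_rbd(pi, block, sizes_by_block, rng)
```

Because every block assigns the same fraction (one quarter) to each combination, all units end up with the same $π_{i}^{*}$ in this example.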

In many practical applications one of the $K L$ subsamples is assigned to the regular survey and serves, besides being used to produce estimates for the regular publication, as the control group in the experiment. In such situations, the size of this subsample will be substantially larger than the other subsamples.

Many practical issues arise in the planning and design stage of embedded experiments. The field staff, for example, requires special attention, since an embedded experiment can have a large impact on the daily routine of data collection to which they are accustomed. See van den Brakel and Renssen (1998) and van den Brakel (2008) for more details about such design issues.

Although factorial designs are efficient from a statistical point of view, there might be strong practical arguments against a factorial set-up. The number of treatment combinations increases rapidly with the number of factors in full factorial designs, which might be difficult to implement in the data collection of a survey process. A general solution, known from standard experimental design theory, is to confound higher order interactions with blocks or to apply fractional factorial designs (Hinkelmann and Kempthorne (2005); Montgomery (2001)). These balanced designs, however, are generally hard to combine with the fieldwork restrictions encountered in the daily practice of survey sampling. In many applications the factors that changed in a survey redesign are therefore combined into one treatment. The total effect of these modifications is tested against the standard alternative in a two-treatment experiment. This implies that the effects of all factors in the experiment are confounded and cannot be separately estimated.

2.2 Testing hypotheses about finite population parameters

The purpose of embedded experiments is to test whether alternative survey implementations result in significantly different estimates for finite population parameters. Such differences are the result of non-sampling errors, like measurement errors and response bias. A measurement error model is required to link systematic differences between finite population parameters due to different survey implementations or treatments. Therefore the measurement error model for single-factor experiments proposed by van den Brakel and Renssen (2005) and van den Brakel (2008) is extended to factorial designs.

Let $y_{i q k l}$ denote the observation obtained from the $i^{t h}$ individual observed under the $k l^{t h}$ treatment combination and the $q^{t h}$ interviewer. It is assumed that the observations are a realization of the measurement error model

$y_{i q k l} = u_{i} + β_{k l} + γ_{q} + ε_{i k l} . (2.1)$

Here $u_{i}$ is the true intrinsic value of the $i^{t h}$ individual, $β_{k l}$ the effect of the $k l^{t h}$ treatment combination and $ε_{i k l}$ an error component. The model also allows for interviewer effects, i.e. $γ_{q} = ψ + ξ_{q}$ , where $ψ$ denotes a systematic interviewer bias and $ξ_{q}$ the random effect of the $q^{t h}$ interviewer, respectively. Let $E_{m}$ and ${cov}_{m}$ denote the expectation and the covariance with respect to the measurement error model. It is assumed that $E_{m} (ε_{i k l}) = 0$ , ${var}_{m} (ε_{i k l}) = σ_{i k l}^{2}$ , and that measurement errors between sampling units are independent. Furthermore it is assumed that $E_{m} (ξ_{q}) = 0$ , ${var}_{m} (ξ_{q}) = τ_{q}^{2}$ and that random interviewer effects between interviewers are independent. As a result the model allows for correlated response between sampling units that are interviewed by the same interviewer. The measurement error model allows for separate variances for measurement errors under different treatment combinations and separate variances for interviewers.

The treatment effects $β_{k l}$ can be interpreted as the bias in the estimated population parameter if the true intrinsic population value of $u$ is measured by means of the $k l^{t h}$ survey implementation. The treatment effect can be decomposed in the traditional way of an analysis of variance for a two-way layout:

$β_{k l} = u + A_{k} + B_{l} + A B_{k l}, (2.2)$

with $u$ the overall effect, $A_{k}$ and $B_{l}$ the main effects of treatment factors $A$ and $B$ and $A B_{k l}$ the interactions between treatment factors $A$ and $B$ . If the treatment effects are defined as fixed deviations from the individuals' intrinsic value $u_{i}$ , then the overall mean $u$ equals zero. In that case $A_{k}$ corresponds with the bias associated with the $k^{t h}$ level of factor $A$ averaged over all levels of factor $B$ , $B_{l}$ the bias associated with the $l^{t h}$ level of factor $B$ , averaged over all levels of factor $A$ , and $A B_{k l}$ the additional bias associated with the combination of the $k^{t h}$ level of factor $A$ and the $l^{t h}$ level of factor $B$ on top of $A_{k}$ and $B_{l} .$

The following restrictions are required to identify model (2.2):

$\sum_{k = 1}^{K} A_{k} = 0, \sum_{l = 1}^{L} B_{l} = 0, (2.3)$

and

$\sum_{k = 1}^{K} A B_{k l} = 0, l = 1, 2, \dots, L, \sum_{l = 1}^{L} A B_{k l} = 0, k = 1, 2, \dots, K . (2.4)$
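As an illustration of how restrictions (2.3) and (2.4) pin down the decomposition (2.2), the following sketch recovers $u$ , $A_{k}$ , $B_{l}$ and $A B_{k l}$ from a $K \times L$ table of treatment effects by row, column and overall averaging. The function name `decompose` and the numerical values of `beta` are made up for the example.

```python
import numpy as np

def decompose(beta):
    """Two-way decomposition beta_kl = u + A_k + B_l + AB_kl.

    Row, column and overall means give the unique solution that satisfies
    the sum-to-zero restrictions (2.3) and (2.4).
    """
    beta = np.asarray(beta, dtype=float)          # shape (K, L)
    u = beta.mean()                               # overall effect
    A = beta.mean(axis=1) - u                     # main effects of factor A
    B = beta.mean(axis=0) - u                     # main effects of factor B
    AB = beta - u - A[:, None] - B[None, :]       # interactions
    return u, A, B, AB

beta = np.array([[0.0, 1.0, 2.0],
                 [3.0, 5.0, 4.0]])                # hypothetical 2 x 3 effects
u, A, B, AB = decompose(beta)
```

The interactions sum to zero over each row and each column, which is exactly what (2.4) requires.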

For each sampling unit, a potential response variable is defined under each of the $K L$ treatment combinations. Therefore the measurement error model can be expressed in matrix notation as:

$y_{i q} = j_{K L} u_{i} + β + j_{K L} γ_{q} + ε_{i}, (2.5)$

where $y_{i q} = {(y_{i q 11}, ..., y_{i q k l}, ..., y_{i q K L})}^{t}$ , $β = {(β_{11}, ..., β_{k l}, ..., β_{K L})}^{t}$ , $j_{K L}$ a vector of order $K L$ with each element equal to one and $ε_{i} = {(ε_{i 11}, ..., ε_{i k l}, ..., ε_{i K L})}^{t}$ . The sampling units are assigned to one of the treatment combinations only, so only one of the responses of $y_{i q}$ is actually observed. The model assumptions specified above are stated as:

$E_{m} (ε_{i}) = 0, (2.6)$

${cov}_{m} (ε_{i}, ε_{i^{'}}) = \left\{ \begin{array}{lll} Σ_{i} & : & i = i^{'} \\ Ο & : & i \neq i^{'} \end{array} \right. , (2.7)$

$E_{m} (ξ_{q}) = 0, (2.8)$

${cov}_{m} (ξ_{q}, ξ_{q^{'}}) = \left\{ \begin{array}{lll} τ_{q}^{2} & : & q = q^{'} \\ 0 & : & q \neq q^{'} \end{array} \right. , (2.9)$

${cov}_{m} (ε_{i k l}, ξ_{q}) = 0, (2.10)$

where $0$ is a vector of order $K L$ with each element zero, $Σ_{i}$ a matrix of order $K L \times K L$ containing the variances of the measurement errors $σ_{i k l}^{2}$ , and $Ο$ a matrix of order $K L \times K L$ with each element zero.

Let $\bar{Y} = {({\bar{Y}}_{11}, ..., {\bar{Y}}_{1 L}, ..., {\bar{Y}}_{k l}, ..., {\bar{Y}}_{K 1}, ..., {\bar{Y}}_{K L})}^{t}$ denote the $K L$ dimensional vector of population means of $y_{i q}$ defined by (2.5). These are the values obtained under a complete enumeration of the finite population under each of the treatment combinations and are defined as:

$\bar{Y} = j_{K L} \frac{1}{N} \sum_{i = 1}^{N} u_{i} + β + j_{K L} ψ + j_{K L} \sum_{q = 1}^{Q} \frac{N_{q}}{N} ξ_{q} + \frac{1}{N} \sum_{i = 1}^{N} ε_{i}, (2.11)$

where $Q$ denotes the total number of interviewers available for the data collection and $N_{q}$ the number of units assigned to the $q^{t h}$ interviewer in the case of a complete enumeration.

Only systematic differences between the population parameters that are reflected by the treatment effects $β$ should lead to a rejection of the null hypotheses of no treatment effects. This is accomplished by formulating hypotheses about $\bar{Y}$ in expectation over the measurement error model, i.e.

$E_{m} \bar{Y} = j_{K L} \frac{1}{N} \sum_{i = 1}^{N} u_{i} + β + j_{K L} ψ . (2.12)$

Consequently, hypotheses about main effects and interactions are formulated as

$\begin{array}{l} H_{0} : C E_{m} \bar{Y} = 0, \\ H_{1} : C E_{m} \bar{Y} \neq 0, \end{array} \qquad (2.13)$

where $C$ denotes an appropriate contrast matrix, and $0$ a vector with elements equal to zero and a dimension that is equal to the number of contrasts (rows) defined by $C$ . The contrast matrix for the hypothesis about the main effects of factor $A$ is defined as

$C_{A} = \frac{1}{L} (j_{(K - 1)} | - I_{(K - 1)}) \otimes j_{L}^{t} \equiv \frac{1}{L} {\tilde{C}}_{A} \otimes j_{L}^{t}, (2.14)$

with $I_{(K - 1)}$ the identity matrix of order $K - 1$ . Matrix ${\tilde{C}}_{A}$ defines the $K - 1$ contrasts between the $K$ levels of factor $A$ , averaged over the $L$ levels of factor $B$ . From (2.12) and due to restrictions (2.3) and (2.4) it follows that the contrasts between the population parameters exactly correspond to the contrasts between the main effects of the first factor:

$C_{A} E_{m} \bar{Y} = C_{A} β = {(A_{1} - A_{2}, ..., A_{1} - A_{K})}^{t} .$

The contrast matrix for the hypothesis about the main effects of factor $B$ is defined as

$C_{B} = \frac{1}{K} j_{K}^{t} \otimes (j_{(L - 1)} | - I_{(L - 1)}) \equiv \frac{1}{K} j_{K}^{t} \otimes {\tilde{C}}_{B} . (2.15)$

This matrix defines the $L - 1$ contrasts between the $L$ levels of factor $B$ , averaged over the $K$ levels of factor $A$ . From (2.12) and due to restrictions (2.3) and (2.4) it follows that the contrasts between the population parameters exactly correspond to the contrasts between the main effects of the second factor:

$C_{B} E_{m} \bar{Y} = C_{B} β = {(B_{1} - B_{2}, ..., B_{1} - B_{L})}^{t} .$

The contrast matrices for the main effects use the first level of factors $A$ and $B$ as the reference category. This implies that treatment combination $A_{1} \times B_{1}$ is considered as the control group in the experiment.

Interactions between the two treatment factors are defined as the $L - 1$ contrasts of factor $B$ between the $K - 1$ contrasts of factor $A$ or, equivalently, as the $K - 1$ contrasts of factor $A$ between the $L - 1$ contrasts of factor $B$ (Hinkelmann and Kempthorne 1994, chapter 11). Therefore the contrast matrix for the hypothesis about the interactions between factor $A$ and $B$ can be defined as

$C_{A B} = (j_{(K - 1)} | - I_{(K - 1)}) \otimes (j_{(L - 1)} | - I_{(L - 1)}) = {\tilde{C}}_{A} \otimes {\tilde{C}}_{B} . (2.16)$

This matrix contains the $(K - 1) (L - 1)$ contrasts that define the interactions between factor $A$ and $B$ . The contrasts between the population parameters exactly correspond to the interactions between the first and the second factor, since

$C_{A B} E_{m} \bar{Y} = C_{A B} β = (A B_{11} - A B_{12} - A B_{21} + A B_{22}, ..., A B_{11} - A B_{1 L} - A B_{21} + A B_{2 L}, ..., A B_{11} - A B_{12} - A B_{K 1} + A B_{K 2}, ..., A B_{11} - A B_{1 L} - A B_{K 1} + A B_{K L})^{t} .$

Each element of this $(K - 1) (L - 1)$ vector defines one of the $(K - 1) (L - 1)$ interactions, which neatly corresponds to the contrasts between the interaction effects defined by (2.2). The first element, for example, can be interpreted as the deviation of the treatment effect of the particular combination of factor $A$ at level 2 and factor $B$ at level 2 from the two main effects of these factors.
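The three contrast matrices (2.14), (2.15) and (2.16) are Kronecker products and can be built directly. The sketch below assumes the elements of $β$ are ordered as $(β_{11}, ..., β_{1 L}, ..., β_{K 1}, ..., β_{K L})$ , as in (2.5); the function name `contrast_matrices` and the numerical effects are hypothetical.

```python
import numpy as np

def contrast_matrices(K, L):
    """Return C_A, C_B and C_AB with level 1 of each factor as reference."""
    Ct_A = np.hstack([np.ones((K - 1, 1)), -np.eye(K - 1)])   # (j | -I), K-1 x K
    Ct_B = np.hstack([np.ones((L - 1, 1)), -np.eye(L - 1)])   # (j | -I), L-1 x L
    C_A = np.kron(Ct_A, np.ones((1, L))) / L                  # (2.14)
    C_B = np.kron(np.ones((1, K)), Ct_B) / K                  # (2.15)
    C_AB = np.kron(Ct_A, Ct_B)                                # (2.16)
    return C_A, C_B, C_AB

# Hypothetical effects for K = 2, L = 3 that satisfy (2.3) and (2.4).
A = np.array([1.0, -1.0])
B = np.array([0.5, 0.0, -0.5])
AB = np.array([[0.2, -0.1, -0.1],
               [-0.2, 0.1, 0.1]])
beta = (A[:, None] + B[None, :] + AB).ravel()     # row-major: beta_11, ..., beta_KL

C_A, C_B, C_AB = contrast_matrices(2, 3)
main_A = C_A @ beta      # A_1 - A_2
main_B = C_B @ beta      # B_1 - B_2, B_1 - B_3
inter = C_AB @ beta      # AB_11 - AB_1l - AB_21 + AB_2l for l = 2, 3
```

With these numbers the contrasts are 2.0 for factor $A$ , (0.5, 1.0) for factor $B$ and (0.6, 0.6) for the interactions, so the main-effect contrasts are free of interaction terms as claimed.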

2.3 Wald test

The hypotheses specified in section 2.2 can be tested with a Wald test (Wald 1943), which is frequently applied in design-based testing procedures; see for example Skinner, Holt and Smith (1989) or Chambers and Skinner (2003). If $\hat{\bar{Y}}$ denotes a design-unbiased estimator for $\bar{Y}$ , $C$ the contrast matrix $C_{A}$ , $C_{B}$ , or $C_{A B}$ defined in (2.14), (2.15) and (2.16), and $cov (C \hat{\bar{Y}})$ the covariance matrix of the contrasts between $\hat{\bar{Y}}$ , then hypotheses can be tested with the Wald statistic $W = {\hat{\bar{Y}}}^{t} C^{t} {cov (C \hat{\bar{Y}})}^{- 1} C \hat{\bar{Y}}$ . The GREG estimators proposed by van den Brakel and Renssen (2005) and van den Brakel (2008) for single-factor experiments are extended to embedded factorial designs in this section. For notational convenience, the subscript $q$ will be omitted in $y_{i q k l}$ , since there is no need to sum explicitly over the interviewer subscript in most of the formulas developed in the rest of this paper.

To apply the model-assisted mode of inference to the analysis of embedded experiments, it is assumed for each unit in the population that the intrinsic value $u_{i}$ in measurement error model (2.5) is an independent realization of the following linear regression model:

$u_{i} = b^{t} x_{i} + e_{i}, (2.17)$

where $x_{i}$ is an H-vector with auxiliary information, $b$ an H-vector with the regression coefficients (not to be confused with the treatment effects $β$ ) and $e_{i}$ the residuals, which are independent random variables with variance $ω_{i}^{2}$ . It is required that all $ω_{i}^{2}$ are known up to a common scale factor, that is $ω_{i}^{2} = ω^{2} ν_{i}$ , with $ν_{i}$ known. The GREG estimator for ${\bar{Y}}_{k l}$ , based on the $n_{k l}$ observations of subsample $s_{k l}$ , is defined as (Särndal et al. 1992)

${\hat{\bar{Y}}}_{k l; g r e g} = {\hat{\bar{Y}}}_{k l} + {\hat{b}}_{k l}^{t} (\bar{X} - \hat{\bar{X}}), \quad k = 1, 2, \dots, K, \text{ and } l = 1, 2, \dots, L, (2.18)$

where,

${\hat{\bar{Y}}}_{k l} = \frac{1}{N} \sum_{i = 1}^{n_{k l}} \frac{y_{i k l}}{π_{i}^{*}}, (2.19)$

denotes the HT estimator for ${\bar{Y}}_{k l}$ , $\bar{X}$ the finite population means of the auxiliary variables $x$ , and $\hat{\bar{X}}$ the HT estimator for $\bar{X}$ based on the $n_{k l}$ sample units of subsample $s_{k l}$ . Furthermore,

${\hat{b}}_{k l} = {(\sum_{i = 1}^{n_{k l}} \frac{x_{i} x_{i}^{t}}{ω_{i}^{2} π_{i}^{*}})}^{- 1} \sum_{i = 1}^{n_{k l}} \frac{x_{i} y_{i k l}}{ω_{i}^{2} π_{i}^{*}}, (2.20)$

denotes the HT-type estimator for the regression coefficients in (2.17) based on the $n_{k l}$ sampling units in subsample $s_{k l}$ . In (2.19) and (2.20), $π_{i}^{*}$ are the first order inclusion probabilities for the sampling units in the $K L$ different subsamples, derived in subsection 2.1. Now ${\hat{\bar{Y}}}_{G R E G} = {({\hat{\bar{Y}}}_{11; g r e g}, ..., {\hat{\bar{Y}}}_{K L; g r e g})}^{t}$ is an approximately design-unbiased estimator for $\bar{Y}$ and also for $E_{m} \bar{Y}$ by definition.
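A compact sketch of (2.18) to (2.20) for a single subsample is given below. It is an illustration under stated assumptions rather than production code: the function name `greg_mean`, the argument layout and the toy data are hypothetical, and `pi_star` is taken to hold the $π_{i}^{*}$ of subsection 2.1.

```python
import numpy as np

def greg_mean(y, X, pi_star, omega2, X_bar, N):
    """GREG estimate of the population mean from one subsample s_kl.

    y       : observations y_ikl in the subsample
    X       : n_kl x H matrix of auxiliary variables x_i
    pi_star : inclusion probabilities pi_i^* of the subsample units
    omega2  : residual variances omega_i^2 (known up to a scale factor)
    X_bar   : H-vector of known population means of the auxiliaries
    N       : population size
    """
    y = np.asarray(y, dtype=float)
    X = np.asarray(X, dtype=float)
    w = 1.0 / np.asarray(pi_star, dtype=float)        # design weights
    y_ht = np.sum(w * y) / N                          # HT estimator (2.19)
    x_ht = (w[:, None] * X).sum(axis=0) / N           # HT estimator of X_bar
    v = w / np.asarray(omega2, dtype=float)
    # Weighted least squares solution, equation (2.20).
    b_hat = np.linalg.solve(X.T @ (v[:, None] * X), X.T @ (v * y))
    y_greg = y_ht + b_hat @ (np.asarray(X_bar, dtype=float) - x_ht)   # (2.18)
    return y_greg, b_hat

# Toy data: intercept-only weighting model, so X is a column of ones.
y = np.array([3.0, 5.0, 4.0, 8.0])
pi_star = np.array([0.01, 0.02, 0.01, 0.04])
X = np.ones((4, 1))
y_greg, b_hat = greg_mean(y, X, pi_star, np.ones(4), [1.0], N=1000)
```

With only an intercept in the model, the result equals the weighted mean $\sum y_{i k l} / π_{i}^{*}$ divided by $\sum 1 / π_{i}^{*}$ , regardless of the value passed for `N`.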

Under the null hypotheses that there are no treatment effects and no interactions, it follows that $b_{k l} = b_{k' l'}$ . In that case, it might be efficient to substitute for ${\hat{b}}_{k l}$ in the GREG estimator (2.18) the pooled estimator

$\hat{b} = {(\sum_{i = 1}^{n} \frac{x_{i} x_{i}^{t}}{ω_{i}^{2} π_{i}^{*}})}^{- 1} \sum_{k = 1}^{K} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{k l}} \frac{x_{i} y_{i k l}}{ω_{i}^{2} π_{i}^{*}} . (2.21)$

Since $H$ instead of $K L \times H$ regression coefficients have to be estimated, the pooled estimates of the regression coefficients $\hat{b}$ will be more precise, particularly in the case of small subsamples. Note, however, that many commonly used weighting schemes meet the condition that a constant vector $λ$ exists such that $ω_{i}^{2} = λ^{t} x_{i}$ for all $i \in U$ . In this situation the GREG estimator reduces to the simplified form ${\hat{\bar{Y}}}_{k l; g r e g} = {\hat{b}}_{k l}^{t} \bar{X}$ (Särndal et al. 1992, section 6.5). Under this simplified form, the treatment effects are completely included in the regression coefficients. In case of the pooled estimator (2.21), the $K L$ GREG estimators are exactly equal by definition, since ${\hat{\bar{Y}}}_{k l; g r e g} = {\hat{b}}^{t} \bar{X}$ for all $k$ and $l$ .
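The reduction to the simplified form can be sketched in two steps for the separate estimators ${\hat{b}}_{k l}$ , assuming a constant vector $λ$ with $λ^{t} x_{i} = ω_{i}^{2}$ . By construction, (2.20) solves the normal equations

$\sum_{i = 1}^{n_{k l}} \frac{x_{i} (y_{i k l} - x_{i}^{t} {\hat{b}}_{k l})}{ω_{i}^{2} π_{i}^{*}} = 0 .$

Premultiplying by $λ^{t}$ and using $λ^{t} x_{i} / ω_{i}^{2} = 1$ gives

$\sum_{i = 1}^{n_{k l}} \frac{y_{i k l} - x_{i}^{t} {\hat{b}}_{k l}}{π_{i}^{*}} = 0, \quad \text{so that} \quad {\hat{\bar{Y}}}_{k l} = {\hat{b}}_{k l}^{t} \hat{\bar{X}} .$

Substituting this into (2.18), the two HT terms cancel and only ${\hat{b}}_{k l}^{t} \bar{X}$ remains.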

An expression for the covariance matrix of the contrasts between the elements of ${\hat{\bar{Y}}}_{G R E G}$ where the covariance is taken over the sampling design, the experimental design and the measurement error model, is given by

$cov (C {\hat{\bar{Y}}}_{G R E G}) = E_{m} E_{s} C D C^{t}, (2.22)$

where $E_{s}$ denotes the expectation with respect to the sampling design, and $D$ a $K L \times K L$ diagonal matrix with diagonal elements

$d_{k l} = \frac{1}{n_{k l} (n_{+ +} - 1)} \sum_{i = 1}^{n_{+ +}} {(\frac{n_{+ +} (y_{i k l} - b_{k l}^{t} x_{i})}{N π_{i}} - \frac{1}{n_{+ +}} \sum_{i^{'} = 1}^{n_{+ +}} \frac{n_{+ +} (y_{i^{'} k l} - b_{k l}^{t} x_{i^{'}})}{N π_{i^{'}}})}^{2}, (2.23)$

in the case of a CRD and

$d_{k l} = \sum_{b = 1}^{B} \frac{1}{n_{b k l} (n_{b + +} - 1)} \sum_{i = 1}^{n_{b + +}} {(\frac{n_{b + +} (y_{i k l} - b_{k l}^{t} x_{i})}{N π_{i}} - \frac{1}{n_{b + +}} \sum_{i' = 1}^{n_{b + +}} \frac{n_{b + +} (y_{i' k l} - b_{k l}^{t} x_{i'})}{N π_{i'}})}^{2}, (2.24)$

in the case of an RBD. An estimator for $D$ can be derived from the experimental design, conditionally on the measurement error model and the sampling design. Therefore the covariance matrix (2.22) is conveniently stated implicitly as the expectation over the measurement error model and the sampling design. A design-based estimator for this covariance matrix is given by

$c \hat{o} v (C {\hat{\bar{Y}}}_{G R E G}) = C \hat{D} C^{t}, (2.25)$

with $\hat{D}$ a $K L \times K L$ diagonal matrix with elements

${\hat{d}}_{k l} = \frac{1}{n_{k l} (n_{k l} - 1)} \sum_{i = 1}^{n_{k l}} {(\frac{n_{+ +} (y_{i k l} - {\hat{b}}_{k l}^{t} x_{i})}{N π_{i}} - \frac{1}{n_{k l}} \sum_{i' = 1}^{n_{k l}} \frac{n_{+ +} (y_{i' k l} - {\hat{b}}_{k l}^{t} x_{i'})}{N π_{i'}})}^{2}, (2.26)$

in the case of a CRD and

${\hat{d}}_{k l} = \sum_{b = 1}^{B} \frac{1}{n_{b k l} (n_{b k l} - 1)} \sum_{i = 1}^{n_{b k l}} {(\frac{n_{b + +} (y_{i k l} - {\hat{b}}_{k l}^{t} x_{i})}{N π_{i}} - \frac{1}{n_{b k l}} \sum_{i' = 1}^{n_{b k l}} \frac{n_{b + +} (y_{i' k l} - {\hat{b}}_{k l}^{t} x_{i'})}{N π_{i'}})}^{2}, (2.27)$

in the case of an RBD. Proofs for (2.22) and (2.25) are given by van den Brakel (2010) and resemble the derivation of the covariance matrix for single factor experiments, given by van den Brakel and Renssen (2005) and van den Brakel (2008).
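For a CRD, the diagonal element (2.26) is the variance of the rescaled GREG residuals in subsample $s_{k l}$ divided by $n_{k l}$ . A minimal sketch follows; the function name `d_hat_crd` and the toy data are hypothetical. Note that the residuals are scaled with $π_{i}$ from the original sample design, not with $π_{i}^{*}$ .

```python
import numpy as np

def d_hat_crd(y, X, pi, b_hat, n_total, N):
    """Variance estimator (2.26) for one subsample s_kl under a CRD.

    y, X    : observations and auxiliary variables of the n_kl subsample units
    pi      : first-order inclusion probabilities pi_i of these units
    b_hat   : estimated regression coefficients for this subsample
    n_total : total sample size n_++
    N       : population size
    """
    y = np.asarray(y, dtype=float)
    X = np.asarray(X, dtype=float)
    pi = np.asarray(pi, dtype=float)
    # Rescaled residuals n_++ (y_i - b' x_i) / (N pi_i).
    r = n_total * (y - X @ np.asarray(b_hat, dtype=float)) / (N * pi)
    n_kl = r.size
    return np.sum((r - r.mean()) ** 2) / (n_kl * (n_kl - 1))

# Toy check: self-weighted design (pi_i = n_++ / N) and intercept-only model.
y = np.array([1.0, 2.0, 3.0, 4.0, 6.0])
X = np.ones((5, 1))
pi = np.full(5, 20 / 2000)
d = d_hat_crd(y, X, pi, b_hat=[y.mean()], n_total=20, N=2000)
```

In this self-weighted, intercept-only case the estimator reduces to the familiar $s^{2} / n_{k l}$ , with $s^{2}$ the sample variance of the subsample.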

The results for (2.22) and (2.25) are obtained under the condition that a constant H-vector $a$ exists such that $a^{t} x_{i} = 1$ for all $i \in U$ . This is a rather weak condition, since it implies that a weighting model is used that at least uses the size of the finite population as a priori information. See van den Brakel and Renssen (2005) or van den Brakel (2008) for a more detailed discussion.

Since the $K L$ subsamples are drawn without replacement from a finite population, there is a nonzero design covariance between elements of ${\hat{\bar{Y}}}_{G R E G}$ . From that point of view, it is remarkable that (2.25) has a structure as if the subsamples are drawn independently through sampling with replacement using unequal selection probabilities. This gives rise to an attractive variance estimation procedure for embedded experiments, since no design covariances between the subsample estimates appear in (2.25) and no second order inclusion probabilities are required in the variance estimators (2.26) and (2.27). This result is obtained since the covariance matrix of the contrasts between ${\hat{\bar{Y}}}_{G R E G}$ is derived instead of the covariance matrix of ${\hat{\bar{Y}}}_{G R E G}$ itself. A detailed interpretation of this result is given by van den Brakel and Renssen (2005) or van den Brakel (2008). See van den Brakel and Binder (2000) and Hidiroglou and Lavallée (2005) for approximations of the covariance matrix of ${\hat{\bar{Y}}}_{G R E G} .$

The design-based estimators ${\hat{\bar{Y}}}_{G R E G}$ and $c \hat{o} v (C {\hat{\bar{Y}}}_{G R E G})$ can be used to construct a design-based Wald statistic to test the hypotheses described in section 2.2:

$W = {\hat{\bar{Y}}}_{G R E G}^{t} C^{t} {(C \hat{D} C^{t})}^{- 1} C {\hat{\bar{Y}}}_{G R E G} . (2.28)$

Design-based inferences are generally based on normal large-sample approximations to construct confidence intervals for point estimates or p-values and critical regions for test statistics. Under this approach it follows under the null hypothesis that the Wald statistic is asymptotically distributed as a central chi-squared random variable, where the number of degrees of freedom equals the number of contrasts specified in the hypothesis.

The Wald statistics for the hypotheses about the main effects and interactions are given by (2.28) using the contrast matrix $C_{A}$ , $C_{B}$ , or $C_{A B}$ . Under the null hypothesis, it follows that $W \to χ_{[K - 1]}^{2}$ for the test about the main effects of factor $A$ , $W \to χ_{[L - 1]}^{2}$ for the test about the main effects of factor $B$ and $W \to χ_{[(K - 1) (L - 1)]}^{2}$ for the test about interactions, where $χ_{[p]}^{2}$ denotes a central chi-squared distributed random variable with $p$ degrees of freedom.
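Given the subsample estimates and the diagonal of $\hat{D}$ , the statistic (2.28) is a single quadratic form. The sketch below uses a hypothetical function name `wald` and made-up numbers for a $2 \times 2$ design; the resulting value would then be compared with a chi-squared critical value with as many degrees of freedom as $C$ has rows.

```python
import numpy as np

def wald(y_greg, d_hat, C):
    """Wald statistic (2.28): (C y)' (C D C')^{-1} (C y) with D diagonal."""
    y_greg = np.asarray(y_greg, dtype=float)
    C = np.asarray(C, dtype=float)
    Cy = C @ y_greg                                   # estimated contrasts
    V = C @ np.diag(np.asarray(d_hat, dtype=float)) @ C.T
    # Solve V z = Cy instead of inverting V explicitly.
    return float(Cy @ np.linalg.solve(V, Cy))

# Hypothetical 2 x 2 design: estimates ordered as (11, 12, 21, 22).
y_greg = [10.0, 10.0, 12.0, 12.0]
d_hat = [1.0, 1.0, 1.0, 1.0]
C_A = np.array([[0.5, 0.5, -0.5, -0.5]])              # (2.14) with K = L = 2
W = wald(y_greg, d_hat, C_A)                          # equals 4.0
```

Here $W = 4.0$ with one degree of freedom, which exceeds the 5% critical value of about 3.84, so the hypothesis of no main effect of factor $A$ would be rejected in this toy case.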

The Wald test for the main effects can be further simplified. Expressions are developed for the Wald test for the main effects for factor $A$ . Similar expressions can be derived for the main effects of factor $B$ . Denote

${\hat{\bar{Y}}}_{A; G R E G} = {({\hat{\bar{Y}}}_{1.; g r e g}, ..., {\hat{\bar{Y}}}_{K .; g r e g})}^{t}, \text{ with } {\hat{\bar{Y}}}_{k .; g r e g} = \frac{1}{L} \sum_{l = 1}^{L} {\hat{\bar{Y}}}_{k l; g r e g},$

${\hat{D}}_{A} = Diag ({\hat{d}}_{1.}, \dots, {\hat{d}}_{K .}), \text{ with } {\hat{d}}_{k .} = \frac{1}{L^{2}} \sum_{l = 1}^{L} {\hat{d}}_{k l} . (2.29)$

It follows that $C_{A} {\hat{\bar{Y}}}_{G R E G} = {\tilde{C}}_{A} {\hat{\bar{Y}}}_{A; G R E G}$ and $C_{A} \hat{D} C_{A}^{t} = {\tilde{C}}_{A} {\hat{D}}_{A} {\tilde{C}}_{A}^{t}$ . With the matrix inversion lemma, the Wald statistic for the main effects of factor $A$ can be simplified to:

$\begin{aligned} W &= {\hat{\bar{Y}}}_{A; G R E G}^{t} {\tilde{C}}_{A}^{t} {({\tilde{C}}_{A} {\hat{D}}_{A} {\tilde{C}}_{A}^{t})}^{- 1} {\tilde{C}}_{A} {\hat{\bar{Y}}}_{A; G R E G} \\ &= {\hat{\bar{Y}}}_{A; G R E G}^{t} ({\hat{D}}_{A}^{- 1} - \frac{1}{Trace ({\hat{D}}_{A}^{- 1})} {\hat{D}}_{A}^{- 1} j_{K} j_{K}^{t} {\hat{D}}_{A}^{- 1}) {\hat{\bar{Y}}}_{A; G R E G} \qquad (2.30) \\ &= \sum_{k = 1}^{K} \frac{{\hat{\bar{Y}}}_{k .; g r e g}^{2}}{{\hat{d}}_{k .}} - {(\sum_{k = 1}^{K} \frac{1}{{\hat{d}}_{k .}})}^{- 1} {(\sum_{k = 1}^{K} \frac{{\hat{\bar{Y}}}_{k .; g r e g}}{{\hat{d}}_{k .}})}^{2} . \end{aligned}$
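The equivalence of the matrix form and the scalar form can be checked numerically. In the scalar form the second term is the square of the weighted sum $\sum_{k} {\hat{\bar{Y}}}_{k .; g r e g} / {\hat{d}}_{k .}$ , which is what $j_{K}^{t} {\hat{D}}_{A}^{- 1} {\hat{\bar{Y}}}_{A; G R E G}$ evaluates to. The numbers below are arbitrary and the variable names are hypothetical.

```python
import numpy as np

K = 3
y_A = np.array([10.0, 11.0, 13.0])        # hypothetical level means of factor A
d_A = np.array([0.5, 0.4, 0.8])           # hypothetical diagonal of D_A

# Matrix form: first line of (2.30) with C~_A = (j | -I).
Ct_A = np.hstack([np.ones((K - 1, 1)), -np.eye(K - 1)])
Cy = Ct_A @ y_A
V = Ct_A @ np.diag(d_A) @ Ct_A.T
W_matrix = float(Cy @ np.linalg.solve(V, Cy))

# Scalar form: last line of (2.30).
W_scalar = float(np.sum(y_A ** 2 / d_A)
                 - np.sum(y_A / d_A) ** 2 / np.sum(1.0 / d_A))
```

Both expressions give the same value, so the main-effect test needs no matrix inversion once the ${\hat{d}}_{k .}$ are available.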

Finally note that the HT estimator (2.19) does not meet the condition that a constant H-vector $a$ exists such that $a^{t} x_{i} = 1$ for all $i \in U$ . The minimum use of auxiliary information in the GREG estimator is obtained with a weighting scheme that only uses the size of the finite population as a priori knowledge, i.e. $x_{i} = 1$ and $ω_{i}^{2} = ω^{2}$ (Särndal et al. 1992, section 7.4). Under this weighting scheme it follows that

${\hat{\bar{Y}}}_{k l; g r e g} = {(\sum_{i = 1}^{n_{k l}} \frac{1}{π_{i}^{*}})}^{- 1} (\sum_{i = 1}^{n_{k l}} \frac{y_{i k l}}{π_{i}^{*}}) \equiv {\tilde{y}}_{k l}, (2.31)$

and ${\hat{b}}_{k l} = {\tilde{y}}_{k l}$ . Expression (2.31) can be recognized as Hájek's ratio estimator for a population mean (Hájek 1971). This weighting scheme satisfies the condition that a constant H-vector $a$ exists such that $a^{t} x_{i} = 1$ for all $i \in U$ . Therefore an approximately design-unbiased estimator for the covariance matrix of the contrasts between subsample estimates is given by (2.26) and (2.27) for a CRD and an RBD respectively, where ${\hat{b}}_{k l}^{t} x_{i} = {\tilde{y}}_{k l}$ . Estimator (2.31) is preferable to the HT estimator (2.19), since (2.31) is more stable and the covariance matrix of the contrasts between (2.31) always has the relatively simple form of (2.25).

2.4 Special cases

It will be shown for two special cases that the design-based Wald statistic is equal to the F-test of a standard analysis of variance. Therefore, an ANOVA-type pooled variance estimator for the diagonal elements of $\hat{D}$ should be considered as an alternative for (2.26) or (2.27). Such a pooled variance estimator for a CRD is given by

${\hat{d}}_{k l}^{p} = \frac{1}{n_{k l} (n_{+ +} - K L)} \sum_{k' = 1}^{K} \sum_{l' = 1}^{L} \sum_{i = 1}^{n_{k' l'}} {(\frac{n_{+ +} (y_{i k' l'} - {\hat{b}}_{k' l'}^{t} x_{i})}{N π_{i}} - \frac{1}{n_{k' l'}} \sum_{i' = 1}^{n_{k' l'}} \frac{n_{+ +} (y_{i' k' l'} - {\hat{b}}_{k' l'}^{t} x_{i'})}{N π_{i'}})}^{2}, (2.32)$

and for an RBD by

${\hat{d}}_{k l}^{p} = \sum_{b = 1}^{B} \frac{1}{n_{b k l} (n_{b + +} - K L)} \sum_{k' = 1}^{K} \sum_{l' = 1}^{L} \sum_{i = 1}^{n_{b k' l'}} {(\frac{n_{b + +} (y_{i k' l'} - {\hat{b}}_{k' l'}^{t} x_{i})}{N π_{i}} - \frac{1}{n_{b k' l'}} \sum_{i' = 1}^{n_{b k' l'}} \frac{n_{b + +} (y_{i' k' l'} - {\hat{b}}_{k' l'}^{t} x_{i'})}{N π_{i'}})}^{2} . (2.33)$

Now consider a CRD that is embedded in a self-weighted sample, i.e. $π_{i} = n_{+ +} / N$ , with equally sized subsamples, i.e. $n_{k l} = n_{k' l'} = n_{s}$ . The inclusion probabilities for all units in the $K L$ subsamples are given by $π_{i}^{*} = n_{s} / N$ . Let ${\bar{y}}_{k l} = (1 / n_{s}) \sum_{i = 1}^{n_{s}} y_{i k l}$ . Under Hájek's ratio estimator (2.31) and the pooled variance estimator (2.32) it follows that ${\hat{\bar{Y}}}_{k l; g r e g} = {\bar{y}}_{k l}$ , ${\hat{b}}_{k l} = {\bar{y}}_{k l}$ , and

${\hat{d}}_{k l}^{p} = \frac{1}{n_{s} (n_{+ +} - K L)} \sum_{k^{'} = 1}^{K} \sum_{l^{'} = 1}^{L} \sum_{i = 1}^{n_{s}} {(y_{i k^{'} l^{'}} - {\bar{y}}_{k^{'} l^{'}})}^{2} \equiv \frac{{\hat{S}}_{p; CRD}^{2}}{n_{s}} .$

The parameter estimates of the $K$ levels of factor $A$ averaged over the $L$ levels of factor $B$ are denoted as

${\bar{y}}_{k .} = \frac{1}{L} \sum_{l = 1}^{L} {\bar{y}}_{k l} = \frac{1}{n_{k +}} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{s}} y_{i k l}, k = 1, \dots, K, (2.34)$

with $n_{k +} = \sum_{l = 1}^{L} n_{k l}$ . The diagonal elements of ${\hat{D}}_{A}$ are now given by

${\hat{d}}_{k .}^{p} = \frac{1}{L^{2}} \sum_{l = 1}^{L} {\hat{d}}_{k l}^{p} = \frac{1}{L^{2}} \sum_{l = 1}^{L} \frac{{\hat{S}}_{p; CRD}^{2}}{n_{s}} = \frac{{\hat{S}}_{p; CRD}^{2}}{n_{k +}}, k = 1, \dots, K . (2.35)$

Let ${\bar{y}}_{..} = (1 / n_{+ +}) \sum_{k = 1}^{K} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{s}} y_{i k l}$ . Inserting (2.34) and (2.35) into (2.30), gives rise to the following expression for the Wald statistic of the main effects of factor $A$

$W = \frac{1}{{\hat{S}}_{p; C R D}^{2}} (\sum_{k = 1}^{K} n_{k +} {\bar{y}}_{k .}^{2} - n_{+ +} {\bar{y}}_{..}^{2}) . (2.36)$

Note that $W / (K - 1)$ in (2.36) corresponds with the F-statistic for the main effects of an analysis of variance for the two-way layout with interactions (Scheffé 1959, chapter 4). Under the null hypothesis and the assumption of normally and independently distributed errors, the F-statistic in the two-way layout follows an F-distribution with $(K - 1)$ and $(n_{+ +} - K L)$ degrees of freedom, which is denoted as $F_{[n_{+ +} - K L]}^{[K - 1]}$ . If $n_{+ +} \to \infty$ , then $F_{[n_{+ +} - K L]}^{[K - 1]} \to χ_{[K - 1]}^{2} / (K - 1)$ . Consequently the F-statistic and the Wald statistic have the same limit distribution.
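The correspondence between (2.36) and the classical F-statistic can be verified on simulated data. The sketch below assumes a balanced design stored as an array of shape $(K, L, n_{s})$ ; the function names and the simulated values are hypothetical.

```python
import numpy as np

def wald_main_A(y):
    """Wald statistic (2.36) for factor A; y has shape (K, L, n_s)."""
    K, L, n_s = y.shape
    n_tot = K * L * n_s                                  # n_++
    cell_means = y.mean(axis=2, keepdims=True)
    # Pooled within-cell variance S^2_{p;CRD} with n_++ - KL degrees of freedom.
    S2 = np.sum((y - cell_means) ** 2) / (n_tot - K * L)
    ybar_k = y.mean(axis=(1, 2))                         # level means of factor A
    n_k = L * n_s                                        # n_k+
    return (n_k * np.sum(ybar_k ** 2) - n_tot * y.mean() ** 2) / S2

def anova_F_A(y):
    """Classical F-statistic for the main effect of A in a two-way layout."""
    K, L, n_s = y.shape
    ss_A = L * n_s * np.sum((y.mean(axis=(1, 2)) - y.mean()) ** 2)
    ss_E = np.sum((y - y.mean(axis=2, keepdims=True)) ** 2)
    return (ss_A / (K - 1)) / (ss_E / (K * L * (n_s - 1)))

rng = np.random.default_rng(42)                          # illustration only
y = rng.normal(loc=5.0, scale=2.0, size=(2, 3, 5))       # K = 2, L = 3, n_s = 5
W = wald_main_A(y)
F = anova_F_A(y)
```

For any balanced data set of this shape, `W / (K - 1)` and `F` agree up to floating point error.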

Now consider an RBD that is embedded in a self-weighted sampling design with equal subsample sizes, thus $π_{i} = n_{+ + +} / N$ and $n_{k l} = n_{k' l'} = n_{s}$ , with $n_{+ + +} = \sum_{b = 1}^{B} n_{b + +}$ . Let ${\bar{y}}_{b k l} = (1 / n_{b k l}) \sum_{i = 1}^{n_{b k l}} y_{i k l}$ . Furthermore, it is assumed that the fraction of sampling units assigned to each treatment combination within each block is equal, i.e. $n_{b k l} / n_{b + +} = n_{s} / n_{+ + +}$ , and that the block sizes are sufficiently large to assume that $n_{b + +} / (n_{b + +} - K L) \approx 1$ . Under Hájek's ratio estimator (2.31) and the pooled variance estimator (2.33) it follows that ${\hat{\bar{Y}}}_{k l; g r e g} = {\bar{y}}_{k l}$ , ${\hat{b}}_{k l} = {\bar{y}}_{k l}$ , and

$\begin{aligned} {\hat{d}}_{k l}^{p} &= \sum_{b = 1}^{B} \frac{1}{n_{b k l} (n_{b + +} - K L)} {(\frac{n_{b + +}}{n_{+ + +}})}^{2} \sum_{k' = 1}^{K} \sum_{l' = 1}^{L} \sum_{i = 1}^{n_{b k' l'}} {(y_{i k' l'} - {\bar{y}}_{b k' l'})}^{2} \\ &\approx \frac{1}{n_{s} n_{+ + +}} \sum_{b = 1}^{B} \sum_{k' = 1}^{K} \sum_{l' = 1}^{L} \sum_{i = 1}^{n_{b k' l'}} {(y_{i k' l'} - {\bar{y}}_{b k' l'})}^{2} \equiv \frac{{\hat{S}}_{p; R B D}^{2}}{n_{s}} . \end{aligned}$

The parameter estimates of the $K$ levels of factor $A$ averaged over the $L$ levels of factor $B$ and the blocks are denoted as

${\bar{y}}_{. k .} = \frac{1}{L} \sum_{l = 1}^{L} {\bar{y}}_{k l} = \frac{1}{n_{+ k +}} \sum_{b = 1}^{B} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{b k l}} y_{i k l}, k = 1, \dots, K, (2.37)$

where $n_{+ k +} = \sum_{b = 1}^{B} \sum_{l = 1}^{L} n_{b k l}$ . The diagonal elements of ${\hat{D}}_{A}$ are given by

${\hat{d}}_{k .}^{p} = \frac{1}{L^{2}} \sum_{l = 1}^{L} {\hat{d}}_{k l}^{p} = \frac{{\hat{S}}_{p; RBD}^{2}}{n_{+ k +}}, k = 1, \dots, K . (2.38)$

Let ${\bar{y}}_{...} = (1 / n_{+ + +}) \sum_{b = 1}^{B} \sum_{k = 1}^{K} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{b k l}} y_{i k l}$ . If these results are inserted into (2.30), then the expression for the Wald statistic of the main effects of factor $A$ can be simplified to

$W = \frac{1}{{\hat{S}}_{p; RBD}^{2}} (\sum_{k = 1}^{K} n_{+ k +} {\bar{y}}_{. k .}^{2} - n_{+ + +} {\bar{y}}_{...}^{2}) . (2.39)$

It can be recognized that $W / (K - 1)$ in (2.39) corresponds with the F-statistic for the main effects of an analysis of variance for the three-way layout with interactions (Scheffé 1959, chapter 4). As in the case of a CRD, the Wald statistic and the F-statistic have the same limit distribution.

Date modified: 2017-09-20
