A method to find an efficient and robust sampling strategy under model uncertainty
Section 3. Robustness under a misspecified model
If the finite population is a realization of the superpopulation model (2.4), and if and were known, then an optimal strategy could be defined. In this section we study the robustness of this strategy when the model is misspecified.
We begin by defining how “misspecification” shall be understood in this paper. The working model reflects the beliefs the statistician has about the relation between the auxiliary variables and the study variable at the design stage. We shall assume that a true, unknown model exists. Any deviation of with respect to is a misspecification of the model. In order to keep the analysis tractable, we limit ourselves to the situation when the working model is of the form (2.4) and the true model, is
with
where is a vector of parameters, and as in (2.4) and The random sample and the errors are assumed to be independent.
Result 1. If is assumed when is the true superpopulation model, the model expected value of the design MSE in (2.2), under the difference estimator satisfying condition 1 above, becomes
The result is proven by noting that takes the role of in (2.5) and by taking into account that therefore and As the model is misspecified, we have deliberately avoided the use of the adjective “anticipated” in Result 1.
Using Result 1, it can be seen that for a design that satisfies condition 2 we obtain
It is now possible to see that, even under a mild misspecification as the one considered here, the strategy is not optimal anymore, as its MSE (3.3) can be greater than the MSE obtained under other designs (3.2). In particular, the strategy using the correct model, i.e., into the estimator and a design such that would be more efficient than
- Date modified: