Variance estimation under monotone non-response for a panel survey
Section 7. Conclusion

Table of contents

In this paper, we considered variance estimation accounting for weighting adjustments in panel surveys. We proposed both an approximately unbiased variance estimator and a simplified variance estimator for estimators of totals, complex parameters and measures of change, which covers most cases that may be encountered in practice. Our simulation results indicate that the proposed variance estimator performs well in all cases considered. The simplified variance estimator tends to overestimate the variance of the expansion estimator for totals, and to overestimate the variance for calibrated estimators of totals when the calibration variables lack of explanatory power for the variable of interest. However, the simplified variance estimator performs well for the estimation of ratios and change in totals with calibrated weights, even if the calibration model is not appropriate for the study variable.

The assumption of independent response behaviour is usually not tenable for multi-stage surveys, since units within clusters tend to be correlated with respect to the response behaviour. In this context, estimation of response probabilities based upon conditional logistic regression in the context of correlated responses has been studied by Skinner and D’Arrigo (2011), see also Kim, Kwon and Park (2016). Extending the present work in the context of correlated response behaviour is a challenging problem for further research.

Acknowledgements

We thank the Editors, an Associate Editor and the referees for useful comments and suggestions which led to an improvement of the paper.

Appendix

Estimation of the variance due to non-response for Response Homogeneity Groups

We consider the model of Response Homogeneity Groups introduced in Section 2.5. Recall that this model may be summarized as follows: at each time $δ =1, \dots, t,$ the sub-sample $s_{δ - 1}$ is partitioned into $C (δ - 1)$ groups $s_{δ - 1}^{c}, c =1, \dots, C (δ - 1) .$ The response probabilities are assumed to be constant within the groups.

This model is equivalent to the logistic regression model in (2.18), with

$z_{i}^{δ} = {[1 {i \in s_{δ - 1}^{1}}, \dots, 1 {i \in s_{δ - 1}^{C (δ - 1)}}]}^{⊤} . (A .1)$

The equation (2.2) leads to the estimated response probabilities

${\hat{p}}_{i}^{δ} = \frac{\sum_{i \in s_{δ - 1}^{c}} k_{i}^{δ} r_{i}^{δ}}{\sum_{i \in s_{δ - 1}^{c}} k_{i}^{δ}} for i \in s_{δ - 1}^{c} . (A .2)$

We first consider the case when the reweighted estimator is computed at time $t =1.$ In the estimator of the variance due to non-response given in (2.21), the vector ${\hat{γ}}_{1}^{1}$ simplifies as

${\hat{γ}}_{1}^{1} = {(\frac{\sum_{i \in s_{1} \cap s_{0}^{1}} \frac{y_{i 1}}{π_{i}}}{{\hat{p}}_{1}^{1} \sum_{i \in s_{1} \cap s_{0}^{1}} k_{i}^{1}}, \dots, \frac{\sum_{i \in s_{1} \cap s_{0}^{C (0)}} \frac{y_{i 1}}{π_{i}}}{{\hat{p}}_{C (0)}^{1} \sum_{i \in s_{1} \cap s_{0}^{C (0)}} k_{i}^{1}})}^{⊤} . (A .3)$

After some algebra, the variance estimator in (2.21) may be rewritten as

${\hat{V}}_{1}^{nr} ({\hat{Y}}_{1}) = \sum_{c =1}^{C (0)} \frac{(1 - {\hat{p}}_{c}^{1})}{{({\hat{p}}_{c}^{1})}^{2}} \sum_{i \in s_{1} \cap s_{0}^{c}} {(\frac{y_{i 1}}{π_{i}} - k_{i}^{1} \frac{\sum_{j \in s_{1} \cap s_{0}^{c}} \frac{y_{j 1}}{π_{j}}}{\sum_{j \in s_{1} \cap s_{0}^{c}} k_{j}^{1}})}^{2} . (A .4)$

We now consider the case when the reweighted estimator is computed at time $t =2.$ We focus on the simpler case when the same system of RHGs is kept over time. In the estimator of the variance due to non-response given in (2.22), the vectors ${\hat{γ}}_{2}^{1}$ and ${\hat{γ}}_{2}^{2}$ simplify as

${\hat{γ}}_{2}^{1} = {(\frac{\sum_{i \in s_{2} \cap s_{1}^{1}} \frac{y_{i 2}}{π_{i}}}{{\hat{p}}_{1}^{1} \sum_{i \in s_{2} \cap s_{1}^{1}} k_{i}^{1}}, \dots, \frac{\sum_{i \in s_{2} \cap s_{1}^{C (0)}} \frac{y_{i 2}}{π_{i}}}{{\hat{p}}_{C (0)}^{1} \sum_{i \in s_{2} \cap s_{1}^{C (0)}} k_{i}^{1}})}^{⊤} , (A .5)$

${\hat{γ}}_{2}^{2} = {(\frac{\sum_{i \in s_{2} \cap s_{1}^{1}} \frac{y_{i 2}}{π_{i}}}{{\hat{p}}_{1}^{1} {\hat{p}}_{1}^{2} \sum_{i \in s_{2} \cap s_{1}^{1}} k_{i}^{2}}, \dots, \frac{\sum_{i \in s_{2} \cap s_{1}^{C (0)}} \frac{y_{i 2}}{π_{i}}}{{\hat{p}}_{C (0)}^{1} {\hat{p}}_{C (0)}^{2} \sum_{i \in s_{2} \cap s_{1}^{C (0)}} k_{i}^{2}})}^{⊤} . (A .6)$

After some algebra, the variance estimator in (2.22) may be rewritten as

$\begin{array}{l} {\hat{V}}_{2}^{nr} ({\hat{Y}}_{2}) & = \sum_{c =1}^{C (0)} \frac{(1 - {\hat{p}}_{c}^{1})}{{\hat{p}}_{c}^{2}} \sum_{i \in s_{2} \cap s_{1}^{c}} {(\frac{y_{i 2}}{π_{i} {\hat{p}}_{c}^{1}} - k_{i}^{1} \frac{\sum_{j \in s_{2} \cap s_{1}^{c}} \frac{y_{j 2}}{π_{j}}}{\sum_{j \in s_{2} \cap s_{1}^{c}} k_{j}^{1}})}^{2} \\ + \sum_{c =1}^{C (0)} (1 - {\hat{p}}_{c}^{2}) \sum_{i \in s_{2} \cap s_{1}^{c}} {(\frac{y_{i 2}}{π_{i} {\hat{p}}_{c}^{1} {\hat{p}}_{c}^{2}} - k_{i}^{2} \frac{\sum_{j \in s_{2} \cap s_{1}^{c}} \frac{y_{j 2}}{π_{j}}}{\sum_{j \in s_{2} \cap s_{1}^{c}} k_{j}^{2}})}^{2} . (A .7) \end{array}$

If we further assume that $k_{i}^{δ}$ is constant over times $δ =1, 2,$ and may thus be rewritten as $k_{i},$ the expression in (A.7) simplifies as

${\hat{V}}_{2}^{nr} ({\hat{Y}}_{2}) = \sum_{c =1}^{C (0)} \frac{(1 - {\hat{p}}_{c}^{1 \to 2})}{{({\hat{p}}_{c}^{1 \to 2})}^{2}} \sum_{i \in s_{2} \cap s_{1}^{c}} {(\frac{y_{i 2}}{π_{i}} - k_{i} \frac{\sum_{j \in s_{2} \cap s_{1}^{c}} \frac{y_{j 2}}{π_{j}}}{\sum_{j \in s_{2} \cap s_{1}^{c}} k_{j}})}^{2} . (A .8)$

with ${\hat{p}}_{c}^{1 \to 2} = \prod_{δ =1}^{2} {\hat{p}}_{c}^{δ}$ for $c =1, \dots, C (0) .$ This simplification of the variance estimator can be extended to the reweighted estimator at time $t .$ Assuming that the RHGs are kept over time, and that $k_{i}^{δ} = k_{i}$ for any $δ =1, \dots, t,$ the variance estimator in (2.12) may be written as

${\hat{V}}_{t}^{nr} ({\hat{Y}}_{t}) = \sum_{c =1}^{C (0)} \frac{(1 - {\hat{p}}_{c}^{1 \to t})}{{({\hat{p}}_{c}^{1 \to t})}^{2}} \sum_{i \in s_{t} \cap s_{t - 1}^{c}} {(\frac{y_{i t}}{π_{i}} - k_{i} \frac{\sum_{j \in s_{t} \cap s_{t - 1}^{c}} \frac{y_{j t}}{π_{j}}}{\sum_{j \in s_{t} \cap s_{t - 1}^{c}} k_{j}})}^{2} (A .9)$

with ${\hat{p}}_{c}^{1 \to t} = \prod_{δ =1}^{t} {\hat{p}}_{c}^{δ}$ for $c =1, \dots, C (0) .$

References

Beaumont, J.-F. (2005). Calibrated imputation in surveys under a quasimodel-assisted approach. Journal of the Royal Statistical Society, Series B, 67, 445-458.

Beaumont, J.-F., and Haziza, D. (2016). A note on the concept of invariance in two-phase sampling designs. Survey Methodology, 42, 2, 319-323. Paper available at https://www150.statcan.gc.ca/n1/pub/12-001-x/2016002/article/14662-eng.pdf.

Berger, Y. (2004). Variance estimation for measures of change in probability sampling. Canadian Journal of Statistics, 32, 4, 451-467.

Caron, N., and Ravalet, P. (2000). Estimation dans les enquêtes répétées : application à l’enquête emploi en continu. Technical report INSEE, Paris.

Chauvet, G., and Goga, C. (2018). Linearization versus bootstrap for variance estimation of the change between Gini indexes. Survey Methodology, 44, 1, 17-42. Paper available at https://www150.statcan.gc.ca/n1/pub/12-001-x/2018001/article/54926-eng.pdf.

Clarke, P., and Tate, P. (2002). An application of non-ignorable non-response models for gross flows estimation in the British labour force survey. Australian & New Zealand Journal of Statistics, 4, 413-425.

Deville, J.-C., and Särndal, C.-E. (1992). Calibration estimators in survey sampling. Journal of the American Statistical Association, 87, 376-382.

Ekholm, A., and Laaksonen, S. (1991). Weighting via response modeling in the finnish household budget survey. Journal of Official Statistics, 7, 325-327.

Fay, R. (1992). When are inferences from multiple imputation valid? Proceedings of the Survey Research Methods Section, American Statistical Association, 81, 1, 227-232.

Fuller, W., and An, A. (1998). Regression adjustment for non-response. Journal of the Indian Society of Agricultural Statistics, 51, 331-342.

Fuller, W.A., Loughin, M.M. and Baker, H.D. (1994). Regression weighting in the presence of nonresponse with application to the 1987-1988 Nationwide Food Consumption Survey. Survey Methodology, 20, 1, 75-85. Paper available at https://www150.statcan.gc.ca/n1/pub/12-001-x/1994001/article/14429-eng.pdf.

Goga, C., Deville, J.-C. and Ruiz-Gazen, A. (2009). Composite estimation and linearization method for two-sample survey data. Biometrika, 96, 691-709.

Hawkes, D., and Plewis, I. (2009). Modelling nonresponse in the national child development study. Journal of the Royal Statistical Society, Series A, 169, 479-491.

Juillard, H., Chauvet, G. and Ruiz-Gazen, A. (2017). Estimation under cross-classified sampling with application to a childhood survey. Journal of the American Statistical Association, 112, 850-858.

Kalton, G. (2009). Design for surveys over time. Handbook of Statistics, 29, 89-108.

Kim, J.K., and Kim, J.J. (2007). Nonresponse weighting adjustment using estimated response probability. Canadian Journal of Statistics, 35, 501-514.

Kim, J.K., Kwon, Y. and Park, M. (2016). Calibrated propensity score method for survey nonresponse in cluster sampling. Biometrika, 103, 461-473.

Laaksonen, S. (2007). Weighting for two-phase surveyed data. Survey Methodology, 33, 2, 121-130. Paper available at https://www150.statcan.gc.ca/n1/pub/12-001-x/2007002/article/10489-eng.pdf.

Laaksonen, S., and Chambers, R.L. (2006). Survey estimation under informative nonresponse with follow-up. Journal of Official Statistics, 22, 81-95.

Laniel, N. (1988). Variances for a rotating sample from a changing population. Proceedings of the Business and Economics Statistics Section, American Statistical Association, 246-250.

Laurie, H., Smith, R. and Scott, L. (1999). Strategies for reducing nonresponse in a longitudinal panel survey. Journal of Official Statistics, 15, 269-282.

Lynn, P. (2009). Methods for longitudinal surveys. Methodology of Longitudinal Surveys, 1-19.

Nordberg, L. (2000). On variance estimation for measures of change when samples are coordinated by the use of permanent random numbers. Journal of Official Statistics, 16, 363-378.

Pirus, C., Bois, C., Dufourg, M., Lanoë, J., Vandentorren, S., Leridon, H. and the Elfe team (2010). Constructing a cohort: Experience with the French Elfe project. Population, 65, 637-670.

Qualité, L., and Tillé, Y. (2008). Variance estimation of changes in repeated surveys and its application to the Swiss survey of value added. Survey Methodology, 34, 2, 173-181. Paper available at https://www150.statcan.gc.ca/n1/pub/12-001-x/2008002/article/10758-eng.pdf.

Rendtel, U., and Harms, T. (2009). Weighting and calibration for household panels. Methodology of Longitudinal Surveys, 265-286.

Rizzo, L., Kalton, G. and Brick, J.M. (1996). A comparison of some weighting adjustment methods for panel nonresponse. Survey Methodology, 22, 1, 43-53. Paper available at https://www150.statcan.gc.ca/n1/pub/12-001-x/1996001/article/14386-eng.pdf.

Silva, P., and Skinner, C. (1997). Cross-classiffed sampling: Some estimation theory. Variable Selection for Regression Estimation in Finite Populations, 23, 23-32.

Skinner, C. (2015). Cross-classiffed sampling: Some estimation theory. Statistics & Probability Letters, 104, 163-168.

Skinner, C., and D’Arrigo, J. (2011). Inverse probability weighting for clustered non-response. Biometrika, 98, 953-966.

Skinner, C., and Vieira, M. (2005). Design effects in the analysis of longitudinal survey data. S3RI Methdology Working Papers, M05/13. Southampton, UK: Southampton Statistical Sciences Research Institute.

Slud, E.V., and Bailey, L. (2010). Evaluation and selection of models for attrition nonresponse adjustment. Journal of Official Statistics, 26, 1-18.

Tam, S. (1984). On covariance from overlapping samples. The American Statistician, 38, 1-18.

Vandecasteele, L., and Debels, A. (2007). Attrition in panel data: The effectiveness of weighting. European Sociological Review, 23, 1, 81-97.

Zhou, M., and Kim, J. (2012). An effcient method of estimation for longitudinal surveys with monotone missing data. Biometrika, 99, 631-648.

ISSN : 1492-0921

Editorial policy

Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.

Submission of Manuscripts

Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).

Note of appreciation

Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.

Standards of service to the public

Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.

Copyright

Published by authority of the Minister responsible for Statistics Canada.

Use of this publication is governed by the Statistics Canada Open Licence Agreement.

Catalogue No. 12-001-X

Frequency: Semi-annual

Ottawa

Date modified:: 2018-12-20

Language selection

Search and menus

Search

Variance estimation under monotone non-response for a panel survey
Section 7. Conclusion

Acknowledgements

Appendix

Estimation of the variance due to non-response for Response Homogeneity Groups

References

Variance estimation under monotone non-response for a panel survey Section 7. Conclusion

Acknowledgements

Appendix

Estimation of the variance due to non-response for Response Homogeneity Groups

References

Editorial policy

Submission of Manuscripts

Note of appreciation

Standards of service to the public

Copyright

Variance estimation under monotone non-response for a panel survey
Section 7. Conclusion