Variance estimation under monotone non-response for a panel survey
Section 4. Longitudinal estimators

We may be interested in a change in parameters, such as

$Δ (u \to t) = Y (t) - Y (u), \qquad (4.1)$

the difference between the totals of a variable of interest measured at two different times $u < t .$ Since the variable $y_{i u}$ is measured on all sub-samples $s_{u^{'}}$ for $u^{'} = u, \dots, t,$ there are several possible estimators for $Δ (u \to t) .$ For $u^{'} = u, \dots, t,$ we denote by

${\hat{Δ}}_{u^{'} t} (u \to t) = \sum_{i \in s_{t}} \frac{y_{i t}}{π_{i} {\hat{p}}_{i}^{1 \to t}} - \sum_{i \in s_{u^{'}}} \frac{y_{i u}}{π_{i} {\hat{p}}_{i}^{1 \to u^{'}}} \qquad (4.2)$

the estimator which makes use of $s_{t}$ for the estimation of $Y (t),$ and of $s_{u^{'}}$ for the estimation of $Y (u) .$ The case $u^{'} = u$ corresponds to the estimation of $Y (u)$ on the largest available sub-sample, $s_{u} .$ The case $u^{'} = t$ corresponds to the estimation of $Y (u)$ and $Y (t)$ on the common sub-sample $s_{t} .$
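The estimator (4.2) can be sketched numerically as follows. This is a minimal illustration, not the author's implementation: the function name `change_estimator`, its argument names and the toy data are all hypothetical, and the estimated response probabilities are simply taken as given inputs.

```python
import numpy as np

def change_estimator(y_t, pi_t, p_1_to_t, y_u, pi_u, p_1_to_uprime):
    """Sketch of (4.2): weighted total on s_t minus weighted total on s_{u'}.

    y_t, pi_t, p_1_to_t      : y_it, pi_i and p-hat_i^{1->t} for the units of s_t
    y_u, pi_u, p_1_to_uprime : y_iu, pi_i and p-hat_i^{1->u'} for the units of s_{u'}
    """
    y_t, pi_t, p_1_to_t = (np.asarray(a, dtype=float) for a in (y_t, pi_t, p_1_to_t))
    y_u, pi_u, p_1_to_uprime = (np.asarray(a, dtype=float) for a in (y_u, pi_u, p_1_to_uprime))
    total_t = np.sum(y_t / (pi_t * p_1_to_t))       # estimates Y(t) on s_t
    total_u = np.sum(y_u / (pi_u * p_1_to_uprime))  # estimates Y(u) on s_{u'}
    return float(total_t - total_u)

# Hypothetical toy data: three units in s_u, of which the first two remain in s_t.
pi = np.array([0.5, 0.5, 0.5])
y_u = np.array([8.0, 18.0, 5.0])
y_t = np.array([10.0, 20.0])
p_1_to_u = np.array([0.8, 0.8, 0.8])  # assumed estimated probabilities up to time u
p_1_to_t = np.array([0.5, 0.5])       # assumed estimated probabilities up to time t

# Case u' = u: Y(u) is estimated on the largest available sub-sample s_u.
delta_ut = change_estimator(y_t, pi[:2], p_1_to_t, y_u, pi, p_1_to_u)
# Case u' = t: Y(u) and Y(t) are both estimated on the common sub-sample s_t.
delta_tt = change_estimator(y_t, pi[:2], p_1_to_t, y_u[:2], pi[:2], p_1_to_t)
```

The two calls differ only in which sub-sample and which response probabilities are used for the second sum, which mirrors the two cases $u^{'} = u$ and $u^{'} = t$ described above.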

In the context of full response, several authors have recommended the estimator ${\hat{Δ}}_{t t} (u \to t),$ which makes use of the common sample only, if the variables $y_{i u}$ and $y_{i t}$ are strongly positively correlated; see Caron and Ravalet (2000), Qualité and Tillé (2008), Goga, Deville and Ruiz-Gazen (2009), and Chauvet and Goga (2018). In our context, this choice may be heuristically justified as follows. For $u^{'} < t,$ and by conditioning on the sub-sample $s_{u^{'}} ,$ we obtain

$V \left\{ {\hat{Δ}}_{u^{'} t} (u \to t) \right\} ≃ V \left\{ \sum_{i \in s_{u^{'}}} \frac{y_{i t} - y_{i u}}{π_{i} {\hat{p}}_{i}^{1 \to u^{'}}} \right\} + E V \left\{ \sum_{i \in s_{t}} \frac{y_{i t}}{π_{i} {\hat{p}}_{i}^{1 \to t}} \middle| s_{u^{'}} \right\}, \qquad (4.3)$

$V \left\{ {\hat{Δ}}_{t t} (u \to t) \right\} ≃ V \left\{ \sum_{i \in s_{u^{'}}} \frac{y_{i t} - y_{i u}}{π_{i} {\hat{p}}_{i}^{1 \to u^{'}}} \right\} + E V \left\{ \sum_{i \in s_{t}} \frac{y_{i t} - y_{i u}}{π_{i} {\hat{p}}_{i}^{1 \to t}} \middle| s_{u^{'}} \right\} . \qquad (4.4)$

In equations (4.3) and (4.4), the first term on the right-hand side is identical. Since the variables $y_{i u}$ and $y_{i t}$ are expected to be positively correlated, the difference $y_{i t} - y_{i u}$ is expected to be smaller in magnitude than $y_{i t},$ so the second term is expected to be smaller in (4.4) than in (4.3). The estimator ${\hat{Δ}}_{t t} (u \to t)$ based on the common sample is therefore expected to be more efficient in terms of variance. The results of a small simulation study in Section 5.2 support this heuristic reasoning. In this section, we therefore focus only on the estimator ${\hat{Δ}}_{t t} (u \to t)$ for the estimation of $Δ (u \to t) .$ As pointed out by a referee, and following the approach in Zhou and Kim (2012), a gain in efficiency may be obtained by using the full information on $s_{u},$ namely by calibrating the weights ${(π_{i} {\hat{p}}_{i}^{1 \to t})}^{- 1}$ on the estimator ${\hat{Y}}_{u} .$
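As a rough illustration of the heuristic comparison of (4.3) and (4.4), suppose (an assumption introduced here for illustration, not part of the original argument) that $y_{i u}$ and $y_{i t}$ are treated as random variables with variances $σ_{u}^{2}$ and $σ_{t}^{2}$ and correlation $ρ .$ Then

$\mathrm{Var} (y_{i t} - y_{i u}) = σ_{t}^{2} + σ_{u}^{2} - 2 ρ σ_{u} σ_{t},$

which is smaller than $\mathrm{Var} (y_{i t}) = σ_{t}^{2}$ exactly when $ρ > σ_{u} / (2 σ_{t}) .$ With equal variances, this condition reduces to $ρ > 1 / 2,$ which is in line with the requirement of a strong positive correlation between the two measurements.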

Replacing in (2.11) the variable $y_{i t}$ with $y_{i t} - y_{i u}$ yields the estimator of the variance due to the sampling design

${\hat{V}}_{t}^{p} \left\{ {\hat{Δ}}_{t t} (u \to t) \right\} = \sum_{i, j \in s_{t}} \frac{Δ_{i j}}{π_{i j}} \frac{1}{{\hat{p}}_{i j}^{1 \to t}} \frac{(y_{i t} - y_{i u})}{π_{i}} \frac{(y_{j t} - y_{j u})}{π_{j}} . \qquad (4.5)$
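The double sum in (4.5) can be sketched with matrix operations. The sketch below rests on assumptions that are not stated in this section: $Δ_{i j}$ is taken to be $π_{i j} - π_{i} π_{j},$ and the matrix of ${\hat{p}}_{i j}^{1 \to t}$ is supplied by the user. The function name `design_variance` and the toy data (Poisson sampling and independent response) are hypothetical.

```python
import numpy as np

def design_variance(d, pi, pi_joint, p_joint):
    """Sketch of (4.5) for the differences d_i = y_it - y_iu on s_t.

    d        : (n,) differences y_it - y_iu
    pi       : (n,) first-order inclusion probabilities pi_i
    pi_joint : (n, n) joint inclusion probabilities pi_ij (pi_ii = pi_i)
    p_joint  : (n, n) estimated joint response probabilities p-hat_ij^{1->t}
    """
    d, pi = np.asarray(d, dtype=float), np.asarray(pi, dtype=float)
    z = d / pi
    # Assumption: Delta_ij = pi_ij - pi_i * pi_j.
    delta = pi_joint - np.outer(pi, pi)
    return float(np.sum(delta / pi_joint / p_joint * np.outer(z, z)))

# Hypothetical toy data: Poisson sampling, so pi_ij = pi_i * pi_j for i != j.
pi = np.array([0.5, 0.5])
pi_joint = np.outer(pi, pi)
np.fill_diagonal(pi_joint, pi)
# Assumed independent response: p_ij = p_i * p_j for i != j, and p_ii = p_i.
p = np.array([0.5, 0.5])
p_joint = np.outer(p, p)
np.fill_diagonal(p_joint, p)

d = np.array([1.0, 3.0])
v_design = design_variance(d, pi, pi_joint, p_joint)
```

Storing the full $n \times n$ matrices is only practical for small samples; for designs where most $Δ_{i j}$ vanish, a sparse or design-specific formula would be preferable.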

Similarly, replacing in (2.12) the variable $y_{i t}$ with $y_{i t} - y_{i u}$ yields the estimator of the variance due to the non-response

${\hat{V}}_{t}^{nr} \left\{ {\hat{Δ}}_{t t} (u \to t) \right\} = \sum_{δ =1}^{t} \sum_{i \in s_{t}} \frac{{\hat{p}}_{i}^{δ} (1 - {\hat{p}}_{i}^{δ})}{{\hat{p}}_{i}^{δ \to t}} {\left( \frac{y_{i t} - y_{i u}}{π_{i} {\hat{p}}_{i}^{1 \to δ}} - k_{i}^{δ} {({\hat{h}}_{i}^{δ})}^{⊤} {\hat{γ}}_{t Δ}^{δ} \right)}^{2} \qquad (4.6)$

with

${\hat{γ}}_{t Δ}^{δ} = \left\{ \sum_{i \in s_{t}} k_{i}^{δ} \frac{{\hat{p}}_{i}^{δ} (1 - {\hat{p}}_{i}^{δ})}{{\hat{p}}_{i}^{δ \to t}} {\hat{h}}_{i}^{δ} {({\hat{h}}_{i}^{δ})}^{⊤} \right\}^{- 1} \sum_{i \in s_{t}} \frac{1 - {\hat{p}}_{i}^{δ}}{{\hat{p}}_{i}^{1 \to t}} {\hat{h}}_{i}^{δ} \frac{y_{i t} - y_{i u}}{π_{i}} . \qquad (4.7)$
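Equations (4.6) and (4.7) can be sketched as a loop over the waves $δ = 1, \dots, t.$ The sketch assumes, as the notation suggests but this section does not state, that ${\hat{p}}_{i}^{a \to b}$ is the product of the wave-level probabilities ${\hat{p}}_{i}^{δ}$ for $δ = a, \dots, b.$ The quantities $k_{i}^{δ}$ and ${\hat{h}}_{i}^{δ}$ are taken as given arrays, and the function name `nonresponse_variance` and the toy data are hypothetical.

```python
import numpy as np

def nonresponse_variance(d, pi, p, k, h):
    """Sketch of (4.6)-(4.7) for the differences d_i = y_it - y_iu on s_t.

    d  : (n,) differences y_it - y_iu
    pi : (n,) inclusion probabilities pi_i
    p  : (n, T) estimated response probabilities p-hat_i^delta, delta = 1..t
    k  : (n, T) factors k_i^delta
    h  : (n, T, q) vectors h-hat_i^delta
    """
    n, T = p.shape
    # Assumption: p-hat^{a->b} is the product of p-hat^delta over delta = a..b.
    p_1_to = np.cumprod(p, axis=1)                    # column s holds p-hat^{1->s+1}
    p_to_t = np.cumprod(p[:, ::-1], axis=1)[:, ::-1]  # column s holds p-hat^{s+1->t}
    p_1_to_t = p_1_to[:, -1]
    total = 0.0
    for s in range(T):
        w = p[:, s] * (1.0 - p[:, s]) / p_to_t[:, s]
        H = h[:, s, :]
        # (4.7): solve A gamma = b instead of forming the inverse explicitly.
        A = (H * (k[:, s] * w)[:, None]).T @ H
        b = H.T @ ((1.0 - p[:, s]) / p_1_to_t * d / pi)
        gamma = np.linalg.solve(A, b)
        # (4.6): weighted squared residuals for wave s + 1.
        resid = d / (pi * p_1_to[:, s]) - k[:, s] * (H @ gamma)
        total += np.sum(w * resid ** 2)
    return float(total)

# Hypothetical toy data: two units, one wave, a single constant regressor.
d = np.array([1.0, 3.0])
pi = np.array([0.5, 0.5])
p = np.array([[0.5], [0.5]])
k = np.ones((2, 1))
h = np.ones((2, 1, 1))
v_nr = nonresponse_variance(d, pi, p, k, h)
```

`np.linalg.solve` raises an error if the matrix in (4.7) is singular, which can happen when the ${\hat{h}}_{i}^{δ}$ are collinear on $s_{t}$; a production implementation would need to handle that case.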

The global variance estimator for ${\hat{Δ}}_{t t} (u \to t)$ is

${\hat{V}}_{t} \left\{ {\hat{Δ}}_{t t} (u \to t) \right\} = {\hat{V}}_{t}^{p} \left\{ {\hat{Δ}}_{t t} (u \to t) \right\} + {\hat{V}}_{t}^{nr} \left\{ {\hat{Δ}}_{t t} (u \to t) \right\} . \qquad (4.8)$

Variance estimation for measures of change is also considered in Berger (2004), Qualité and Tillé (2008), Goga et al. (2009), Chauvet and Goga (2018), among others.

The simplified estimator of the variance due to non-response is

${\hat{V}}_{t, simp}^{nr} \left\{ {\hat{Δ}}_{t t} (u \to t) \right\} = \sum_{i \in s_{t}} \frac{1 - {\hat{p}}_{i}^{1 \to t}}{{({\hat{p}}_{i}^{1 \to t})}^{2}} {\left( \frac{y_{i t} - y_{i u}}{π_{i}} \right)}^{2} . \qquad (4.9)$

If the variables $y_{i t}$ and $y_{i u}$ are strongly positively correlated, the bias of the simplified variance estimator is expected to be small.
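The simplified estimator (4.9) needs only the differences, the inclusion probabilities and the overall estimated response probabilities on $s_{t}.$ A minimal sketch follows; the function name `simplified_nonresponse_variance` and the toy data are hypothetical.

```python
import numpy as np

def simplified_nonresponse_variance(d, pi, p_1_to_t):
    """Sketch of (4.9) for the differences d_i = y_it - y_iu on s_t.

    d        : (n,) differences y_it - y_iu
    pi       : (n,) inclusion probabilities pi_i
    p_1_to_t : (n,) estimated response probabilities p-hat_i^{1->t}
    """
    d, pi, p = (np.asarray(a, dtype=float) for a in (d, pi, p_1_to_t))
    return float(np.sum((1.0 - p) / p ** 2 * (d / pi) ** 2))

# Hypothetical toy data for two units of s_t.
v_nr_simp = simplified_nonresponse_variance([1.0, 3.0], [0.5, 0.5], [0.5, 0.5])
```

Because each term is proportional to the squared difference $(y_{i t} - y_{i u})^{2},$ the estimate shrinks as the two measurements get closer, whatever the response probabilities.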

ISSN : 1492-0921

Copyright

Published by authority of the Minister responsible for Statistics Canada.

Use of this publication is governed by the Statistics Canada Open Licence Agreement.

Catalogue No. 12-001-X

Frequency: Semi-annual

Ottawa

Date modified: 2018-12-20
