Bayesian inference for a variance component model using pairwise composite likelihood with survey data
Section 3. Simulation studies

3.1 Simulation design

Using simulation studies we have evaluated the performance of the proposed method, i.e., pairwise composite likelihood with a curvature adjustment, and compared it with using the full likelihood and the pairwise composite likelihood. We used the model in (1.1) to generate our data, i.e., for $i = 1, \dots, n$ and $j = 1, \dots, m$ we simulated values of $Y_{i j}$ from

$Y_{i j} = θ + u_{i} + e_{i j}, (3.1)$

where $θ = 1,$ $u_{i} \overset{iid}{~} N (0, σ_{u}^{2}),$ and $e_{i j} \overset{iid}{~} N (0, σ_{e}^{2}) .$ This is equivalent to having applied the superpopulation generation and sampling described in the paragraph surrounding (1.1).

Our first study, not included here, considered inference about $θ$ with known $σ_{u}$ and $σ_{e} .$ It showed that using the pairwise composite likelihood for inference about $θ$ badly overstated the precision, and that the curvature adjustment was successful. Thus, we proceeded to a more thorough study, considering inference for both $θ$ and $σ_{u} .$ To simplify we took $σ_{e} = 0 .5,$ and considered $n \in {20, 40}$ and $m \in {5, 10} .$ For the half-Cauchy prior defined in (2.4) we took $A \in {5, 10, 15} .$ There were 500 replicate data sets for each setting.

We considered three scenarios: (1) $σ_{u} \in {0 .1, 0 .5}$ and the half-Cauchy prior on $σ_{u};$ (2) Signal to Noise Ratio, $SNR \in {0 .25, 0 .75}$ and the half-Cauchy prior on $σ_{u},$ where $SNR = σ_{u}^{2} / (σ_{u}^{2} + σ_{e}^{2});$ and (3) $σ_{u} \in {0 .1, 0 .5}$ and a uniform prior on $σ_{u} .$ Throughout, we took a uniform prior on $θ .$

In Section 3.2 we describe the algorithms for the simulation studies.

3.2 Algorithms

As in Sections 2.1 and 2.2 define $y (n) = {y_{1}, \dots, y_{n}}$ with $y_{i} = {(y_{i 1}, \dots, y_{i m})}^{T}$ and $\bar{y} = \sum_{i =1}^{n} \sum_{j =1}^{m} y_{i j} / (m n) .$ Further, $η^{(t)}$ denotes the value of $η$ at the $t^{th}$ iteration where $η = {(θ, σ_{u})}^{T} .$ The full likelihood is

$L_{FL} (θ, σ_{u} | y (n)) \propto {| Σ_{m} |}^{- n / 2} exp [- \frac{1}{2} tr (Σ_{m}^{- 1} S_{0})], (3.2)$

as in (2.7).

Using (3.2) together with the prior, $π (η),$ yields the posterior density,

$p_{FL} (η | y (n)) \propto L_{FL} (η | y (n)) π (η) .$

Sampling $θ$ and $σ_{u}$ is done in three steps:

Step 1. Sample $θ^{(t)}$ from $p_{FL} (θ | y (n), σ_{u}^{(t - 1)})$ where

$θ | (y (n), σ_{u}) ~ N (\bar{y}, \frac{σ_{e}^{2} + m σ_{u}^{2}}{m n}) .$

We set the starting value, $σ_{u}^{(0)},$ to be the maximum likelihood estimate of $σ_{u} .$

Step 2. Use the Metropolis-Hastings (MH) algorithm to sample $σ_{u}^{(t)}$ from $p_{FL} (σ_{u} | y (n), θ^{(t)}) .$ The latter is easily obtained from $p_{FL} (η | y (n)) .$ Given $s > 0,$ the candidate $σ_{u},$ labelled $σ_{u}^{*},$ is sampled from the jumping distribution, $N (σ_{u}^{(t - 1)}, s^{2}) .$ If $σ_{u}^{*} < 0, σ_{u}^{(t)} = σ_{u}^{(t - 1)} .$ Otherwise, the procedure is standard with accept/reject ratio $p_{FL} (η_{FL}^{*} | y (n)) / p_{FL} (η_{FL}^{(t - 1)} | y (n))$ where $η_{FL}^{*} = {(θ^{(t)}, σ_{u}^{*})}^{T}$ and $η_{FL}^{(t - 1)} = {(θ^{(t)}, σ_{u}^{(t - 1)})}^{T} .$

Step 3. Repeat Steps 1 and 2 for $K = 1,000$ times with the first 200 samples used as the burn-in.

The pairwise composite likelihood (PL) is

$L_{PL} (θ, σ_{u} | y (n)) \propto {| Σ_{2} |}^{n m (m - 1) / 4} exp [- \frac{1}{2} tr (Σ_{2}^{- 1} S_{0 PL})], (3.3)$

as in (2.8).

Using (3.3) together with the chosen prior, $π (η),$ yields the posterior density $p_{PL} (η | y (n)) .$

Sampling $θ$ and $σ_{u}$ is done in three steps:

Step 1. Sample $θ^{(t)}$ from $p_{PL} (θ | y (n), σ_{u}^{(t - 1)})$ where

$θ | (y (n), σ_{u}) ~ N (\bar{y}, \frac{σ_{e}^{2} + 2 σ_{u}^{2}}{n m (m - 1)}) .$

Step 2. Use the Metropolis-Hastings (MH) algorithm to sample $σ_{u}^{(t)}$ from $p_{PL} (σ_{u} | y (n), θ^{(t)}),$ as described in Step 2 above for the FL (substituting PL for FL in all formulas).

Step 3. Repeat Steps 1 and 2 for $K = 1,000$ times with the first 200 samples used as the burn-in.

The final part is to obtain the (curvature) adjusted pairwise composite likelihood (APL), as described in Section 2.3. This derivation, based on the approach of RCD, exploits ${\hat{η}}_{A PL},$ the estimated posterior means of $θ$ and $σ_{u} .$

Step 1. Given $(s_{θ}, s_{σ})$ sample the candidate $η^{*} = {(θ^{*}, σ_{u}^{*})}^{T}$ from the bivariate normal jumping distribution, $N_{2} (η^{(t - 1)}, Σ)$ where $Σ = diag (s_{θ}^{2}, s_{σ}^{2}) .$ If $σ_{u}^{*} <0, η^{(t)} = η^{(t - 1)} .$ Otherwise, go to Step 2.

Step 2. Define $l_{PL} (y (n) | θ, σ_{u})$ as the log pairwise composite likelihood obtained by taking the logarithm of (3.3), and $l_{PL} (y_{i} | θ, σ_{u})$ as the log pairwise composite likelihood corresponding to the data from cluster $i,$ i.e., $y_{i} .$

Step 3. Numerically obtain $\hat{H} = \nabla^{2} l_{PL} (y (n) | {\hat{θ}}_{PL}, {\hat{σ}}_{u PL})$ and

$\hat{J} = \sum_{i =1}^{n} [\nabla l_{PL} (y_{i} | {\hat{θ}}_{PL}, {\hat{σ}}_{u PL}) {\nabla l (y_{i} | {\hat{θ}}_{PL}, {\hat{σ}}_{u PL})}^{T}],$

where ${\hat{θ}}_{PL}$ and ${\hat{σ}}_{u PL}$ are the estimated posterior means of $θ$ and $σ_{u} .$

Step 4. Based on the approach of RCD, and using the singular value decomposition, we write $\hat{H} = M^{T} M$ and $\hat{H} {\hat{J}}^{- 1} \hat{H} = M_{A}^{T} M_{A}$ for some matrices $M$ and $M_{A} .$ Then define $C = M^{- 1} M_{A} .$ In our case, $C$ is a $2 \times 2$ matrix.

Step 5. From RCD the adjusted log pairwise composite likelihood, $l_{APL},$ is

$l_{APL} (y (n) | η) = l_{PL} (y (n) | η^{*})$

where

$η^{*} = {\hat{η}}_{PL} + C (η - {\hat{η}}_{PL}) .$

Step 6. Define the adjusted pairwise posterior density as

$p_{APL} (η | y (n)) \propto L_{APL} (y (n) | η) π (θ, σ_{u})$

where $L_{APL} (y (n) | η) = exp (l_{APL} (y (n) | η)),$ the latter defined in Step 5.

Using the candidate value, $η^{*},$ from Step 1 define the adjusted candidate value $η_{c}^{*} = {\hat{η}}_{PL} + C (η^{*} - {\hat{η}}_{PL}) .$ Then the accept/reject ratio is

$p_{APL} (η_{c}^{*} | y (n)) / p_{APL} (η^{(t)} | y (n)) .$

The remaining steps are the standard ones for the Metropolis-Hastings algorithm.

3.3 Results from simulations

For each method (FL, PL, APL), each design parameter $(m, n)$ and each prior distribution we summarized the simulation results using (a) the credible interval coverage rate in repeated sampling, and (b) the averages of the 0.025, 0.25, 0.50, 0.75 and 0.975 points of the posterior distributions of $θ$ and $σ_{u} .$

There are also graphical summaries, i.e., averaged posterior density estimates for each of the posterior distributions, i.e., $p_{FL} (η | y (n)), p_{PL} (η | y (n))$ and $p_{APL} (η | y (n)) .$ First, consider an interval, say, $[a, b],$ that supports most of the mass (e.g., 95%) of the posterior densities. Then divide it into $M = 50$ equally spaced subintervals with the cut points $a = c_{0} < c_{1} < \dots < c_{M - 1} < c_{M} = b .$ For $t = 1, \dots, T,$ let ${\hat{f}}_{P}^{(t)} (.)$ denote the estimate of the posterior density $f_{P} (.),$ derived from the $t^{th}$ simulation, where $P$ stands for FL, PL or APL, and $T$ is the number of simulations. Next define, for $r = 1, \dots, M,$

${\hat{f}}_{P} (c_{r}) = \frac{1}{T} \sum_{t = 1}^{T} {\hat{f}}_{P}^{(t)} (c_{r}) .$

Then a curve connecting the points ${c_{r}, {\hat{f}}_{P} (c_{r})}$ for $a = c_{0} < c_{1} < \dots < c_{M} = b,$ is taken as the averaged posterior density estimate for $f_{P} (.) .$

Table 3.1 presents the coverage rates for $θ$ and $σ_{u}$ for $A = 15,$ $n \in {20, 40},$ $m \in {5, 10},$ and $σ_{u} \in {0 .1, 0 .289, 0 .5, 0 .866} .$ Figure 3.1 has the average posterior density estimates for $θ$ and $σ_{u}$ for $A = 15, σ_{u} \in {0 .1, 0 .5}, n = 40,$ and $m = 10.$ In both Table 3.1 and Figure 3.1 the summaries are given for the full likelihood (FL), pairwise composite likelihood (CL), and adjusted pairwise composite likelihood (APL).

Table 3.1
Coverage rates (in percent) for the 95% credible intervals of $θ$ and $σ_{u}$ with $A =15$
Table summary
This table displays the results of Coverage rates (in percent) for the 95% credible intervals of (équation) and (équation) with (équation) (équation)0.1, (équation)0.289, (équation)0.5, (équation)0.866 and (équation), calculated using (équation) units of measure (appearing as column headers).
		$n =20$	$n =40$	$n =20$	$n =40$	$n =20$	$n =40$	$n =20$	$n =40$
		$σ_{u} =$ 0.1		$σ_{u} =$ 0.289		$σ_{u} =$ 0.5		$σ_{u} =$ 0.866
		$θ$
$m =5$	${\hat{θ}}_{FL}$	97.40	95.80	94.84	94.60	94.80	94.40	94.80	95.00
	${\hat{θ}}_{PL}$	68.20	66.60	58.45	58.40	53.60	51.40	50.00	50.20
	${\hat{θ}}_{APL}$	92.40	93.00	92.96	93.60	92.20	92.20	91.60	93.00
$m =10$	${\hat{θ}}_{FL}$	94.80	95.00	95.00	94.00	94.80	94.20	95.00	93.80
	${\hat{θ}}_{PL}$	43.80	42.80	35.40	31.80	30.40	29.60	27.40	26.40
	${\hat{θ}}_{APL}$	90.60	91.80	92.20	93.40	92.80	92.60	91.80	93.00
$m =5$	${\hat{σ}}_{u, FL}$	97.20	99.00	91.55	95.40	93.00	94.80	92.60	95.00
	${\hat{σ}}_{u, PL}$	92.80	85.60	59.62	61.80	52.40	54.20	46.20	48.20
	${\hat{σ}}_{u, APL}$	88.40	83.40	86.85	92.20	84.40	91.20	82.00	89.60
( $m =10$	${\hat{σ}}_{u, FL}$	99.00	97.20	93.60	92.80	93.80	93.80	93.00	93.60
	${\hat{σ}}_{u, PL}$	63.60	56.80	33.40	38.00	27.00	29.60	24.40	26.60
	${\hat{σ}}_{u, APL}$	82.80	84.40	85.20	89.00	80.80	86.60	79.00	87.00

The following summary includes the results for only the half-Cauchy prior with $A \in {5, 10, 15},$ $m \in {5, 10},$ $n \in {20, 40},$ and $σ_{u} \in {0 .1, 0 .289, 0 .5, 0 .866},$ the second and fourth values corresponding to SNR = 0.25 and SNR = 0.75, respectively. The results are similar for the three choices of $A,$ and for the uniform prior.

Without any adjustment the coverages of PL differ substantially from the nominal 0.95. For example (Table 3.1), for $A = 15,$ $n = 40,$ $m = 10,$ and $σ_{u} = 0 .5,$ the coverage for $θ$ is less than 0.30. Considering all values of the design parameters, the largest coverage is 0.70. In most cases, the coverage for $θ$ is much less than 0.70.

With the curvature adjustment the coverage for $θ$ is excellent. Of the 48 cases (three choices of $A,$ two choices of $m,$ two choices of $n,$ four choices of $σ_{u}),$ thirteen had coverage between 0.93 and 0.95, twenty-two between 0.92 and 0.93, eleven between 0.91 and 0.92, and two below 0.91, with the latter for $σ_{u} = 0 .1,$ $n = 20,$ $m = 10,$ and $A = 5$ and 15.

With the curvature adjustment the coverage for $σ_{u}$ varies considerably, but there is, in almost all cases, a very large improvement in coverage relative to using the uncorrected pairwise composite likelihood.

The plots (Figure 3.1) show that for $θ$ the posterior distribution corresponding to the adjusted likelihood is very close to the posterior distribution using the full likelihood. For $σ_{u}$ there are differences between the posterior distributions corresponding to the full and adjusted likelihoods, most notably a shift to smaller values for the latter.

To investigate the effects of increasing $m$ and $n,$ consider the difference $δ = C_{FL} - C_{APL}$ where $C$ denotes coverage and FL and APL refer to the corresponding posterior distributions.

Overall with all $m, n, A,$ and $σ_{u},$ for $θ,$ $δ$ decreases as $n$ increases. For the larger values of $σ_{u},$ $δ$ decreases as $m$ increases, while for the smaller values of $σ_{u},$ $δ$ tends to increase as $m$ increases. Overall, for $σ_{u},$ $δ$ decreases as $n$ increases except in the case $σ_{u} = 0 .1,$ while $δ$ increases as $m$ increases.

The reason for the deterioration of the adjustment as $m$ increases might be that the number of pairs per cluster is $m (m - 1) / 2$ and increases more rapidly, so that the pairwise likelihood quickly becomes more concentrated around its mode; the curvature adjustment may not suffice to compensate for a change in shape of the log pairwise composite likelihood, e.g., an increase in kurtosis.

Table 3.2 presents one-sided non-coverage rates of the 95% Credible Intervals for $θ$ and $σ_{u}$ with $A = 15,$ $n \in {20, 40},$ $m \in {5, 10},$ and $σ_{u} \in {0 .1, 0 .289, 0 .5, 0 .866} .$ We observe the following:

For $θ,$ the non-coverage for full likelihood intervals appears symmetric. The adjusted pairwise likelihood has undercoverage for $θ,$ and except when $σ_{u}$ is 0.1 the non-coverage is symmetric. A dependence of the coverage on $m$ is seen only in the $σ_{u} = 0 .1$ case.
For $σ_{u},$ the full likelihood interval has non-coverage that is close to nominal and not very skewed, except in the case when $σ_{u} = 0 .1,$ where there is marked over-coverage. For $σ_{u} > 0 .1$ and $m = 5,$ coverage improves as $n$ moves from 20 to 40, but for $σ_{u} > 0 .1$ and $m = 10,$ there is little difference in coverage for the two values of $n .$
For $σ_{u},$ the adjusted pairwise likelihood has asymmetric non-coverage. Except in the case of $σ_{u} = 0 .1,$ the magnitude of the non-coverage tends to be similar on the left to that of the full likelihood, but much greater on the right, and the coverage improves as $n$ moves from 20 to 40.

Description of Figure 3.1

Figure representing the estimated averaged posterior densities of $θ$ (graphs a and b) and $σ_{μ}$ (graphs c and d) using three methods (full (black), composite (red) and adjusted composite (blue) likelihood methods) when the scale hyperparameter $A = 15,$ the number of clusters $n = 40,$ the cluster size $m = 10,$ and $σ_{μ} = (0 .1, 0 .5)$ using a half-Cauchy prior for $σ_{μ} .$ The plots show that for $θ$ the posterior distribution corresponding to the adjusted likelihood is very close to the posterior distribution using the full likelihood. For $σ_{μ},$ there are differences between the posterior distributions corresponding to the full and adjusted likelihoods, most notably a shift to smaller values for the latter.

Remembering that the adjusted log pairwise likelihood is not explicitly being constructed to approximate the log full likelihood, it does appear in Figure 3.1 that the adjusted log pairwise likelihood falls more quickly in the tails.

We also tried centering the curvature adjustment at the log pairwise posterior mode rather than the log pairwise posterior mean, and found

Table 3.2
One-sided non-coverage rates (in percent) of the 95% Credible Intervals (CIs) of $θ$ and $σ_{u}$ with $A =15$
Table summary
This table displays the results of One-sided non-coverage rates (in percent) of the 95% Credible Intervals (CIs) of $θ$ and $σ_{u}$ with $A =15$ Non-CR-L, Non-CR-R, $σ_{u} =$ 0.1 and $σ_{u} =$ 0.289, calculated using $θ$ units of measure (appearing as column headers).
		$σ_{u} =$ 0.1				$σ_{u} =$ 0.289
		Non-CR-L	Non-CR-R	Non-CR-L	Non-CR-R	Non-CR-L	Non-CR-R	Non-CR-L	Non-CR-R
		$n =20$		$n =40$		$n =20$		$n =40$
		$θ$
$m =5$	${\hat{θ}}_{FL}$	1.40	1.20	1.60	2.60	3.05	2.11	3.20	2.20
	${\hat{θ}}_{PL}$	16.60	15.20	16.40	17.00	21.13	20.42	20.40	21.20
	${\hat{θ}}_{APL}$	2.60	5.00	2.20	4.80	3.76	3.29	3.80	2.60
$m =10$	${\hat{θ}}_{FL}$	2.80	2.40	2.40	2.60	3.00	2.00	3.20	2.80
	${\hat{θ}}_{PL}$	26.80	29.40	28.60	28.60	33.00	31.60	33.80	34.40
	${\hat{θ}}_{APL}$	3.60	5.80	4.60	3.60	4.00	3.80	3.80	2.80
		$σ_{u}$
$m =5$	${\hat{σ}}_{u, FL}$	2.80	0.00	1.00	0.00	3.05	5.40	1.80	2.80
	${\hat{σ}}_{u, PL}$	7.20	0.00	9.20	5.20	14.79	25.59	14.80	23.40
	${\hat{σ}}_{u, APL}$	4.60	7.00	3.60	13.00	3.05	10.09	2.40	5.40
$m =10$	${\hat{σ}}_{u, FL}$	1.00	0.00	2.00	0.80	3.00	3.40	3.80	3.40
	${\hat{σ}}_{u, PL}$	14.60	21.80	17.80	25.40	22.80	43.80	24.80	37.20
	${\hat{σ}}_{u, APL}$	3.40	13.80	3.80	11.80	3.00	11.80	2.80	8.20
		$σ_{u} =$ 0.5				$σ_{u} =$ 0.866
		$n =20$		$n =40$		$n =20$		$n =40$
		$θ$
$m =5$	${\hat{θ}}_{FL}$	3.20	2.00	3.00	2.60	3.40	1.80	3.00	2.00
	${\hat{θ}}_{PL}$	24.40	22.00	24.60	24.00	26.40	23.60	25.80	24.00
	${\hat{θ}}_{APL}$	4.40	3.40	4.20	3.60	4.20	4.20	3.80	3.20
$m =10$	${\hat{θ}}_{FL}$	3.00	2.20	3.40	2.40	3.00	2.00	3.40	2.80
	${\hat{θ}}_{PL}$	34.60	35.00	35.60	34.80	36.80	35.80	37.60	36.00
	${\hat{θ}}_{APL}$	4.00	3.20	4.20	3.20	4.80	3.40	3.40	3.60
		$σ_{u}$
$m =5$	${\hat{σ}}_{u, FL}$	3.00	4.00	2.20	3.00	3.40	4.00	2.00	3.00
	${\hat{σ}}_{u, PL}$	16.00	31.60	18.20	27.60	19.20	34.60	21.00	30.80
	${\hat{σ}}_{u, APL}$	1.20	14.40	1.80	7.00	1.40	16.60	2.20	8.20
$m =10$	${\hat{σ}}_{u, FL}$	3.20	3.00	2.80	3.40	3.80	3.20	3.20	3.20
	${\hat{σ}}_{u, PL}$	24.00	49.00	28.00	42.40	25.40	50.20	29.80	43.60
	${\hat{σ}}_{u, APL}$	2.60	16.60	2.40	11.00	3.20	17.80	2.00	11.00
Note: Non-CR-L represents the left-side non-coverage rates (in percent) for the 95% CIs of $θ$ and $σ_{u}$ ; Non-CR-R represents the right-side non-coverage rates (in percent) for the 95% CIs of $θ$ and $σ_{u}$ .

that the under-coverage increased, though the asymmetry of coverage was less severe, for the resulting credible intervals.

ISSN : 1492-0921

Editorial policy

Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.

Submission of Manuscripts

Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).

Note of appreciation

Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.

Standards of service to the public

Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.

Copyright

Published by authority of the Minister responsible for Statistics Canada.

Use of this publication is governed by the Statistics Canada Open Licence Agreement.

Catalogue No. 12-001-X

Frequency: Semi-annual

Ottawa

Date modified:: 2022-06-21

Language selection

Search and menus

Search

Bayesian inference for a variance component model using pairwise composite likelihood with survey data
Section 3. Simulation studies

3.1 Simulation design

3.2 Algorithms

3.3 Results from simulations

Bayesian inference for a variance component model using pairwise composite likelihood with survey data Section 3. Simulation studies

3.1 Simulation design

3.2 Algorithms

3.3 Results from simulations

Editorial policy

Submission of Manuscripts

Note of appreciation

Standards of service to the public

Copyright

Bayesian inference for a variance component model using pairwise composite likelihood with survey data
Section 3. Simulation studies