Bayesian inference for a variance component model using pairwise composite likelihood with survey data
Section 3. Simulation studies
3.1 Simulation design
Using simulation studies we have evaluated the
performance of the proposed method, i.e., pairwise composite likelihood with a
curvature adjustment, and compared it with using the full likelihood and the
pairwise composite likelihood. We used the model in (1.1) to generate our data,
i.e., for and we simulated values of from
where and This is
equivalent to having applied the superpopulation generation and sampling
described in the paragraph surrounding (1.1).
Our first study, not included here, considered inference
about with known and It showed that using the pairwise composite
likelihood for inference about badly overstated the precision, and that the
curvature adjustment was successful. Thus, we proceeded to a more thorough
study, considering inference for both and To simplify we took and considered and For the half-Cauchy prior defined in (2.4) we
took There were 500 replicate data sets for each
setting.
We considered three scenarios: (1) and the half-Cauchy prior on (2) Signal to Noise Ratio, and the half-Cauchy prior on where and (3) and a uniform prior on Throughout, we took a uniform prior on
In Section 3.2 we describe the algorithms for the
simulation studies.
3.2 Algorithms
As in Sections 2.1 and 2.2 define with and Further, denotes the value of at the iteration where The full likelihood is
as in (2.7).
Using (3.2)
together with the prior, yields
the posterior density,
Sampling and is done
in three steps:
Step 1. Sample from where
We set the
starting value, to be
the maximum likelihood estimate of
Step 2. Use the Metropolis-Hastings (MH)
algorithm to sample from The latter is easily obtained from Given the candidate labelled is sampled from the jumping distribution, If Otherwise, the procedure is standard with
accept/reject ratio where and
Step 3. Repeat Steps 1 and 2 for times with the first 200 samples used as the
burn-in.
The pairwise
composite likelihood (PL) is
as in (2.8).
Using (3.3) together with the chosen prior, yields the posterior density
Sampling and is done in three steps:
Step 1. Sample from where
Step 2. Use the Metropolis-Hastings (MH)
algorithm to sample from as described in Step 2 above for the FL
(substituting PL for FL in all formulas).
Step 3. Repeat Steps 1 and 2 for times with the first 200 samples used as the
burn-in.
The final
part is to obtain the (curvature) adjusted pairwise composite likelihood (APL),
as described in Section 2.3. This derivation, based on the approach of
RCD, exploits the
estimated posterior means of and
Step 1. Given sample the candidate from the bivariate normal jumping
distribution, where If Otherwise, go to Step 2.
Step 2. Define as the log pairwise composite likelihood
obtained by taking the logarithm of (3.3), and as the log pairwise composite likelihood
corresponding to the data from cluster i.e.,
Step 3. Numerically obtain and
where and are the
estimated posterior means of and
Step 4. Based on the approach of RCD, and using
the singular value decomposition, we write and for some matrices and Then define In our case, is a matrix.
Step 5. From RCD the adjusted log pairwise
composite likelihood, is
where
Step 6. Define the adjusted pairwise posterior
density as
where the
latter defined in Step 5.
Using the candidate value, from Step 1 define the adjusted candidate
value Then the accept/reject ratio is
The remaining
steps are the standard ones for the Metropolis-Hastings algorithm.
3.3 Results from simulations
For each method (FL, PL, APL), each design parameter and each prior distribution we summarized the
simulation results using (a) the credible interval coverage rate in repeated
sampling, and (b) the averages of the 0.025, 0.25, 0.50, 0.75 and 0.975 points
of the posterior distributions of and
There are also graphical summaries, i.e., averaged
posterior density estimates for each of the posterior distributions, i.e., and First, consider an interval, say, that supports most of the mass (e.g., 95%) of
the posterior densities. Then divide it into equally spaced subintervals with the cut
points For let denote the estimate of the posterior density derived from the simulation, where stands for FL, PL or APL, and is the number of simulations. Next define, for
Then a curve connecting the points for is taken as the averaged posterior density
estimate for
Table 3.1 presents the coverage rates for and for and Figure 3.1 has the average posterior
density estimates for and for and In both Table 3.1 and Figure 3.1 the
summaries are given for the full likelihood (FL), pairwise composite likelihood
(CL), and adjusted pairwise composite likelihood (APL).
Table 3.1
Coverage rates (in percent) for the 95% credible intervals of
and
with
Table summary
This table displays the results of Coverage rates (in percent) for the 95% credible intervals of (équation) and (équation) with (équation) (équation)0.1, (équation)0.289, (équation)0.5, (équation)0.866 and (équation), calculated using (équation) units of measure (appearing as column headers).
|
0.1 |
0.289 |
0.5 |
0.866 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
97.40 |
95.80 |
94.84 |
94.60 |
94.80 |
94.40 |
94.80 |
95.00 |
|
|
68.20 |
66.60 |
58.45 |
58.40 |
53.60 |
51.40 |
50.00 |
50.20 |
|
|
92.40 |
93.00 |
92.96 |
93.60 |
92.20 |
92.20 |
91.60 |
93.00 |
|
|
|
94.80 |
95.00 |
95.00 |
94.00 |
94.80 |
94.20 |
95.00 |
93.80 |
|
|
43.80 |
42.80 |
35.40 |
31.80 |
30.40 |
29.60 |
27.40 |
26.40 |
|
|
90.60 |
91.80 |
92.20 |
93.40 |
92.80 |
92.60 |
91.80 |
93.00 |
|
|
|
97.20 |
99.00 |
91.55 |
95.40 |
93.00 |
94.80 |
92.60 |
95.00 |
|
|
92.80 |
85.60 |
59.62 |
61.80 |
52.40 |
54.20 |
46.20 |
48.20 |
|
|
88.40 |
83.40 |
86.85 |
92.20 |
84.40 |
91.20 |
82.00 |
89.60 |
| (
|
|
99.00 |
97.20 |
93.60 |
92.80 |
93.80 |
93.80 |
93.00 |
93.60 |
|
|
63.60 |
56.80 |
33.40 |
38.00 |
27.00 |
29.60 |
24.40 |
26.60 |
|
|
82.80 |
84.40 |
85.20 |
89.00 |
80.80 |
86.60 |
79.00 |
87.00 |
The following summary includes the results for only the
half-Cauchy prior with and the second and fourth values corresponding to SNR = 0.25
and SNR = 0.75, respectively. The results are similar for the three
choices of and for the uniform prior.
Without any adjustment the coverages of PL differ
substantially from the nominal 0.95. For example (Table 3.1), for and the coverage for is less than 0.30. Considering all
values of the design parameters, the largest coverage is 0.70. In most cases,
the coverage for is much less than 0.70.
With the curvature adjustment the coverage for is excellent. Of the 48 cases (three choices
of two choices of two choices of four choices of thirteen had coverage between 0.93 and 0.95,
twenty-two between 0.92 and 0.93, eleven between 0.91 and 0.92, and two below
0.91, with the latter for and and 15.
With the curvature adjustment the coverage for varies considerably, but there is, in almost
all cases, a very large improvement in coverage relative to using the
uncorrected pairwise composite likelihood.
The plots (Figure 3.1) show that for the posterior distribution corresponding to
the adjusted likelihood is very close to the posterior distribution using the
full likelihood. For there are differences between the posterior
distributions corresponding to the full and adjusted likelihoods, most notably
a shift to smaller values for the latter.
To investigate the effects of increasing and consider the difference where denotes coverage and FL and APL refer to the
corresponding posterior distributions.
Overall with all and for decreases as increases. For the larger values of decreases as increases, while for the smaller values of tends to increase as increases. Overall, for decreases as increases except in the case while increases as increases.
The reason for the deterioration of the adjustment as increases might be that the number of pairs
per cluster is and increases more rapidly, so that the
pairwise likelihood quickly becomes more concentrated around its mode; the
curvature adjustment may not suffice to compensate for a change in shape of the
log pairwise composite likelihood, e.g., an increase in kurtosis.
Table 3.2 presents one-sided non-coverage rates of
the 95% Credible Intervals for and with and We observe the following:
- For the
non-coverage for full likelihood intervals appears symmetric. The adjusted
pairwise likelihood has undercoverage for and
except when is 0.1
the non-coverage is symmetric. A dependence of the coverage on is seen
only in the case.
- For the full
likelihood interval has non-coverage that is close to nominal and not very
skewed, except in the case when where
there is marked over-coverage. For and coverage
improves as moves
from 20 to 40, but for and there is
little difference in coverage for the two values of
- For the
adjusted pairwise likelihood has asymmetric non-coverage. Except in the case of
the
magnitude of the non-coverage tends to be similar on the left to that of the
full likelihood, but much greater on the right, and the coverage improves as moves
from 20 to 40.

Description of Figure 3.1
Figure representing the estimated averaged posterior densities of
(graphs a and b) and
(graphs c and d) using three methods (full (black), composite (red) and adjusted composite (blue) likelihood methods) when the scale hyperparameter
the number of clusters
the cluster size
and
using a half-Cauchy prior for
The plots show that for
the posterior distribution corresponding to the adjusted likelihood is very close to the posterior distribution using the full likelihood. For
there are differences between the posterior distributions corresponding to the full and adjusted likelihoods, most notably a shift to smaller values for the latter.
Remembering that the adjusted log pairwise likelihood is
not explicitly being constructed to approximate the log full likelihood, it
does appear in Figure 3.1 that the adjusted log pairwise likelihood falls
more quickly in the tails.
We also tried centering the curvature adjustment at the
log pairwise posterior mode rather than the log pairwise posterior mean, and
found
Table 3.2
One-sided non-coverage rates (in percent) of the 95% Credible Intervals (CIs) of
and
with
Table summary
This table displays the results of One-sided non-coverage rates (in percent) of the 95% Credible Intervals (CIs) of
and
with
Non-CR-L, Non-CR-R,
0.1 and
0.289, calculated using
units of measure (appearing as column headers).
|
Non-CR-L |
Non-CR-R |
Non-CR-L |
Non-CR-R |
Non-CR-L |
Non-CR-R |
Non-CR-L |
Non-CR-R |
|
0.1 |
0.289 |
|
|
|
|
|
|
|
|
|
|
1.40 |
1.20 |
1.60 |
2.60 |
3.05 |
2.11 |
3.20 |
2.20 |
|
|
16.60 |
15.20 |
16.40 |
17.00 |
21.13 |
20.42 |
20.40 |
21.20 |
|
|
2.60 |
5.00 |
2.20 |
4.80 |
3.76 |
3.29 |
3.80 |
2.60 |
|
|
|
2.80 |
2.40 |
2.40 |
2.60 |
3.00 |
2.00 |
3.20 |
2.80 |
|
|
26.80 |
29.40 |
28.60 |
28.60 |
33.00 |
31.60 |
33.80 |
34.40 |
|
|
3.60 |
5.80 |
4.60 |
3.60 |
4.00 |
3.80 |
3.80 |
2.80 |
|
|
|
|
|
2.80 |
0.00 |
1.00 |
0.00 |
3.05 |
5.40 |
1.80 |
2.80 |
|
|
7.20 |
0.00 |
9.20 |
5.20 |
14.79 |
25.59 |
14.80 |
23.40 |
|
|
4.60 |
7.00 |
3.60 |
13.00 |
3.05 |
10.09 |
2.40 |
5.40 |
|
|
|
1.00 |
0.00 |
2.00 |
0.80 |
3.00 |
3.40 |
3.80 |
3.40 |
|
|
14.60 |
21.80 |
17.80 |
25.40 |
22.80 |
43.80 |
24.80 |
37.20 |
|
|
3.40 |
13.80 |
3.80 |
11.80 |
3.00 |
11.80 |
2.80 |
8.20 |
|
0.5
|
0.866
|
|
|
|
|
|
|
|
|
|
|
3.20 |
2.00 |
3.00 |
2.60 |
3.40 |
1.80 |
3.00 |
2.00 |
|
|
24.40 |
22.00 |
24.60 |
24.00 |
26.40 |
23.60 |
25.80 |
24.00 |
|
|
4.40 |
3.40 |
4.20 |
3.60 |
4.20 |
4.20 |
3.80 |
3.20 |
|
|
|
3.00 |
2.20 |
3.40 |
2.40 |
3.00 |
2.00 |
3.40 |
2.80 |
|
|
34.60 |
35.00 |
35.60 |
34.80 |
36.80 |
35.80 |
37.60 |
36.00 |
|
|
4.00 |
3.20 |
4.20 |
3.20 |
4.80 |
3.40 |
3.40 |
3.60 |
|
|
|
|
|
3.00 |
4.00 |
2.20 |
3.00 |
3.40 |
4.00 |
2.00 |
3.00 |
|
|
16.00 |
31.60 |
18.20 |
27.60 |
19.20 |
34.60 |
21.00 |
30.80 |
|
|
1.20 |
14.40 |
1.80 |
7.00 |
1.40 |
16.60 |
2.20 |
8.20 |
|
|
|
3.20 |
3.00 |
2.80 |
3.40 |
3.80 |
3.20 |
3.20 |
3.20 |
|
|
24.00 |
49.00 |
28.00 |
42.40 |
25.40 |
50.20 |
29.80 |
43.60 |
|
|
2.60 |
16.60 |
2.40 |
11.00 |
3.20 |
17.80 |
2.00 |
11.00 |
that the under-coverage increased, though the asymmetry of coverage was
less severe, for the resulting credible intervals.