Multilevel time series modelling of antenatal care coverage in Bangladesh at disaggregated administrative levels
Section 4. Time series multilevel modelling
In this study, direct estimates and their standard
errors are available for the survey years 1994, 1997, 2000, 2004, 2007, 2011
and 2014. To account for the varying time-lags of 3 or 4 years between the
subsequent survey years, the MTS models are defined at an annual frequency,
(i.e., values refer to a reference period of one year) at the most detailed
regional level of the 64 districts. With a time span of 21 years, there are 1,344
domain-year combinations. With seven available survey years, the model is
fitted to the 448 domain-year observations. The years between two subsequent
surveys are defined as missing in the model. In this way the period-to-period
evolution of the trend is specified correctly and the model provides
predictions for the missing domain-year combinations.
For convenience let us now denote by the input series for the time series models
for either ANC0 or ANC4 in year and domain This can be the untransformed direct
estimates, the square root transformed direct estimates or the model
predictions obtained with the cross-sectional FH models. Here domain index runs from 1 to and time index from 1 to We further combine these estimates into a
vector a vector of dimension
4.1 Model structure
The multilevel models considered take the general linear
additive form
where is a design matrix for a -vector of fixed effects and the are design matrices for -dimensional random effect vectors Here the sum over runs over several possible random effect terms
at different levels, such as local level and smooth trends at district and
division levels, white noise at the most detailed level of the domains, etc. This is explained in more detail
below. In formula (4.1) denotes, depending on the input series, the
sampling errors of the direct estimates or the prediction errors of the
cross-sectional FH model. The errors are taken to be normally distributed as where If the input series are the untransformed
direct estimates, then is the covariance matrix for the untransformed
direct estimates observed in year If the input series are transformed, then is the covariance matrix for the transformed
direct estimates, as described in Subsection 3.5. If the input series are
the predictions based on the cross-sectional FH models, then contains the estimated mean squared errors of
the FH predictions. Under MTS-II, is diagonal and ignores the correlations
between the domain predictions. Under MTS-III, is a full covariance matrix that also
accommodates the correlations between domain predictions.
Based on the distribution of the sampling errors in (4.1), the likelihood function conditional
on fixed and random effects parameters can be defined as
where is the linear predictor. For the errors a Student-t distribution instead of the normal
distribution can be considered to give smaller weight to more outlying
observations, following West (1984).
The fixed effect part of can contain components like an intercept, a
linear trend, main effects for division and district and possibly the
second-order interactions for linear trends and division or district. The
vector of fixed effects is assigned a normal prior with the identity matrix of dimension This is only very weakly informative as a
standard error of 10 is very large relative to the scales of the (transformed)
direct estimates and the covariates used.
The second term on the right hand side of (4.1) consists
of a sum of contributions to the linear predictor by random effects or varying
coefficient terms. The random effect vectors for different are assumed to be independent, but the
components within a vector are possibly correlated to accommodate
temporal or cross-sectional correlation. To describe the general model for each
vector of random effects, we suppress superscript in what follows for notational convenience.
Each random effects vector is assumed to be distributed as
where and are and covariance matrices, respectively, and denotes the Kronecker product of with The total length of is and these coefficients may be thought of as
corresponding to effects allowed to vary over levels of a factor variable. If, e.g., corresponds to division, then defines different random effects that correspond to
the 7 categories of division. If subsequently corresponds to time, then years. In that case each of the 7 effects can
vary over its 21 levels (years in this case). Each random effect generated for
a division year combination is shared by all districts
belonging to that division in that particular year.
The covariance matrix describes the covariance structure among the
levels of the factor variable, and is assumed to be known. Instead of
covariance matrices, precision matrices are actually used, because of computational
efficiency (Rue and Held, 2005).
The covariance matrix for the varying effects can be parameterized in one of
three different ways: (i) a full parameterized covariance matrix, (ii) a
diagonal matrix with unequal diagonal elements, and (iii) a diagonal matrix
with equal diagonal elements. The scaled-inverse Wishart prior is used as
proposed in O’Malley and Zaslavsky
(2008) and recommended by Gelman
and Hill (2007) when a full covariance matrix is assumed, while
half-Cauchy priors are used for the standard deviations when the covariance
matrix is assumed diagonal with equal or unequal elements. In case of diagonal
variances, half-Cauchy priors are better default priors than the more common
inverse gamma priors (Gelman, 2006).
The following random effect structures are considered in
the model selection procedure:
- Random
intercepts for the domains. In this case and is a scalar variance parameter. This implies and
- First
or second order random walks at different aggregation levels. A first order
random walk or local level trend at district level is defined as with and A second order
random walk or smooth trend model at district level is defined as with and Both kind of
trends can be defined similarly at the division or national level. See Rue and Held (2005) for the specification of the precision matrix for first and
second order random walks. A full covariance matrix for the trend innovations
can be considered to allow for cross-sectional besides temporal correlations,
or a diagonal matrix with different or equal variance parameters to allow for
temporal correlations only. In the case of equal variances, and First and second
order random walk components at district level are denoted below by and respectively. At
division level they are denoted by and
- The
first order random walks as used in our models cannot capture an overall level
as the corresponding random effects are constrained to sum to zero over time.
Similarly, the second order random walks cannot capture both level and linear
trend. This means that level and linear trend must be accommodated by other
model terms, as either fixed or random effects. District-level intercepts have
already been discussed under item 1. To also include linear trends by district,
this component can be extended to random intercepts and slopes linear in time.
In that case can be either a general covariance matrix
accounting for correlations between
intercepts and slopes, or a diagonal matrix with diagonal elements and the variances of the radom intercept and
slopes respectively. This model component is referred to as below.
- Spatial random effects: random intercepts
varying over the spatial location of districts following an intrinsic
conditional autoregressive (ICAR) model (Besag and Kooperberg, 1995),
defined as for each spatial
effect conditional on the others. Here is the set of
domains neighbouring domain and the number of
domains neighbouring domain . See Rue
and Held (2005) for the specification of the
precision matrix This spatial
component is referred to later as
- White
noise: to allow for random unexplained variation, white noise at the most
detailed domain-by-year level can be included. In this case and a scalar variance parameter. This implies
We also investigated generalisations of (4.3) to
non-normal distributions of random effects by implementing Student-t, horseshoe
prior (Carvalho, Polson and
Scott, 2010) and Laplace (Tibshirani, 1996; Park and Casella, 2008).
These alternative distributions have fatter tails allowing for occasional large
effects. However, these distributions did not improve results for the
considered target variables in terms of model information criteria as well as
the underlying trend predictions. Therefore the normal distribution is used for
all random effect components. The exact lay out of the final MTS models for
ANC0 and ANC4 are specified in Subsections 5.1 and 5.2 respectively.
4.2 Model estimation
The models are fitted using Markov Chain Monte Carlo
(MCMC) sampling, in particular the Gibbs sampler (Geman and Geman, 1984; Gelfand and Smith, 1990). See Boonstra and van den Brakel (2022)
for a specification of the full conditional distributions. The models specified
in Subsection 4.1 are run in R (R Core Team,
2015) using package mcmcsae
(Boonstra, 2021). The Gibbs
sampler is run in parallel for three independent chains with randomly generated
starting values. In the model building stage 1,000 iterations are used, in
addition to a “burn-in” period of 100 iterations. This was sufficient for
reasonably stable Monte Carlo estimates of the model parameters and trend
predictions. For the selected model we use a longer run of 1,000 burn-in plus 5,000
iterations of which the draws of every fifth iteration are stored. This leaves
draws to compute
estimates and standard errors. The convergence of the MCMC simulation is
assessed using trace and autocorrelation plots as well as the Gelman-Rubin
potential scale reduction factor (Gelman
and Rubin, 1992), which diagnoses the mixing of the chains. For the
longer simulation of the selected model all model parameters and model
predictions have potential scale reduction factors below 1.01 and sufficient
effective numbers of independent draws.
Many models of the form (4.1) have been fitted to the
data. For the comparison of models using the same input data we use the Widely
Applicable Information Criterion or Watanabe-Akaike Information Criterion
(WAIC) (Watanabe, 2010, 2013)
and the Deviance Information Criterion (DIC) (Spiegelhalter, Best, Carlin and van der Linde, 2002). We also compare the models
graphically by their model fits and trend predictions at three aggregation
levels.
ISSN : 1492-0921
Editorial policy
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.
© His Majesty the King in Right of Canada as represented by the Minister of Industry, 2022
Use of this publication is governed by the Statistics Canada Open Licence Agreement.
Catalogue No. 12-001-X
Frequency: Semi-annual
Ottawa