Bayesian small area demography
Section 3. Interpolating and forecasting obesity prevalence
3.1 The estimation problem
In
New Zealand, as in most countries, obesity rates are rising. Public health
researchers and policy makers monitor and forecast obesity prevalence, to
assess the success, or otherwise, of obesity-reduction measures, and to gauge
future demand for services.
The
main source of data on obesity prevalence in New Zealand is the New Zealand
Health Survey, a nationally-representative survey of around 19,000 people
(Ministry of Health, 2013). Like most household surveys, it has a complex
design, with stratification and clustering. Obesity is measured using body mass
index (BMI). A person is defined as being obese if he or she has a BMI of 30 or
higher.
Surveys
were carried out in 1997, 2003, 2007, 2012, and 2013. We use data for all these
years. Our objective is to obtain prevalence estimates for the period 1997-2013,
including non-survey years, and then forecast for the period 2014-2023. Our
estimates and forecasts are disaggregated into age groups 15-24, 25-34, 35-44,
45-54, 55-64, 65-74, and 75+.
3.2 The model
Our
main input data are published estimates for the proportion of New Zealanders
aged
at time
who are obese, which we denote
and the published standard errors for the
denoted
The
are graphed in Figure 3.1.

Description for Figure 3.1
Figure presenting the proportion of obesity in New Zealand, by age and year, as estimated in the New Zealand Health Survey. There are five linear graphs, one by year for 1997, 2003, 2007, 2012 and 2013. The percent is on the y-axis, ranging from 10% to 35%. The age is on the x-axis, going from 20 to 80.
In 1997, the proportion of obesity was about 10% for age 20. It increased to about 30% at age 50 before decreasing to about 12% at age 80. In 2003, the proportion of obesity was about 12% for age 20. It increased to about 33% at age 60 before decreasing to about 18% at age 80. In 2007, the proportion of obesity was about 13% for age 20. It increased to about 36% at age 60 before decreasing to about 21% at age 80. In 2012, the proportion of obesity was about 20% for age 20. It increased to about 37% at age 70 before decreasing to about 23% at age 80. In 2013, the proportion of obesity was about 21% for age 20. It increased to about 38% at age 70 before decreasing to about 27% at age 80.
When
individual-level data are available, the standard Bayesian approach towards
accounting for complex survey design is to include as many features of the
design as possible in the estimation model (Gelman et al., 2014,
Chapter 8). Chen, Wakefield and Lumely (2014) show, however, how the full
individual-level approach can be approximated by an aggregate-level approach
that starts from design based estimates such as
and
Chen et al. (2014) assume that the
design-based estimates are constructed so as to reflect all the important
features of the survey design, and show how these estimates can be converted
into a form suitable for inclusion in an aggregate-level model.
Applying
the approach of Chen et al. (2014), we approximate the individual-level
approach using a Binomial likelihood. We obtain counts of individuals with
obesity
and total counts of individuals
by finding
and
such that
and
The likelihood is
Here
is the super-population probability of
obesity: the probability that a person aged
at time
is obese. Our objective is to estimate
for past years, including years without survey
data, and to forecast
for future years.
Our
prior model for
is
which includes age and time effects, but not an age-time interaction. We
experimented with an age-time interaction, but found that its size was small
enough to omit (results not shown).
As
with the mortality model of Section 2, we use a local trend model for the
age effect, though in the obesity case we do not have an infant covariate. The
rationale for using a local trend model is, once again, to capture the
correlations between neighbouring age groups. We also use the same prior for
the intercept as we do in Section 2, a normal distribution with mean 0 and
standard deviation 10.
We
use a local trend model for time,
but with two different sets of assumptions about innovation terms
and
In
our first version, we assume that
and
are always very close to 0, which we implement
by using extremely tight priors on the standard deviations for these terms. The
standard deviations for both terms have
priors with scales of 0.001. This version of
the local trend model essentially fits a straight line through the data. Aside
from assuming no change, this is perhaps the most common approach to
forecasting future rates in epidemiology and demography. We refer to this model
as the “straight line” model.
Our
second version is a generalization of the first. Rather than assuming that
and
are always close to 0, we allow them to take
values that imply year-on-year changes in obesity rates of a few percentage
points. We do this by setting the scale of the prior for the standard deviation
of
to 0.05 and setting the scale of the prior for
standard deviation of
to 0.025. We use a larger scale for
than for
on the basis that levels change more rapidly
than systematic trends. We refer to the model based on this version of the time
effect as the “flexible” model.
We
carry out the estimation using our package demest,
with the same settings for burnin, production, chains, and thinning as for the
mortality application.
3.3 Results
Figure 3.2
shows results based on the “straight line” model. Estimates for survey years
are shown in red, and estimates and forecasts for the remaining years are shown
in blue. As is conventional with forecasting, we use 80% credible intervals,
rather than 95%.
Estimates
for years with survey data are more precise than those for years without survey
data, as we would expect. Estimates for years between surveys are more precise
than those for forecasts. The differences in precision between estimates and
forecasts are, nevertheless, small. Strong assumptions about linearity lead to
precise forecasts.

Description for Figure 3.2
Figure presenting the estimates and forecasts of obesity prevalence in New Zealand based on the “straight line” model. There are seven graphs, one for each of the following age group: 15-24, 25-34, 35-44, 45-54, 55-64, 65-74, 75 and over. Each graph contains a line representing posterior medians with an 80% credible interval around. The obesity prevalence is on the y-axis, ranging from 0.0 to 0.5. The years are on the x-axis going from 1997 to 2023. For each age group, the obesity prevalence is increasing throughout the years. The obesity prevalence is also increasing the older people get before starting to decrease at age 65.
Figure 3.3
shows results based on the “flexible” model. Point estimates and forecasts from
the flexible model are indistinguishable from those of the straight line model.
The level of uncertainty, however, is clearly different. Compared with the
straight line model, there is a modest increase in uncertainty for years
between surveys and a large increase in uncertainty for the forecast period,
particularly in later years.

Description for Figure 3.3
Figure presenting the estimates and forecasts of obesity prevalence in New Zealand based on the “flexible” model. There are seven graphs, one for each of the following age group: 15-24, 25-34, 35-44, 45-54, 55-64, 65-74, 75 and over. Each graph contains a line representing posterior medians with an 80% credible interval around. The obesity prevalence is on the y-axis, ranging from 0.0 to 0.5 for the first four age groups and from 0.0 to 0.6 for the last three. The years are on the x-axis going from 1997 to 2023. For each age group, the obesity prevalence is increasing throughout the years. The obesity prevalence is also increasing the older people get before starting to decrease at age 65. For each age group, there is a modest increase in uncertainty for years between surveys and a large increase in uncertainty for the forecast period, particularly in later years.
The
flexible model, arguably, gives a better representation of knowledge about
obesity trends in New Zealand than the linear model. The linear assumption is
conventional, but does not have any strong theoretical basis. Over-reliance on
the linear assumption can produce over-confidence. The flexible model
illustrates the implications of weaker assumptions.
ISSN : 1492-0921
Editorial policy
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.
© Her Majesty the Queen in Right of Canada as represented by the Minister of Industry, 2019
Use of this publication is governed by the Statistics Canada Open Licence Agreement.
Catalogue No. 12-001-X
Frequency: Semi-annual
Ottawa