Development of a small area estimation system at Statistics Canada
Section 5. Hierarchical Bayes (HB) method
The basic Fay-Herriot area
level model includes a linear sampling model for direct survey estimates
and a linear linking model for the parameters of interest. Such models are matched because
appears
as a linear function in both the sampling and linking models. There are
instances when these equations are not matched such as when a function,
is modelled as a linear function of explanatory
variables instead of
The sampling
model and linking model pair is
and
where
and
The model pair given by (5.1) and (5.2) is referred to
as an unmatched model. Nonlinear
linking models are often needed in practice to provide a better model fit to
the data. For example, if the parameter of interest is a probability or a rate
within the range of 0 and
1, a
linear linking model with normal random effects may not be appropriate. A
linking model, in this case, could be a logistic or log-linear model. Such a
model was used to adjust counts for detailed levels for the 2011 Census of
Canada. A good description of what is involved to carry out such an adjustment
can be found in Dick (1995) and You, Rao and Dick (2004).
The production
system includes the following choices of
The inclusion of
corresponds to the matched model represented
by equations (3.1) and (3.2). An advantage of choosing the Hierarchical
Bayes method is that the estimated
cannot
be negative. The function
where
is equal
to the population mean
was used
in Fay and Herriot (1979). Their context was to estimate per capita income
(PCI) for small places in the
United
States
with a population less than 1,000. The function
was included to support the methodology to estimate the net undercoverage in Canadian Censuses. In this model,
represents the number of individuals not
counted in the census, while
is the known census count. As a result,
is the proportion of individuals undercounted
by the Census.
The sampling
variances,
are
assumed known for all the linking models represented by (5.2). The variances
are assumed to be estimated for the first two functions (the matched
Fay-Herriot and unmatched log-linear model) given in (5.3). If the
sampling variances,
are assumed known, then the unknown parameters
in the sampling model (5.1) and the linking model (5.2) can be presented in a
hierarchical Bayes (HB) framework as follows
and
If the sampling variances are unknown, they are
estimated by adding
where
follows a chi-square distribution with
degrees of freedom.
The model parameters
and
(when it
is unknown) are assumed to obey prior distributions. The distributions used in the production system for
and
are the flat
prior,
and
If
is
estimated, the prior
is added
to the Bayesian model. These prior distributions are multiplied by the
density functions of the distributions associated with the sampling and linking
models. This yields a joint likelihood function in terms of the model
parameters. This function is used to obtain a full conditional (posterior)
distribution for each of the unknown parameters. For some of these, the
resulting distribution has a tractable or well-known form. For others, the
resulting distribution is a product of density functions with no known form.
All HB methods involve estimation of the model parameters through repeated sampling of their respective full
conditional distributions.
Markov Chain Monte Carlo (MCMC) methods are used to
obtain estimates from the full conditional distribution of each parameter.
Gibbs sampling is used repeatedly to sample from the full conditional
distributions. The Gibbs sampling method (Gelfand and Smith, 1990) with the
Metropolis-Hastings algorithm (Chib and Greenberg, 1995) are used to find the
posterior means and posterior variances; see Estevao et al. (2015) for
details. The various estimators of
resulting from (5.3) are denoted as
5.1 Benchmarked HB estimator
Benchmarking of the estimators uses the difference
adjustment method described in Section 3.2. That is, the benchmarked
estimators
are computed as
where
for
and
is the
benchmark value. The terms
are
defined as follows:
if the
benchmarking is to a total, and
if the
benchmarking is for the mean. The
are
either known or unknown. The
can be a
value provided by the user that represents the total or mean of the
-values of population
The
benchmarking ensures that
ISSN : 1492-0921
Editorial policy
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.
© Her Majesty the Queen in Right of Canada as represented by the Minister of Industry, 2019
All rights reserved. Use of this publication is governed by the Statistics Canada Open Licence Agreement.
Catalogue No. 12-001-X
Frequency: Semi-annual
Ottawa