Conditional calibration and the sage statistician
Section 5. The Bayesian posterior distribution of θ
The Bayesian approach differs from the Neymanian approach (and from Fisher’s fiducial approach) by formulating the problem so that a real conditional probability distribution for the estimand θ can be calculated, using the laws of probability theory to condition on the fact that the observed data equal X_obs; this distribution is called the posterior distribution of θ, that is, posterior after seeing X_obs.
To conduct this activity formally, θ must be a random variable, and thus θ needs to have a “starting” probability distribution, called its prior distribution, meaning prior to seeing any data; in the context of our setup, this prior distribution is a distribution over the possible local estimands, that is, a set of T probabilities (summing to one), one probability for each possible Truth. This prior distribution is essentially a set of T weights, π_1, …, π_T, reflecting the prior beliefs of experts that each of the T possible local estimands is the correct one. The Neymanian frequentist has no use for such weights over the set of possible Truths, because the 95% is supposed to hold for any set of weights, and thus for each possible Truth (i.e., for all T point-mass prior distributions).
Now comes the part of the argument that hints at a departure from Neyman’s 1970s claim to me that conditional inference is too difficult. In the context of the simulation just described, and admitting some Bayesian or fiducial logic, when confronted with the actual observed data set X_obs, attention should be focused on the parts of the simulation where the generated X* equals X_obs; the other draws can be ignored (at least in the context of the idealized description here, where the number of draws N is essentially infinite) because, to be fully Bayesian, we want to condition on X* equaling X_obs.
In fact, let us use the simulation itself to describe the Bayesian posterior distribution of θ, i.e., the distribution of θ conditioning on the fact that X* = X_obs. Let p_t be the proportion of the generated values of X* that match X_obs for Truth t; that is, for truth t, p_t is the proportion of the generated data sets from truth t that match the actual data set X_obs. For example, if p_t is zero, then the a priori possible truth t could not be the actual truth, because it could not have generated the observed data X_obs.
The posterior probability that the estimand θ equals θ_t, the local value of θ for Truth t, is π_t p_t divided by the weighted average of the p_s proportions, weighted by the prior probability π_s that each Truth s is the correct truth. Here, this weighted average of proportions, Σ_s π_s p_s, is the probability of the observed data X_obs and is labelled p(X_obs); thus the posterior probability of Truth t equals π_t p_t / p(X_obs). We could call p_t the estimated ability of Truth t to match the observed data X_obs. This description of the posterior distribution of θ using simulation is from Rubin (1984); see Figure 5.1.
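As a minimal numerical sketch of this weighted-average calculation (all numbers hypothetical, for illustration only): with three possible Truths carrying prior weights π_t and estimated matching proportions p_t, the posterior probability of each Truth is π_t p_t / Σ_s π_s p_s.

```python
# Posterior over a discrete set of possible Truths:
# posterior_t = prior_t * p_t / marginal, where p_t is the proportion of
# data sets generated under Truth t that match the observed data X_obs.
# All numbers here are hypothetical, for illustration only.

prior = [0.5, 0.3, 0.2]      # prior weights pi_t over three Truths (sum to one)
p_match = [0.02, 0.08, 0.0]  # p_t: estimated ability of each Truth to match X_obs

# p(X_obs): the weighted average of the p_t, weighted by the prior
marginal = sum(w * p for w, p in zip(prior, p_match))
posterior = [w * p / marginal for w, p in zip(prior, p_match)]

print(posterior)
```

Note that the third Truth, with p_t = 0, receives posterior probability zero, exactly as in the text: a Truth that could not have generated X_obs cannot be the actual truth.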

Description for Figure 5.1
Description of the posterior distribution from Rubin (1984). Suppose we first draw N equally likely values of θ from its prior distribution and label these θ*_1, …, θ*_N. The θ*_n can be thought of as representing the possible populations that might have generated the observed X_obs. For each θ*_n, we now draw an X from p(X | θ*_n); label these X*_1, …, X*_N. The X*_n represent possible values of X that might have been observed under the full model. Now some of the X*_n will look just like the observed X_obs, and many will not; of course, subject to the degree of rounding and the number of possible values of X, N might have to be very large in order to find generated X*_n that agree with the observed X_obs, but this creates no problem for our conceptual experiment. Suppose we collect together all X*_n that match the observed X_obs, and then all θ*_n that correspond to these X*_n. This collection of θ*_n represents the values of θ that could have generated the observed X_obs; formally, this collection of θ values represents the posterior distribution of θ. An interval that includes 95% of these values of θ is a 95% probability interval for θ and has the frequency interpretation that, under the model, 95% of the populations that could have generated the data are included within the 95% interval.
There are objections to this approach. First, where do the prior weights π_t come from, and who are the experts providing these weights? Perhaps we should find some way to avoid using these potentially overly subjective prior weights. Second, perhaps the requirement for exact equality between a generated data set X* and the observed data set X_obs should be relaxed in some way, so that a generated X* does not have to equal X_obs exactly but only “look like” it came from the same distribution as did X_obs, and so match X_obs in some way.
More on this second point first, which is clearly important when trying to conduct an actual simulation like this idealized one with a finite budget. Approximate equality between generated data X* and observed data X_obs can be achieved in situations with low-dimensional sufficient statistics, because only those statistics have to match. But this idea of generated data X* being “close to” observed data X_obs is the basis of all work using this description of the posterior distribution to conduct “ABC”, Approximate Bayesian Computation, apparently first described in the paragraph in Figure 5.1 (https://en.wikipedia.org/wiki/Approximate_Bayesian_computation; Tavaré, Balding, Griffiths and Donnelly, 1997). We simply assume at this point that we have chosen some such metric to define the function p_t and use it to define the ability of Truth t to generate data sets that match the observed data, X_obs.
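As a sketch of the ABC idea (a hypothetical normal-mean example, not from the text): instead of demanding exact equality, a generated data set is accepted when a low-dimensional summary statistic — here the sample mean, which is sufficient for a normal mean with known variance — falls within a tolerance ε of the observed summary.

```python
# ABC rejection sketch (hypothetical example): accept theta* when the
# generated data's summary statistic is within eps of the observed one.
import random
import statistics

random.seed(1)

x_obs = [4.8, 5.3, 5.1, 4.6, 5.4, 5.0, 4.9, 5.2]  # hypothetical observed data
s_obs = statistics.mean(x_obs)  # sufficient statistic for a normal mean
eps = 0.05                      # tolerance defining "close to" the observed summary

accepted = []
for _ in range(100_000):
    theta_star = random.gauss(0.0, 10.0)  # theta* from a broad normal prior (assumed)
    x_star = [random.gauss(theta_star, 1.0) for _ in x_obs]  # X* | theta*, sd 1 assumed
    if abs(statistics.mean(x_star) - s_obs) < eps:  # match on the summary, not raw data
        accepted.append(theta_star)

print(len(accepted), round(statistics.mean(accepted), 2))
```

Shrinking ε toward zero recovers the exact-matching description above at the cost of fewer acceptances; in practice ε trades off fidelity of the approximate posterior against the simulation budget.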