4. Computing response propensities using latent trait models
Alina Matei and M. Giovanna Ranalli
Previous | Next
The
variable
can be computed
using a latent trait model. In general, latent variable models are multivariate
regression models that link continuous or categorical responses to unobserved
covariates. A latent trait model is essentially a factor analysis model for
binary data (see Bartholomew,
Steele, Moustaki and Galbraith 2002; Skrondal
and Rabe-Hesketh 2007).
We
start by creating the matrix with elements
Figure 4.1 shows
a schematic of the indicators
for respondents
and nonrespondents. Then, we assume that the factors that drive unit response
are the same as those that drive item response on selected variables of
interest. In other words, item nonresponse is assumed nonignorable.
Figure
4.1 Schematic representing variables
for the sets
and

Description for Figure 4.1
Let
be the
probability of response of unit
for item
for all
and
As in the case
of unit nonresponse,
is modelled as a
function of the variable of interest using logistic regression as follows
for
and
where
and
are parameters.
Since
is known only
for units with
Model (4.1)
cannot be estimated. As in the case of unit nonresponse, we propose to estimate
as a function of
an auxiliary variable related to the variable of interest, that is
Model (4.1) is
rewritten
for
and
Model (4.2) is
not an ordinary logistic regression model, because the
are unobservable
values taken by a latent variable. Latent trait models can be used in this case
to estimate
and the model
parameters. Note that in the area of educational testing and psychological
measurement, latent trait modelling is termed Item Response Theory.
The
Rasch model (Rasch 1960) is a first simple latent trait model that is well
known in the psychometrical literature and used to analyze data from
assessments to measure variables such as abilities and attitudes. It takes the
following form
The parameters
are estimated
for each item
and reflect the
extremeness (easiness) of item
larger values
correspond to a larger probability of a positive response at all points in the
latent space. The parameter
is known as the
‘discrimination’ parameter and can be fixed to some arbitrary value without
affecting the likelihood as long as the scale of the individuals’ propensities
is allowed to be free. In many situations the assumption that item
discriminations are constant across items is too restrictive. The two-parameter
logistic (2PL) model generalizes the Rasch model by allowing the slopes to
vary. Specifically, the 2PL model assumes the form given in Equation (4.2). The
parameters
are now
estimated for each item
and provide a
measure of how much information an item provides about the latent variable
To achieve
identifiability of Model (4.2), we can fix the value of one or more parameters
and
in the
estimation process. Moran (1986) showed that in the 2PL model, all the
parameters are identifiable under wide conditions, provided the number of items
exceeds two, and all the slopes are assumed to be strictly positive. A further
generalization to Model (4.2) is considered in the literature - the 3PL model -
that includes another parameter, the guessing parameter, to model the
probability that a subject with a latent variable tending to
responds to an
item. Such an extension does not seem necessary in the context at hand and will
not be considered further.
4.1
Assumptions in latent trait models
Latent
trait models typically rely on the following assumptions. The first one is the
so-called conditional independence assumption, which postulates that
item responses are independent given the latent variable (i.e., the latent
variable accounts for all association among the observed variables
Consequently,
given
the conditional
probability of
is
Following Bartholomew et al. (2002, page 181) ‘the assumption of
conditional independence can only be tested indirectly by checking whether the
model fits the data. A latent variable model is accepted as a good fit when the
latent variables account for most of the association among the observed
responses.’
A
second assumption of Models (4.2) and (4.3) is that of monotonicity: as
the latent variable
increases, the
probability of response to an item increases or stays the same across intervals
of
In other words,
for two values of
say
and
and arbitrarily
assuming that
monotonicity
implies that
for
Larger values of
are associated
with a greater chance of a response to each item.
Finally,
the third, and possibly strongest, assumption of Models (4.2) and (4.3) is that
of unidimensionality, implying that a single latent variable fully
explains the willingness of unit
to answer the
questionnaire. All these basic assumptions imply that the dependence between
the items
may be explained
by the latent variable
which represents
the units’ willingness and that the probability that a unit
responds to a
given variable increases with
4.2
Estimation of the model
In
what follows we focus on the two-parameter logistic (2PL) model given in (4.2).
Let
and
Model (4.2) can
be fitted using maximum likelihood or bayesian methods. We focus here on the
former. Under the maximum likelihood approach, three major methods - joint,
conditional and marginal maximum likelihood - are developed. Here, we will
concentrate on marginal maximum likelihood that can be applied to fit the 2PL
model. This method is also used in the simulation studies of Section 6. It
consists of maximizing the likelihood of the model after the
are integrated
out on the basis of a common distribution assumed on these parameters. In
particular, it is assumed that
is a random
variable following a distribution with the density function
typically
It is also
assumed that the response vectors
are independent
of one another and the conditional independence assumption holds.
For
a set of
respondents
having the response vectors
the marginal
likelihood can be expressed as
where
and
now denotes the
density of the
distribution.
The method consists in maximizing the corresponding log-likelihood, given by
with respect to
using, for
example, the EM algorithm. Estimates of
and
are thus
provided. Afterwards,
is estimated
using the empirical Bayes method by maximizing the posterior density
with respect to
and keeping item
parameters and observations fixed. Estimates of
are obtained
using Expression (4.2), where
and
are replaced
with their estimates.
4.3.
Goodness-of-fit measures of the model
Different
goodness-of-fit measures are proposed in the literature to test whether the
model given in (4.2) adequately fits the data (see e.g., Bartholomew et al.
2002). One uses two-way and three-way margins of the response items. Discrepancies
between the expected
and observed
counts in these
tables are measured using the statistic
Large values of
for the
second-order or third-order margins will identify sets of items for which the
model does not fit well. Note that the residuals
are not
independent and they cannot be summed to give an overall test statistics
distributed as a chi-squared (see Bartholomew et al. 2002, page 186). Item
fit indexes (Bond and Fox 2007) can be used to this end as well. On the basis
of estimated latent variables and item parameters, the expected response of a
unit to an item can be computed. The similarity between the observed and
expected responses to any item can be assessed through two fit mean-square
statistics: the outlier-sensitive fit statistic (item outfit) and the
information-weighted fit statistic (item infit). The estimate produced by the
item outfit is relatively more affected by unexpected responses different from
a person’s measure, i.e., it is more sensitive to unexpected observations by
units on items that are relatively very easy or very hard for them to answer.
The item infit has each observation weighted by the information and, on the
other side, is relatively more affected by unexpected responses closer to a
person’s measure, i.e., it is more sensitive to unexpected patterns of
observations by units on items that are roughly targeted on them according to
their latent variable value. The expected value for both statistics is one. For
infit and outfit values greater/less than one indicate more/less variation
between the observed and the predicted response patterns, a range of 0.5 to 1.5
is generally acceptable (Bond and Fox 2007).
In
addition, point-measure correlations (Olsson, Drasgow and Dorans 1982) can be
used to estimate the correlation between the latent variable and the single
item response. Items for which such measures take negative or zero values
should be removed from the analysis or may be evidence that the latent
construct is not unidimensional. Unidimensionality can be tested by running a
Principal Components Analysis (PCA) of the standardized residuals for the items (Wright
1996). In this way the first component (dimension) has already been removed,
and it is possible to look at secondary dimensions, components or contrasts.
Unidimensionality is supported by observing that the eigenvalue of the first
PCA component in the correlation matrix of the residuals is small (usually less
than 2.0). If not, the loadings on the first contrast indicate that there are
contrasting patterns in the residuals.
Finally,
when items are used to form a scale, they need to have internal consistency.
Cronbach alpha can be used to test whether items have the reliability property,
i.e., if they all measure the same thing, then they should be correlated with
one another.
4.4. Estimation of
Two
solutions are shown here to estimate
using
information from the latent trait model. The first solution uses logistic
regression to estimate
for all
and a two-stage
approach.
Stage 1: First, an estimate
of
is provided. To
compute a value
for
we assume again
that unit nonresponse is just an extreme form of item nonresponse. Thus, a
nonrespondent does not answer any item
and thus
for all
The computation
of
for
is handled as
follows: we add to the set
a phantom
respondent unit
having
equal to 0, for
all
We denote this
new set by
We estimate the
parameters of Model (4.2) using all units
and compute the
values
Model (4.2)
allows the computation of
for all
Unit
has an estimated
value
We assign to all
units
an estimate
equal to
Thus, the same
value of
is provided for
all
Using this
method, each unit
has associated
an estimate
This is the key
feature for the estimation of the response probabilities
provided in the
next stage.
Stage 2: We use the estimate
for
provided in the
first stage as a covariate in Model (3.4) instead of the unknown value of
in particular
Model (4.4) provides estimates of for all
One
of the Referees suggested the following solution to estimate
Let
be the raw score
for unit
i.e., the number
of items unit
has responded
to: if
then
then
Then
can be estimated
by modelling
By the
conditional independence assumption we have
We
have
because
As a result, we
obtain
The estimated response probability
is obtained as a
solution to the polynomial equation
This solution, although very elegant, has two drawbacks. If
is large, the
above polynomial equation is difficult or even impossible to solve. If it
possible to solve the polynomial equation for moderate
the real
solutions are not necessarily in (0, 1). This solution has not been
considered here further.
Previous | Next