4 A static adaptive survey design: Assigning telephone interviewers
Barry Schouten, Melania Calinescu and Annemieke Luiten
Previous | Next
In this section, a simulation study is presented where
telephone interviewer assignment is the design feature of interest. The
response probabilities used in the example are estimated from real telephone
survey data.
The Dutch Survey of Consumer Satisfaction (SCS) is a
monthly telephone survey about the sentiments of households about their
economic situation and expenditure. The survey provides insight into short-term
economic development, and early indicators of differences in consumer trends.
Each month 1,500 households are sampled. The two most influential causes of
nonresponse in the SCS are non-contact and refusal. Of the sample 95% is
contacted, and of the contacted 71% of the households participate. The response
rate is 67%.
One of the most important factors that affect
participation is the interviewer. Interviewer's performance may vary greatly
when it comes to obtaining response. In total 60 interviewers worked on the SCS
during 2005. That means an interviewer had contact with 280 households on
average. Interviewer participation rates ranged from 50% to 79%. The lowest
rate of 50% was, however, exceptional as the one but lowest participation rate
was 61%. The mean interviewer participation rate was 67%. Households were
randomly assigned to interviewers in the CATI management system. Hence, with
respect to the interviewer the data are randomized (or interpenetrated). In the
following, the interviewer will be the design feature of interest. The survey
strategy set consists of sixty strategies,
From the available auxiliary variables a vector was selected containing ethnicity, gender
composition of the household core (male, female or mix), average age of the
household core in 5-year classes, type of household, degree of urbanization of
the neigborhood of residence and average value of houses in the neighborhood.
Especially age, average house value and type of household relate to key
statistics deduced from the SCS. No paradata were available in this study.
Therefore, the adaptive survey design is static. In the optimization the
allocation probabilities need to be chosen, i.e., it needs to be decided to which interviewers subpopulations
based on are assigned (such that ).
The coefficient of variation of the response
propensities defined by (2.8) is selected as the target
quality function. To estimate the response propensities for interviewers, a multilevel model is used
with the identity link function, i.e.,
a linear regression with two levels. The interviewers form the first level of
the model and the households the second level. The multilevel model is used to
separate individual response propensities and interviewer response
propensities. The rationale is that by separating interviewer and individual,
the interviewer effect can be isolated and interviewer assignment can be
optimized. We chose a linear model as it allows for easy optimization. Since
the propensities are never close to 0 or 1, the linear model produces almost the
same estimates as a logit or probit model.
For the interviewer effect it was first investigated
whether it was sufficient to use a fixed slope multilevel model, i.e., the interviewer is added as a main
effect only and there are no interactions with auxiliary variables. All
pre-selected covariates gave significant contributions to the multilevel model,
but none of the interactions with the interviewer were significant at the 5%
level. For this reason, we restrained ourselves to the following main effect model
(4.1)
where is the covariate vector of household the sample size, is the (fixed) interviewer effect for
interviewer is the constant term or intercept and is the slope parameter. We let denote the response propensity of sample
unit
Model (4.1) was fitted to the SCS data set. Next, the
estimated interviewer effect was used to optimize the coefficient of
variation, subject to two cost constraints: both the total interview time and
the individual number of calls for each interviewer must be the same as in the
original design. Since the telephone management system handles the calls, the
interview time is the dominant component in the costs. If we fix the total
interview time, then we constrain costs to be the same as for the regular SCS.
Since interviewers can handle only a certain amount of calls, we must also fix
the number of calls they are allocated to. The first constraint implies that we
fix the response rate, as the total interview time is the multiple of the
average individual interview time and the number of respondents. The SCS
questionnaire is simple and does not contain any nested sets of survey items.
As a result the individual interview time shows hardly any variation over
population subgroups. The second constraint is equal to
(4.2)
where is the pre-specified number of calls for
interviewer and
We optimize the coefficient of variation by distributing
the to the households. Due to the additive nature
of the model, it is easy to show that any permutation of the interviewers to
the cases leads to the same average response propensity and, hence, to the same
interview time and costs. The average response propensity is
which does
not depend on the set of allocation probabilities As a consequence,
optimizing the coefficient of variation amounts to optimizing the variance of
the response propensities
If we restrict
ourselves to 0-1 decision variables, i.e.,
then it is relatively easy to show that the
optimal allocation corresponds to linking the best interviewers to the most
difficult sample units and vice versa. In other words, the sample units are
sorted by putting the individual response propensities without the interviewer
effect, in an increasing order, and the interviewers
are sorted in a decreasing order based on their interviewer effect, If two sample units and are allocated to two different interviewers,
say and and and then it is optimal to switch the two
interviewers, i.e., This can be shown as follows. The difference
in variance is proportional to
(4.3)
From (4.3), we can conclude that there is a decrease in
variance, and, hence, in the coefficient of variation, if we swap the two
interviewers for cases and From this argument, it follows easily that the
optimal solution is as suggested. In a similar fashion, but requiring more
algebra, it can be shown that the optimal solution for probabilistic
allocations, is the same.
The first two rows of table 4.1 contain the average
response propensity and the coefficient of variation before and after
re-assignment of interviewers. The coefficient of variation dropped from 0.117
to 0.035. In order to get an idea of the significance of the change in the
quality function, we computed bootstrap standard errors. For each bootstrap,
the re-assignment of interviewers was performed. The errors are given in table
4.1.
Table 4.1
The average response propensity and coefficient of variation of the regular SCS, the SCS after re-assignment of interviewers without and with adjustment for interview time. Bootstrap standard errors are given within brackets.
Table summary
This table displays the results of the average response propensity and coefficient of variation of the regular scs. The information is grouped by scs (appearing as row headers), adjustment for interview time?, header 1 and header 2 (appearing as column headers).
SCS |
Adjustment for interview time? |
|
|
Regular |
- |
70.8% |
0.117 (0.005) |
Re-assignment |
No |
70.8% |
0.035 (0.003) |
Re-assignment |
Yes |
70.8% |
0.034 (0.003) |
The reader may have noticed that fixed numbers of
interviewer cases do not imply fixed numbers of interviews per interviewer. In
fact, by rearranging the interviewers, the good interviewers will do fewer
interviews as they get the harder cases, while the less good interviewers do
more interviews. As a result, the good interviewers will work smaller numbers
of hours than they would do in the regular SCS and the less good interviewers
will work more. This would be an undesirable side effect, which can, however,
be adjusted relatively easy. Starting from the optimal solution, and sorting
again the sample units based on their individual response propensities without
the interviewer effect, we can shift neighbouring cases from less good
interviewers to better interviewers. This is done in such a way that the total
interview time per interviewer does not exceed that of the regular SCS. One can
again prove that this procedure leads to a new optimal solution where the
constraint on the fixed number of cases in (4.2) is replaced by the constraint
on the fixed number of interviews
(4.4)
where is the pre-specified number of interviews.
Table 4.1 presents the coefficient of variation for the optimal solution given
(4.4). The response rate remains fixed, and the coefficient of variation is
marginally smaller.
In 2009, the SCS survey has been used as an instrument
to test a static adaptive survey design. We refer to Luiten and Wetzels (2009)
and Luiten and Schouten (2013) for details. Interviewer assignment was one of
the main design features that were adapted. Other design features were the
survey mode and the contact protocol. Apart from telephone, also web was
selected as a potential survey mode. Sample units with low estimated contact
probabilities were assigned to more intensive contact protocols and were
prioritized. Based on historical SCS data, contact and response probabilities
were estimated. The pilot succeeded in significantly improving the coefficient
of variation, while fixing the response rate and budget.
In this section, we presented a simulation study where
good telephone interviewers get more difficult cases. This may in practice lead
to annoyance among these interviewers. When implementing such a design, one
should carefully instruct interviewers beforehand. In the 2009 SCS pilot, this
did not lead to any negative comments from interviewers. In face-to-face
surveys, a re-assignment of interviewers cannot be done so easily as travel costs
may change drastically. Still, within densely populated inteviewer regions,
re-assignment may be an option.
Previous | Next