Sample allocation for efficient model-based small area estimation
Section 3. Some model-free area allocations
The aim of this section is to list
the five previously presented allocation methods in order to use them later as
references. Depending on which kind of auxiliary information each one uses,
they are divided into two groups: number-based and parameter-based allocations.
3.1 Number-based
allocations
Two basic allocation solutions
commonly used go under the names equal allocation and proportional allocation.
Neither of these allocations contains any specific criterion on the area or
population level. Their implementation requires only information on the number of strata
and the numbers of units
in each stratum.
In the equal area allocation the
sample size
is simply a quotient
It is recommended to
choose the total sample size
so that the quotient is a whole number. This allocation method does
not take differences between the areas into account in any way, which results
in inaccurate area estimates. A natural lower limit of
the sample size is min
Proportional
allocation is a frequently used basic method. Area sample sizes are solved from
If the sizes of the
areas vary strongly, it can lead to situations
where the allocated sample size
for one or more areas. This is an obstacle in calculating direct
design-based estimates of standard errors. One solution is to apply the
combined allocation proposed by Costa, Satorra and Ventura (2004). The idea is a weighted solution between the equal and
proportional allocation depending on the situation. The combined area sample
size is
for a specified
constant
A
minor problem is present if for some areas
A modified solution exists for this case.
3.2 Parameter-based
allocations
These allocations use area-level
information of the study variable
and in some cases of the auxiliary variable
correlated with
The values of
are available for all population units. In
practice the unknown
is replaced with
a proper proxy variable
such as a study variable obtained from an earlier research of the same
subject, or the values of
are generated with a suitable model developed
of a small pre-sample. Also
can be substituted for
Allocation criteria can be set on population
level, only on area level or on combined population and area level.
The Neyman allocation aims at reaching an
optimal accuracy concerning population parameters
(Tschuprow
1923). The standard
deviation of the study variable
or some
proxy variable and the number of units in each area must be known. Allocation
favors large areas with strong variation.
The Bankier or power allocation
(1988) is based on a criterion set on the area level. Area CV values of
are weighted by area total transformations
which contain a tuning constant
In practice
or
must be used in place of
Allocation favors mainly large areas with high
CV.
Choudhry, Rao and Hidiroglou (2012) present the NLP allocation
method for direct estimation. This method uses non-linear programming to find a
solution. Criteria for the allocation are defined by setting upper limits for
CV values of the study variable
in each area and in the population. In
practice
or
replaces
The program searches the minimum sample size
satisfying these conditions. The SAS (Statistical Analysis System) procedure NLP with Newton-Raphson
option was used to find the solution. The allocation favors areas with high CV regardless
of the area size
A summary of the model-free
allocations and the formulas for calculating
area sample sizes are presented in Table 3.1.
Table 3.1
Summary of number-based and parameter-based allocations
Table summary
This table displays the results of Summary of number-based and parameter-based allocations. The information is grouped by Allocation (appearing as row headers), Computing area sample size xxxxx and Optimality level (appearing as column headers).
| Allocation |
Computing area sample size |
Optimality level |
| Equal |
|
Area |
| Proportional |
|
Population |
| Neyman |
where is the standard deviation of
(in practise or in area |
Population |
| Bankier |
where is the area total of
and is a tuning constant. In practise or
replace |
Area |
| NLP |
satisfying tolerances and In practise or replace |
Jointly population and area
|
Some other parameter-based
allocation methods
are mentioned briefly. For example Longford (2006) introduced inferential
priorities
for the strata
and
for the population and used those constraints for allocation. Another solution is presented by Falorsi and Righi
(2008). This
solution does not contain a direct imposition of quotas, but tries to solve the
comprehensive collection of data by
using a multi-stage sampling design, so that the area estimation can be implemented effectively.
ISSN : 1492-0921
Editorial policy
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.
© Minister of Industry, 2017
Use of this publication is governed by the Statistics Canada Open Licence Agreement.
Catalogue No. 12-001-X
Frequency: semi-annual
Ottawa