Sample-based estimation of mean electricity consumption curves for small domains
Section 3. Direct estimation methods in the design-based approach
In
this section, we adopt the sampling design approach. This means that the
variable interest values
for each population unit are considered to be deterministic
and the only variable present is that of the construction of the sample. The statistical inference then only describes the randomness
created by the sampling design.
We
will present two classic estimators, the Horvitz-Thompson estimator and the
calibration estimator, which will be the references to which we will compare
our methods to evaluate performances. These are direct estimators, i.e., estimators
constructed by using, for the estimation of the mean for each domain, only
units and auxiliary information related to the domain in question.
The
functional Horvitz-Thompson estimator (Horvitz and Thompson, 1952; Cardo,
Chaouch, Goga and Labruère, 2010) of
is given by:
with
the sampling weight of unit
also called the Horvitz-Thompson weight. It obviously cannot be calculated for the unsampled domains (i.e.,
domains
such that
is empty) and it is extremely unstable for small domains. Moreover, it in no way uses the predictor variables available
to us.
To
take advantage of the auxiliary information, again in a sampling design
approach, we can use the calibration estimator proposed by Deville and Särndal
(1992).
The
calibration estimator for the mean
is given by:
where the calibration weights
are as close as possible to the sampling weights
units of
within the meaning of a certain distance or pseudo-distance
defined by the statistician:
For the distance of chi-square
the weights are given by
and the estimator becomes
where
The calibration weights are not dependent on time
but they are dependent in this case on the domain
therefore, the estimator
does not satisfy the additivity property, i.e.,
where
is the calibration estimator of
Where
the vector
is in the model, thus,
If size
is large, this estimator is approximately bias-free regarding
the sampling plan. We can consider the modified
estimator:
where
does not depend on domain
and, therefore, the estimator
satisfies the additivity property, i.e.,
where
is the calibration
estimator of
As well, if
is large, it has no asymptotic bias even if size
is not large. The asymptotic variance
functions of
and
are equal to the Horvitz-Thompson variances of residuals
and
(see Rao and Molina,
2015).
Nonetheless,
for each domain, these estimates are based only on data from the domain in
question (curves and explanatory variables) without considering the rest of the
sample. Like
the Horvitz-Thompson estimator, they are therefore inaccurate for small domains
and cannot be calculated for unsampled domains.
The
methods that we present in the following section will allow us, by presenting a
model common to all units of the population that describes the link between
variables of interest and auxiliary information, to jointly use all data from
the sample to perform the estimate for each domain, and thus increase the
accuracy for each one. It will also make it possible to even provide estimates for
unsampled domains.
ISSN : 1492-0921
Editorial policy
Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.
Submission of Manuscripts
Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).
Note of appreciation
Canada owes the success of its statistical system to a long-standing partnership between Statistics Canada, the citizens of Canada, its businesses, governments and other institutions. Accurate and timely statistical information could not be produced without their continued co-operation and goodwill.
Standards of service to the public
Statistics Canada is committed to serving its clients in a prompt, reliable and courteous manner. To this end, the Agency has developed standards of service which its employees observe in serving its clients.
Copyright
Published by authority of the Minister responsible for Statistics Canada.
© Her Majesty the Queen in Right of Canada as represented by the Minister of Industry, 2018
Use of this publication is governed by the Statistics Canada Open Licence Agreement.
Catalogue No. 12-001-X
Frequency: Semi-annual
Ottawa