A short note on quantile and expectile estimation in unequal probability samples

1. Introduction

Quantile estimation and quantile regression have seen a number of new developments in recent years, with Koenker (2005) as a central reference. The principal idea is to estimate an inverted cumulative distribution function, generally called the quantile function $Q(α) = F^{-1}(α)$ for $α \in (0,1)$, where the 0.5 quantile $Q(0.5)$, the median, plays a central role. For survey data stemming from an unequal probability sample with known probabilities of inclusion, Kuk (1988) shows how to estimate quantiles taking the inclusion probabilities into account. The central idea is to estimate the distribution function of the variable of interest and to invert it in a second step to obtain the quantile function. Chambers and Dunstan (1986) propose a model-based estimator of the distribution function. Rao, Kovar and Mantel (1990) propose a design-based estimator of the cumulative distribution function using auxiliary information. Bayesian approaches in this direction have recently been proposed in Chen, Elliott, and Little (2010) and Chen, Elliott, and Little (2012).
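To make the two-step idea concrete (estimate a distribution function, then invert it), the following is a minimal sketch rather than the estimator of any of the papers cited above. It assumes design weights $1/π_i$ normalized to sum to one (a Hájek-type choice made here for illustration); the function name `weighted_quantile` and its arguments are hypothetical.

```python
import numpy as np

def weighted_quantile(y, pi, alpha):
    """Invert a design-weighted distribution function estimate at level alpha.

    y     : sampled values of the variable of interest
    pi    : known inclusion probabilities of the sampled units
    alpha : quantile level in (0, 1)

    Assumption: weights 1/pi are normalized to sum to one (Hajek-type).
    """
    y = np.asarray(y, dtype=float)
    w = 1.0 / np.asarray(pi, dtype=float)      # design weights
    order = np.argsort(y)
    y_sorted, w_sorted = y[order], w[order]
    # Step 1: weighted empirical distribution function at the sorted values
    F_hat = np.cumsum(w_sorted) / np.sum(w_sorted)
    # Step 2: inversion, i.e. the smallest y with F_hat(y) >= alpha
    idx = np.searchsorted(F_hat, alpha, side="left")
    return y_sorted[min(idx, len(y_sorted) - 1)]
```

With equal inclusion probabilities this reduces to an ordinary empirical quantile; a unit with a small inclusion probability receives a large weight and can therefore move the estimate noticeably.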

Quantile estimation results from minimizing an $L_{1}$ loss function, as demonstrated in Koenker (2005). If the $L_{1}$ loss is replaced by the $L_{2}$ loss function, one obtains so-called expectiles, as introduced in Aigner, Amemiya and Poirier (1976) or Newey and Powell (1987). For $α \in (0,1)$, this leads to the expectile function $M(α)$ which, like the quantile function $Q(α)$, uniquely defines the cumulative distribution function $F(y)$. Expectiles are relatively easy to estimate and have recently gained some interest; see, e.g., Schnabel and Eilers (2009), Pratesi, Ranalli, and Salvati (2009), Sobotka and Kneib (2012) and Guo and Härdle (2013). However, since expectiles lack a simple interpretation, their acceptance and usage in statistics is less developed than that of quantiles; see Kneib (2013). Quantiles and expectiles are connected in that a unique and invertible transformation function $h_{y} : [0,1] \to [0,1]$ exists so that $M(h_{y}(α)) = Q(α)$; see Yao and Tong (1996) and De Rossi and Harvey (2009). This connection can be used to estimate quantiles from a set of fitted expectiles. The idea has been used in Schulze Waltrup, Sobotka, Kneib and Kauermann (2014), and the authors show empirically that the resulting quantiles can be more efficient than empirical quantiles, even if a smoothing step is applied to the latter (see Jones 1992). An intuitive explanation is that expectiles account for all the data, while quantiles based on the empirical distribution function only take the left (or the right) hand side of the data into account. That is, the median is defined by the 50% left (or 50% right) part of the data, while the mean (as the 50% expectile) is a function of all data points. In this note we extend these findings and demonstrate how expectiles can be estimated for unequal probability samples and how to obtain a fitted distribution function from fitted expectiles.
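As an illustration of the asymmetric $L_{2}$ criterion, the sketch below computes a sample expectile by iteratively reweighted averaging: observations above the current value receive weight $α$, the others weight $1 - α$. It is a generic asymmetric least squares fit, not the estimator developed later in this note; the function name `expectile` and the optional `weights` argument (which could hold design weights such as inverse inclusion probabilities) are assumptions made for the example.

```python
import numpy as np

def expectile(y, alpha, weights=None, tol=1e-10, max_iter=100):
    """Sample expectile via iteratively reweighted averaging.

    Minimizes sum_i weights_i * a_i(m) * (y_i - m)^2 over m, where
    a_i(m) = alpha if y_i > m and 1 - alpha otherwise.
    `weights` is a hypothetical hook for design weights (default: all ones).
    """
    y = np.asarray(y, dtype=float)
    w = np.ones_like(y) if weights is None else np.asarray(weights, dtype=float)
    m = np.average(y, weights=w)               # start at the (weighted) mean
    for _ in range(max_iter):
        a = np.where(y > m, alpha, 1.0 - alpha) * w
        m_new = np.sum(a * y) / np.sum(a)      # weighted mean under current asymmetry
        if abs(m_new - m) < tol:
            return m_new
        m = m_new
    return m
```

For $α = 0.5$ the asymmetry vanishes and the result is the mean, which mirrors the remark above that the mean is the 50% expectile and depends on every data point.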

The paper is organized as follows. Section 2 introduces the necessary notation and discusses quantile regression in unequal probability sampling. Section 3 extends this to expectile estimation. Section 4 uses the connection between expectiles and quantiles to derive quantiles from fitted expectiles. Section 5 demonstrates in simulations the efficiency gain of quantiles derived from expectiles, and Section 6 concludes with a discussion.

ISSN : 1492-0921

Editorial policy

Survey Methodology publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves. All papers will be refereed. However, the authors retain full responsibility for the contents of their papers and opinions expressed are not necessarily those of the Editorial Board or of Statistics Canada.

Submission of Manuscripts

Survey Methodology is published twice a year in electronic format. Authors are invited to submit their articles in English or French in electronic form, preferably in Word to the Editor, (statcan.smj-rte.statcan@canada.ca, Statistics Canada, 150 Tunney’s Pasture Driveway, Ottawa, Ontario, Canada, K1A 0T6). For formatting instructions, please see the guidelines provided in the journal and on the web site (www.statcan.gc.ca/SurveyMethodology).

Copyright

Published by authority of the Minister responsible for Statistics Canada.

Use of this publication is governed by the Statistics Canada Open Licence Agreement.

Catalogue No. 12-001-X

Frequency: semi-annual

Ottawa

Date modified: 2016-06-22
