1. Introduction

Jae-kwang Kim, Seunghwan Park and Seo-young Kim

Previous | Next

Combining information from different sources is an important problem in statistics. In survey sampling, combining information from multiple surveys can improve the quality of small area estimates. The source of information can come from a probability sample with direct measurements, from another probability sample with indirect measurements (such as self-reported health status), or from auxiliary area-level information. Many approaches of combining information, such as the multiple-frame and statistical matching methods, require access to individual level data, which is not always feasible in practice.

We consider an area-level model approach to small area estimation when there are several sources of auxiliary information. Pfeffermann (2002) and Rao (2003) provided thorough reviews of methods used in small area estimation. Lohr and Prasad (2003) used multivariate models to combine information from several surveys. Ybarra and Lohr (2008) considered the small area estimation problem when the area-level auxiliary information has measurement errors. Merkouris (2010) discussed the small area estimation by combining information from multiple surveys. Raghunathan, Xie, Schenker, Parsons, Davis, Dodd and Feuer (2007) and Manzi, Spiegelhalter, Turner, Flowers and Thompson (2011) used Bayesian hierarchical models to combine information from multiple surveys for small area estimation. Kim and Rao (2012) considered a design-based approach to combining information from two independent surveys.

To describe the setup, suppose that the finite population consists of H MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIeaaa a@397D@  subpopulations, denoted by U 1 , , U H , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadwfada WgaaWcbaGaaGymaaqabaGccaaISaGaeSOjGSKaaGilaiaadwfadaWg aaWcbaGaamisaaqabaGccaqGSaaaaa@3F95@  and that we are interested in estimating the subpopulation totals X h = i U h x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIfada WgaaWcbaGaamiAaaqabaGccqGH9aqpdaaeqaqabSqaaiaadMgacqGH iiIZcaWGvbWaaSbaaeaacaWGObaabeaaaeqaniabggHiLdGccaWG4b WaaSbaaSqaaiaadMgaaeqaaaaa@4415@  of a variable x MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIhaaa a@39AD@  for each area h . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIgaca GGUaaaaa@3A4F@  We assume that there is a survey that measures x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIhada WgaaWcbaGaamyAaaqabaaaaa@3AC7@  from the sample but its sample size is not large enough to obtain estimates for X h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIfada WgaaWcbaGaamiAaaqabaaaaa@3AA6@  with reasonable accuracy. Consider one of the surveys, called survey A, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadgeaca GGSaaaaa@3A26@  as the main survey, and let X ^ h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiqadIfaga qcamaaBaaaleaacaWGObaabeaaaaa@3AB6@  denote a design-consistent estimator of X h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIfada WgaaWcbaGaamiAaaqabaaaaa@3AA6@  obtained from survey A. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadgeaca GGUaaaaa@3A28@  Often, we compute X ^ h = i A h w ia x i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiqadIfaga qcamaaBaaaleaacaWGObaabeaakiabg2da9maaqababeWcbaGaamyA aiabgIGiolaadgeadaWgaaqaaiaadIgaaeqaaaqab0GaeyyeIuoaki aadEhadaWgaaWcbaGaamyAaiaadggaaeqaaOGaamiEamaaBaaaleaa caWGPbaabeaakiaabYcaaaa@47D0@  where A h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadgeada WgaaWcbaGaamiAaaqabaaaaa@3A8F@  is the set of sample A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadgeaaa a@3976@  for subpopulation h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIgaaa a@399D@  and w i a MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadEhada WgaaWcbaGaamyAaiaadggaaeqaaaaa@3BAC@  is the weight of unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadMgaaa a@399E@  in sample A. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadgeaca GGUaaaaa@3A28@

In addition to the main survey, suppose that there is another survey, called survey B, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaackeaca GGSaaaaa@3A26@  that measures a rough estimate for x i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIhada WgaaWcbaGaamyAaaqabaGccaGGUaaaaa@3BD0@  Let y 1 i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadMhada WgaaWcbaGaaGymaiaadMgaaeqaaaaa@3BD0@  be the measurement taken from survey B. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaackeaca GGUaaaaa@3A28@  We may assume that y 1 i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadMhada WgaaWcbaGaaGymaiaadMgaaeqaaaaa@3BD0@  is a rough measurement of x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIhada WgaaWcbaGaamyAaaqabaaaaa@3B14@  with some level of measurement error. Thus, we may assume

y 1i = β 0 + β 1 x i + e 1i (1.1) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadMhada WgaaWcbaGaaGymaiaadMgaaeqaaOGaeyypa0JaeqOSdi2aaSbaaSqa aiaaicdaaeqaaOGaey4kaSIaeqOSdi2aaSbaaSqaaiaaigdaaeqaaO GaamiEamaaBaaaleaacaWGPbaabeaakiabgUcaRiaadwgadaWgaaWc baGaaGymaiaadMgaaeqaaOGaaGzbVlaaywW7caaMf8UaaGzbVlaayw W7caGGOaGaaGymaiaac6cacaaIXaGaaiykaaaa@53F7@

for some ( β 0 , β 1 ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaamaabmaaba GaeqOSdi2aaSbaaSqaaiaaicdaaeqaaOGaaGilaiabek7aInaaBaaa leaacaaIXaaabeaaaOGaayjkaiaawMcaaiaacYcaaaa@410F@  where e 1 i ( 0, σ e 1 2 ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadwgada WgaaWcbaGaaGymaiaadMgaaeqaaebbfv3ySLgzGueE0jxyaGqbaOGa e8hpIOZaaeWaaeaacaaIWaGaaGilaiabeo8aZnaaDaaaleaacaWGLb GaaGymaaqaaiaaikdaaaaakiaawIcacaGLPaaacaGGUaaaaa@4985@  Model (1.1) is variable-specific and the linear regression assumption or equal variance assumptions can be relaxed later. If ( β 0 , β 1 )=( 0,1 ), MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaamaabmaaba GaeqOSdi2aaSbaaSqaaiaaicdaaeqaaOGaaGilaiabek7aInaaBaaa leaacaaIXaaabeaaaOGaayjkaiaawMcaaiabg2da9maabmaabaGaaG imaiaaiYcacaaIXaaacaGLOaGaayzkaaGaaiilaaaa@45C9@  then model (1.1) means that there is no measurement bias. Note that model parameters ( β 0 , β 1 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaamaabmaaba GaeqOSdi2aaSbaaSqaaiaaicdaaeqaaOGaaGilaiabek7aInaaBaaa leaacaaIXaaabeaaaOGaayjkaiaawMcaaaaa@405F@  in (1.1) are not area specific, but may be different for groups of areas, as demonstrated in the Korean labor force survey application in Section 5. Separate regression models for different groups may lead to smaller model errors and thus improve the statistical efficiency of the proposed method. From survey B, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaackeaca GGSaaaaa@3A26@  we can obtain another estimator Y ^ 1h = i B h w ib y 1i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiqadMfaga qcamaaBaaaleaacaaIXaGaamiAaaqabaGccqGH9aqpdaaeqaqabSqa aiaadMgacqGHiiIZcaWGcbWaaSbaaeaacaWGObaabeaaaeqaniabgg HiLdGccaWG3bWaaSbaaSqaaiaadMgacaWGIbaabeaakiaadMhadaWg aaWcbaGaaGymaiaadMgaaeqaaaaa@48DE@  of X h , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIfada WgaaWcbaGaamiAaaqabaGccaGGSaaaaa@3BAD@  where w i b MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadEhada WgaaWcbaGaamyAaiaadkgaaeqaaaaa@3BFA@  is the weight of unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadMgaaa a@39EB@  in the sample from survey B MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaackeaaa a@3976@  and B h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadkeada WgaaWcbaGaamiAaaqabaaaaa@3ADD@  is the B- MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaackeaca qGTaaaaa@3A26@ sample for subpopulation h . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIgaca GGUaaaaa@3A9C@  Note that Y ^ 1 h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiqadMfaga qcamaaBaaaleaacaaIXaGaamiAaaqabaaaaa@3BBF@  can be obtained, for each area, if the same areas are identified in both surveys A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadgeaaa a@3976@  and B. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaackeaca GGUaaaaa@3A28@  Model (1.1) can be used to combine information from the two surveys.

Finally, another source of information can be the Census information. Census information does not suffer from coverage error or sampling error. But, it may have measurement errors and it does not provide updated information for each month or year. Let y 2 i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadMhada WgaaWcbaGaaGOmaiaadMgaaeqaaaaa@3BD1@  be the measurement for unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadMgaaa a@39EB@  from the Census. The subpopulation total Y 2h = i C h y 2i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadMfada WgaaWcbaGaaGOmaiaadIgaaeqaaOGaeyypa0ZaaabeaeqaleaacaWG PbGaeyicI4Saam4qamaaBaaabaGaamiAaaqabaaabeqdcqGHris5aO GaamyEamaaBaaaleaacaaIYaGaamyAaaqabaaaaa@45CA@  is available when C h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadoeada WgaaWcbaGaamiAaaqabaaaaa@3ADE@  is the set of Census C MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadoeaaa a@39C5@  for subpopulation h . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadIgaca GGUaaaaa@3A9C@

Table 1.1 summarizes the major sources of information that we can consider into small area estimation.

Table 1.1
Available information for small area estimation
Table summary
This table displays the results of Available information for small area estimation. The information is grouped by Data (appearing as row headers), Observation, Area level estimate and Properties (appearing as column headers).
Data Observation Area level estimate Properties
Survey A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadgeaaa a@3B99@ direct obs. ( x i ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaamaabmaaba GaamiEamaaBaaaleaacaWGPbaabeaaaOGaayjkaiaawMcaaaaa@3ECA@ X ^ h , V ^ ( X ^ h ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiqadIfaga qcamaaBaaaleaacaWGObaabeaakiaaiYcaceWGwbGbaKaadaqadaqa aiqadIfagaqcamaaBaaaleaacaWGObaabeaaaOGaayjkaiaawMcaaa aa@426A@ Sampling error (large)
Survey B MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7HqaqFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeGabiGaaiaacaqabeaadaqaaqaaaOqaaiaadkeaaa a@3B9A@ aux. obs. ( y 1 i ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaamaabmaaba GaamyEamaaBaaaleaacaaIXaGaamyAaaqabaaakiaawIcacaGLPaaa aaa@3F86@ Y ^ 1 h , V ^ ( Y ^ 1 h ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiqadMfaga qcamaaBaaaleaacaaIXaGaamiAaaqabaGccaaISaGabmOvayaajaWa aeWaaeaaceWGzbGbaKaadaWgaaWcbaGaaGymaiaadIgaaeqaaaGcca GLOaGaayzkaaaaaa@43E2@ Bias
Measurement error
Sampling error
Census aux. obs. ( y 2 i ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaamaabmaaba GaamyEamaaBaaaleaacaaIYaGaamyAaaqabaaakiaawIcacaGLPaaa aaa@3F87@ Y 2 h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXdbvk9qq=xd9qqaq=Jf9sr 0=vr0=vrWZqaaeaabiGaaiaacaqabeaadaqaaqaaaOqaaiaadMfada WgaaWcbaGaaGOmaiaadIgaaeqaaaaa@3DD3@ Measurement error
No updated information

In this paper, we consider an area-level model approach for small area estimation combining all available information. The proposed approach is based on the measurement error models, where the sampling errors of the direct estimators are treated as measurement errors, and all the other auxiliary information are combined through a set of linking models. The proposed approach is applied to the small area estimation problem for labor force surveys in Korea, where three estimates are combined to produce small area estimates for unemployment rates.

The paper is organized as follows. In Section 2, the basic setup is introduced and the small area estimation problem is viewed as a measurement error model prediction problem. In Section 3, parameter estimation for the area level small area model is discussed. In Section 4, estimation of mean squared error is briefly discussed. In Section 5, the proposed method is applied to the labor force survey data in Korea. Concluding remarks are made in Section 6.

Previous | Next

Date modified: