2. Measure of influence: Conditional bias

Cyril Favre Martinoz, David Haziza and Jean-François Beaumont

Previous | Next

Consider a finite population of individuals, denoted by U , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGvbGaai ilaaaa@3A28@ of size N . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGobGaai Olaaaa@3A23@ We want to estimate the total for the variable of interest y , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG5bGaai ilaaaa@3A4C@ denoted by t= iU y i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG0bGaey ypa0ZaaabeaeqaleaacaWGPbGaeyicI4Saamyvaaqab0GaeyyeIuoa kiaadMhadaWgaaWcbaGaamyAaaqabaGccaGGUaaaaa@4267@ From the population we select a sample S , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGtbGaai ilaaaa@3A26@ of (expected) size n , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGUbGaai ilaaaa@3A41@ using the sampling design p ( S ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGWbWaae WaaeaacaWGtbaacaGLOaGaayzkaaGaaiOlaaaa@3CA6@ A classical estimator of t MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG0baaaa@3953@ is the expansion estimator, also known as the Horvitz-Thompson estimator, t ^ = iS d i y i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aacqGH9aqpdaaeqaqaaiaadsgadaWgaaWcbaGaamyAaaqabaGccaWG 5bWaaSbaaSqaaiaadMgaaeqaaaqaaiaadMgacqGHiiIZcaWGtbaabe qdcqGHris5aOGaaiilaaaa@446A@ where d i =1/ π i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGKbWaaS baaSqaaiaadMgaaeqaaOGaeyypa0ZaaSGbaeaacaaIXaaabaGaeqiW da3aaSbaaSqaaiaadMgaaeqaaaaaaaa@3F15@ is the sampling weight of unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbaaaa@3948@ and π i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqaHapaCda WgaaWcbaGaamyAaaqabaaaaa@3B31@ denotes its probability of inclusion in the sample. Although the expansion estimator, t ^ , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aacaGGSaaaaa@3A13@ is design-unbiased for t , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG0bGaai ilaaaa@3A47@ it can be highly unstable in the presence of influential values.

To measure the impact (or influence) that a sampled unit has on the expansion estimator, we use the concept of conditional bias of a unit; see Moreno-Rebollo, Muñoz-Reyez and Muñoz-Pichardo (1999), Moreno-Rebollo, Muñoz-Reyez, Jimenez-Gamero and Muñoz-Pichardo (2002) and Beaumont, Haziza and Ruiz-Gazen (2013). Let I i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGjbWaaS baaSqaaiaadMgaaeqaaaaa@3A42@ be the sample selection indicator variable for unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbaaaa@3948@ such that I i =1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGjbWaaS baaSqaaiaadMgaaeqaaOGaeyypa0JaaGymaaaa@3C0D@ if i S MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbGaey icI4Saam4uaaaa@3BA4@ and I i =0, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGjbWaaS baaSqaaiaadMgaaeqaaOGaeyypa0JaaGimaiaacYcaaaa@3CBC@ otherwise. The conditional bias of the estimator t ^ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aaaaa@3963@ associated with a sampled unit is defined as

B 1i HT = E p ( t ^ | I i = )t= jU ( π ij π i π j π i π j ) y j ,(2.1) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGcbWaa0 baaSqaaiaaigdacaWGPbaabaGaaeisaiaabsfaaaGccqGH9aqpcaWG fbWaaSbaaSqaaiaadchaaeqaaOWaaeWaaeaaceWG0bGbaKaadaabbe qaaiaadMeadaWgaaWcbaGaamyAaaqabaGccqGH9aqpaiaawEa7aaGa ayjkaiaawMcaaiabgkHiTiaadshacqGH9aqpdaaeqbqabSqaaiaadQ gacqGHiiIZcaWGvbaabeqdcqGHris5aOWaaeWaaeaadaWcaaqaaiab ec8aWnaaBaaaleaacaWGPbGaamOAaaqabaGccqGHsislcqaHapaCda WgaaWcbaGaamyAaaqabaGccqaHapaCdaWgaaWcbaGaamOAaaqabaaa keaacqaHapaCdaWgaaWcbaGaamyAaaqabaGccqaHapaCdaWgaaWcba GaamOAaaqabaaaaaGccaGLOaGaayzkaaGaamyEamaaBaaaleaacaWG QbaabeaakiaaiYcacaaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaacI cacaaIYaGaaiOlaiaaigdacaGGPaaaaa@6F20@

where π i j MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqaHapaCda WgaaWcbaGaamyAaiaadQgaaeqaaaaa@3C20@ is the joint probability of inclusion of units i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbaaaa@398C@ and j MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGQbaaaa@398D@ in the sample. In general, the conditional bias (2.1) is unknown, since the values of the variable of interest are observed only for the sampled units. In practice, the conditional bias must be estimated. We consider the conditionally unbiased estimator (for example, see Beaumont et al. 2013):

B ^ 1i HT = jS ( π ij π i π j π j π ij ) y j =( d i 1) y i + jS,ji ( π ij π i π j π j π ij ) y j .(2.2) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaafaqaaeGaca aabaGabmOqayaajaWaa0baaSqaaiaaigdacaWGPbaabaGaaeisaiaa bsfaaaaakeaacqGH9aqpdaaeqbqabSqaaiaadQgacqGHiiIZcaWGtb aabeqdcqGHris5aOWaaeWaaeaadaWcaaqaaiabec8aWnaaBaaaleaa caWGPbGaamOAaaqabaGccqGHsislcqaHapaCdaWgaaWcbaGaamyAaa qabaGccqaHapaCdaWgaaWcbaGaamOAaaqabaaakeaacqaHapaCdaWg aaWcbaGaamOAaaqabaGccqaHapaCdaWgaaWcbaGaamyAaiaadQgaae qaaaaaaOGaayjkaiaawMcaaiaadMhadaWgaaWcbaGaamOAaaqabaaa keaaaeaacqGH9aqpcaGGOaGaamizamaaBaaaleaacaWGPbaabeaaki abgkHiTiaaigdacaGGPaGaamyEamaaBaaaleaacaWGPbaabeaakiab gUcaRmaaqafabeWcbaGaamOAaiabgIGiolaadofacaaISaGaamOAai abgcMi5kaadMgaaeqaniabggHiLdGcdaqadaqaamaalaaabaGaeqiW da3aaSbaaSqaaiaadMgacaWGQbaabeaakiabgkHiTiabec8aWnaaBa aaleaacaWGPbaabeaakiabec8aWnaaBaaaleaacaWGQbaabeaaaOqa aiabec8aWnaaBaaaleaacaWGQbaabeaakiabec8aWnaaBaaaleaaca WGPbGaamOAaaqabaaaaaGccaGLOaGaayzkaaGaamyEamaaBaaaleaa caWGQbaabeaakiaai6cacaaMf8UaaGzbVlaaywW7caaMf8UaaGzbVl aacIcacaaIYaGaaiOlaiaaikdacaGGPaaaaaaa@8C13@

This estimator is conditionally unbiased in the sense that E p ( B ^ 1i HT | I i =1 )= B 1i HT . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGfbWaaS baaSqaaiaadchaaeqaaOWaaeWaaeaaceWGcbGbaKaadaqhaaWcbaGa aGymaiaadMgaaeaacaqGibGaaeivaaaakmaaeeaabaGaamysamaaBa aaleaacaWGPbaabeaakiabg2da9iaaigdaaiaawEa7aaGaayjkaiaa wMcaaiabg2da9iaadkeadaqhaaWcbaGaaGymaiaadMgaaeaacaqGib Gaaeivaaaakiaac6caaaa@4B79@ We make the following remarks on the conditional bias and its estimator: (i) The conditional bias (2.1) and its estimator (2.2) depend on the inclusion probabilities π i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqaHapaCda WgaaWcbaGaamyAaaqabaaaaa@3B31@ and the joint inclusion probabilities π i j . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqaHapaCda WgaaWcbaGaamyAaiaadQgaaeqaaOGaaiOlaaaa@3CDC@ In other words, the conditional bias is a measure that takes the sampling design into account. (ii) If π i =1, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqaHapaCda WgaaWcbaGaamyAaaqabaGccqGH9aqpcaaIXaGaaiilaaaa@3DAC@ then B 1i HT =0 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGcbWaa0 baaSqaaiaaigdacaWGPbaabaGaaeisaiaabsfaaaGccqGH9aqpcaaI Waaaaa@3E63@ and, similarly, B ^ 1i HT =0. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGcbGbaK aadaqhaaWcbaGaaGymaiaadMgaaeaacaqGibGaaeivaaaakiabg2da 9iaaicdacaGGUaaaaa@3F25@ That is, when π i =1, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqaHapaCda WgaaWcbaGaamyAaaqabaGccqGH9aqpcaaIXaGaaiilaaaa@3DAC@ unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbaaaa@3948@ is selected in all possible samples, and consequently E p ( t ^ | I i =1 )t= E p ( t ^ )t=0, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGfbWaaS baaSqaaiaadchaaeqaaOWaaeWaaeaaceWG0bGbaKaadaabbaqaaiaa dMeadaWgaaWcbaGaamyAaaqabaGccqGH9aqpcaaIXaaacaGLhWoaai aawIcacaGLPaaacqGHsislcaWG0bGaeyypa0JaamyramaaBaaaleaa caWGWbaabeaakmaabmaabaGabmiDayaajaaacaGLOaGaayzkaaGaey OeI0IaamiDaiabg2da9iaaicdacaGGSaaaaa@4DF1@ since t ^ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aaaaa@3963@ is a design-unbiased estimator of t . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG0bGaai Olaaaa@3A49@ A unit selected systematically in the sample therefore has no influence and does not contribute to the variance of t ^ . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aacaGGUaaaaa@3A15@ (iii) The estimated conditional bias (2.2) depends on the second-order inclusion probabilities, π i j . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqaHapaCda WgaaWcbaGaamyAaiaadQgaaeqaaOGaaGOlaaaa@3CE2@ For some designs, these probabilities may be difficult to calculate, in which case approximations will be used. For sampling designs that belong to the class of high-entropy designs (e.g., Berger 1998), a number of approximations of the second-order inclusion probabilities have been proposed in the literature; for example, see Haziza, Mecatti and Rao (2008). An alternative solution is to calculate approximations of the π i j MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqaHapaCda WgaaWcbaGaamyAaiaadQgaaeqaaaaa@3C20@ using Monte Carlo methods; see Fattorini (2006) and Thompson and Wu (2008).

For a stratified simple random sampling design, the conditional bias (2.1) associated with sampled unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbaaaa@398C@ in stratum h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGObaaaa@398B@ is given by

B 1i HT = N h N h 1 ( N h n h 1 )( y i y ¯ Uh ),(2.3) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGcbWaa0 baaSqaaiaaigdacaWGPbaabaGaaeisaiaabsfaaaGccqGH9aqpdaWc aaqaaiaad6eadaWgaaWcbaGaamiAaaqabaaakeaacaWGobWaaSbaaS qaaiaadIgaaeqaaOGaeyOeI0IaaGymaaaadaqadaqaamaalaaabaGa amOtamaaBaaaleaacaWGObaabeaaaOqaaiaad6gadaWgaaWcbaGaam iAaaqabaaaaOGaeyOeI0IaaGymaaGaayjkaiaawMcaamaabmaabaGa amyEamaaBaaaleaacaWGPbaabeaakiabgkHiTiqadMhagaqeamaaBa aaleaacaWGvbGaamiAaaqabaaakiaawIcacaGLPaaacaaISaGaaGzb VlaaywW7caaMf8UaaGzbVlaaywW7caGGOaGaaGOmaiaac6cacaaIZa Gaaiykaaaa@5E44@

where n h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGUbWaaS baaSqaaiaadIgaaeqaaaaa@3A66@ denotes the size of the sample selected in stratum h, y ¯ Uh = N h 1 i U h y i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGObGaaG ilaiqadMhagaqeamaaBaaaleaacaWGvbGaamiAaaqabaGccqGH9aqp caWGobWaa0baaSqaaiaadIgaaeaacqGHsislcaaIXaaaaOWaaabeae qaleaacaWGPbGaeyicI4SaamyvamaaBaaabaGaamiAaaqabaaabeqd cqGHris5aOGaamyEamaaBaaaleaacaWGPbaabeaakiaaiYcaaaa@4AD5@ and U h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGvbWaaS baaSqaaiaadIgaaeqaaaaa@3A4D@ denotes the population of units in stratum h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGObaaaa@3947@ of size N h ,h=1,,H. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGobWaaS baaSqaaiaadIgaaeqaaOGaaGilaiaadIgacqGH9aqpcaaIXaGaaiil aiablAciljaaiYcacaWGibGaaGOlaaaa@41C1@ The estimator of the conditional bias (2.2) reduces to

B ^ 1i HT = n h n h 1 ( N h n h 1 )( y i y ¯ Sh ), MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGcbGbaK aadaqhaaWcbaGaaGymaiaadMgaaeaacaqGibGaaeivaaaakiabg2da 9maalaaabaGaamOBamaaBaaaleaacaWGObaabeaaaOqaaiaad6gada WgaaWcbaGaamiAaaqabaGccqGHsislcaaIXaaaamaabmaabaWaaSaa aeaacaWGobWaaSbaaSqaaiaadIgaaeqaaaGcbaGaamOBamaaBaaale aacaWGObaabeaaaaGccqGHsislcaaIXaaacaGLOaGaayzkaaWaaeWa aeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaOGaeyOeI0IabmyEayaara WaaSbaaSqaaiaadofacaWGObaabeaaaOGaayjkaiaawMcaaiaaiYca aaa@5348@

where y ¯ Sh = n h 1 i S h y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG5bGbae badaWgaaWcbaGaam4uaiaadIgaaeqaaOGaeyypa0JaamOBamaaDaaa leaacaWGObaabaGaeyOeI0IaaGymaaaakmaaqababeWcbaGaamyAai abgIGiolaadofadaWgaaqaaiaadIgaaeqaaaqab0GaeyyeIuoakiaa dMhadaWgaaWcbaGaamyAaaqabaaaaa@488E@ and S h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGtbWaaS baaSqaaiaadIgaaeqaaaaa@3A4B@ is the sample in stratum h . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGObGaai Olaaaa@3A3D@

For a Poisson design, the conditional bias of sampled unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbaaaa@398C@ is given by

B i HT ( I i =1 )=( d i 1 ) y i .(2.4) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGcbWaa0 baaSqaaiaadMgaaeaacaqGibGaaeivaaaakmaabmaabaGaamysamaa BaaaleaacaWGPbaabeaakiabg2da9iaaigdaaiaawIcacaGLPaaacq GH9aqpdaqadaqaaiaadsgadaWgaaWcbaGaamyAaaqabaGccqGHsisl caaIXaaacaGLOaGaayzkaaGaamyEamaaBaaaleaacaWGPbaabeaaki aai6cacaaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaacIcacaaIYaGa aiOlaiaaisdacaGGPaaaaa@558C@

In contrast to the simple random sampling without-replacement design, the conditional bias (2.4) is known for all units in the sample, since it does not depend on finite population parameters.

Previous | Next

Date modified: