A method to find an efficient and robust sampling strategy under model uncertainty
Section 2. Optimal strategy under the superpopulation model

Let U MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyvaaaa@35D6@ be a finite population of size N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOtaaaa@35CF@ with elements labeled { 1, 2, , k , , N } . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaiWabeaaca aIXaGaaGilaiaaysW7caaIYaGaaGilaiaaysW7cqWIMaYscaGGSaGa aGjbVlaadUgacaaISaGaaGjbVlablAciljaacYcacaaMe8UaamOtaa Gaay5Eaiaaw2haaiaac6caaaa@48A1@ Let x k = ( x 1 k , x 2 k , , x J k ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiEamaaBa aaleaacaWGRbaabeaakiaaysW7caaI9aGaaGjbVpaabmqabaGaamiE amaaBaaaleaacaaIXaGaam4AaaqabaGccaaISaGaaGjbVlaadIhada WgaaWcbaGaaGOmaiaadUgaaeqaaOGaaGilaiaaysW7cqWIMaYscaaI SaGaaGjbVlaadIhadaWgaaWcbaGaamOsaiaadUgaaeqaaaGccaGLOa Gaayzkaaaaaa@4D24@ be a known vector of values of J MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOsaaaa@35CB@ auxiliary variables and y k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyEamaaBa aaleaacaWGRbaabeaaaaa@3716@ the unknown value of a study variable associated to unit k U . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaiaays W7cqGHiiIZcaaMe8Uaamyvaiaac6caaaa@3C16@ We are interested in the estimation of the total of y , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyEaiaacY caaaa@36AA@ t y = U y k . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiDamaaBa aaleaacaWG5baabeaakiaaysW7caaI9aGaaGjbVpaaqababaGaamyE amaaBaaaleaacaWGRbaabeaaaeaacaWGvbaabeqdcqGHris5aOGaai Olaaaa@4092@

Let Ω MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeuyQdCfaaa@368A@ be the power set of U . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyvaiaac6 caaaa@3688@ A sample is any subset s Ω MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Caiaays W7cqGHiiIZcaaMe8UaeuyQdCfaaa@3C20@ and a sampling design is a probability distribution on Ω , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeuyQdCLaai ilaaaa@373A@ denoted by P ( S = s ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiuamaabm qabaGaam4uaiaaysW7caaI9aGaaGjbVlaadohaaiaawIcacaGLPaaa aaa@3D0C@ or simply p ( s ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiCamaabm qabaGaaGjcVlaadohacaaMi8oacaGLOaGaayzkaaGaaiOlaaaa@3C47@ Let π k = s k p ( s ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiWda3aaS baaSqaaiaadUgaaeqaaOGaaGjbVlaai2dacaaMe8+aaabeaeaacaWG WbWaaeWabeaacaaMi8Uaam4CaiaayIW7aiaawIcacaGLPaaaaSqaai aadohacaaMc8+efv3ySLgznfgDOjdaryqr1ngBPrginfgDObcv39ga iuaacqWFlis5caaMc8Uaam4Aaaqab0GaeyyeIuoaaaa@5474@ be the inclusion probability of k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaaaa@35EC@ and π k l = s { k , l } p ( s ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiWda3aaS baaSqaaiaadUgacaWGSbaabeaakiaaysW7caaI9aGaaGjbVpaaqaba baGaamiCamaabmqabaGaaGjcVlaadohacaaMi8oacaGLOaGaayzkaa aaleaacaWGZbGaaGPaVlabgoOijlaaykW7daGadeqaaiaayIW7caWG RbGaaGilaiaaykW7caWGSbGaaGjcVdGaay5Eaiaaw2haaaqab0Gaey yeIuoaaaa@54AB@ the joint inclusion probability of k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaaaa@35EC@ and l . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiBaiaac6 caaaa@369F@ A probability sampling design is a sampling design such that π k > 0 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiWda3aaS baaSqaaiaadUgaaeqaaOGaaGjbVlaai6dacaaMe8UaaGimaaaa@3C7B@ for all k U . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaiaays W7cqGHiiIZcaaMe8Uaamyvaiaac6caaaa@3C16@

An estimator is a real valued function of the sample, t ^ y = t ^ y ( S ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmiDayaaja WaaSbaaSqaaiaadMhaaeqaaOGaaGjbVlaai2dacaaMe8UabmiDayaa jaWaaSbaaSqaaiaadMhaaeqaaOWaaeWabeaacaaMi8Uaam4uaiaayI W7aiaawIcacaGLPaaacaGGUaaaaa@438D@ By strategy we refer to the couple sampling design and estimator, ( p ( ) , t ^ y ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaeWabeaaca WGWbWaaeWabeaacaaMi8UaeyyXICTaaGjcVdGaayjkaiaawMcaaiaa iYcacaaMe8UabmiDayaajaWaaSbaaSqaaiaadMhaaeqaaaGccaGLOa GaayzkaaGaaiOlaaaa@43A3@

We consider only probability sampling designs with fixed sample size. As a convenient stepping stone we begin by considering unbiased linear estimators of the form

t ^ y = ( U z k s z k π k ) + s y k π k = U z k + s e k π k ( 2.1 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmiDayaaja WaaSbaaSqaaiaadMhaaeqaaOGaaGjbVlaaykW7caaI9aGaaGjbVlaa ykW7daqadaqaamaaqafabaGaamOEamaaBaaaleaacaWGRbaabeaaae aacaWGvbaabeqdcqGHris5aOGaaGjbVlabgkHiTiaaysW7daaeqbqa amaalaaabaGaamOEamaaBaaaleaacaWGRbaabeaaaOqaaiabec8aWn aaBaaaleaacaWGRbaabeaaaaaabaGaam4Caaqab0GaeyyeIuoaaOGa ayjkaiaawMcaaiaaysW7cqGHRaWkcaaMe8+aaabuaeaadaWcaaqaai aadMhadaWgaaWcbaGaam4AaaqabaaakeaacqaHapaCdaWgaaWcbaGa am4AaaqabaaaaaqaaiaadohaaeqaniabggHiLdGccaaMe8UaaGypai aaysW7daaeqbqaaiaadQhadaWgaaWcbaGaam4AaaqabaaabaGaamyv aaqab0GaeyyeIuoakiaaysW7cqGHRaWkcaaMe8+aaabuaeaadaWcaa qaaiaadwgadaWgaaWcbaGaam4AaaqabaaakeaacqaHapaCdaWgaaWc baGaam4AaaqabaaaaaqaaiaadohaaeqaniabggHiLdGccaaMf8UaaG zbVlaaywW7caaMf8UaaGzbVlaacIcacaaIYaGaaiOlaiaaigdacaGG Paaaaa@7D70@

with z k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOEamaaBa aaleaacaWGRbaabeaaaaa@3717@ arbitrary known constants and e k = y k z k . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyzamaaBa aaleaacaWGRbaabeaakiaaysW7caaI9aGaaGjbVlaadMhadaWgaaWc baGaam4AaaqabaGccaaMe8UaeyOeI0IaaGjbVlaadQhadaWgaaWcba Gaam4AaaqabaGccaGGUaaaaa@43EF@ This estimator is called the difference estimator. The estimator defined in this way is said to be calibrated on z MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOEaaaa@35FB@ as it satisfies t ^ z = U z k . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmiDayaaja WaaSbaaSqaaiaadQhaaeqaaOGaaGjbVlaai2dacaaMe8+aaabeaeaa caWG6bWaaSbaaSqaaiaadUgaaeqaaaqaaiaadwfaaeqaniabggHiLd GccaGGUaaaaa@40A4@ Note that if z k = 0 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOEamaaBa aaleaacaWGRbaabeaakiaaysW7caaI9aGaaGjbVlaaicdaaaa@3BBC@ for all k U MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Aaiaays W7cqGHiiIZcaaMe8Uaamyvaaaa@3B64@ the estimator reduces to t ^ y = s y k / π k , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmiDayaaja WaaSbaaSqaaiaadMhaaeqaaOGaaGjbVlaai2dacaaMe8+aaSGbaeaa daaeqaqaaiaadMhadaWgaaWcbaGaam4AaaqabaaabaGaam4Caaqab0 GaeyyeIuoaaOqaaiabec8aWnaaBaaaleaacaWGRbaabeaaaaGccaGG Saaaaa@43B7@ that is, the Horvitz-Thompson estimator (Horvitz and Thompson, 1952). In later sections we focus on the generalized regression estimator (GREG).

The design Mean Squared Error (MSE) of the difference estimator is

MSE p ( t ^ y ) = MSE p ( s e k π k ) = U U ( π k l π k π l ) e k π k e l π l . ( 2.2 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaeytaiaabo facaqGfbWaaSbaaSqaaiaabchaaeqaaOWaaeWabeaaceWG0bGbaKaa daWgaaWcbaGaamyEaaqabaaakiaawIcacaGLPaaacaaMe8UaaGPaVl aai2dacaaMe8UaaGPaVlaab2eacaqGtbGaaeyramaaBaaaleaacaqG WbaabeaakmaabmaabaWaaabuaeaadaWcaaqaaiaadwgadaWgaaWcba Gaam4AaaqabaaakeaacqaHapaCdaWgaaWcbaGaam4Aaaqabaaaaaqa aiaadohaaeqaniabggHiLdaakiaawIcacaGLPaaacaaMe8UaaGPaVl aai2dacaaMe8UaaGPaVpaaqafabaWaaabuaeaadaqadeqaaiabec8a WnaaBaaaleaacaWGRbGaamiBaaqabaGccaaMe8UaeyOeI0IaaGjbVl abec8aWnaaBaaaleaacaWGRbaabeaakiabec8aWnaaBaaaleaacaWG SbaabeaaaOGaayjkaiaawMcaaiaaysW7daWcaaqaaiaadwgadaWgaa WcbaGaam4AaaqabaaakeaacqaHapaCdaWgaaWcbaGaam4Aaaqabaaa aOGaaGjbVpaalaaabaGaamyzamaaBaaaleaacaWGSbaabeaaaOqaai abec8aWnaaBaaaleaacaWGSbaabeaaaaaabaGaamyvaaqab0Gaeyye IuoaaSqaaiaadwfaaeqaniabggHiLdGccaaIUaGaaGzbVlaaywW7ca aMf8UaaGzbVlaaywW7caGGOaGaaGOmaiaac6cacaaIYaGaaiykaaaa @85B8@

As mentioned in the introduction, due to the non-existence of an optimal strategy under the design-based approach, often a superpopulation model, ξ 0 , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqOVdG3aaS baaSqaaiaaicdaaeqaaOGaaiilaaaa@385F@ is proposed and we search for an optimal strategy with respect to the anticipated mean-squared error,

MSE ξ 0 p ( t ^ y ) = E ξ 0 MSE p ( t ^ y ) = E ξ 0 E p ( ( t ^ y t y ) 2 ) . ( 2.3 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaeytaiaabo facaqGfbWaaSbaaSqaaiabe67a4naaBaaameaacaaIWaaabeaaliaa bchaaeqaaOWaaeWabeaaceWG0bGbaKaadaWgaaWcbaGaamyEaaqaba aakiaawIcacaGLPaaacaaMe8UaaGPaVlaai2dacaaMe8UaaGPaVlaa bweadaWgaaWcbaGaeqOVdG3aaSbaaWqaaiaaicdaaeqaaaWcbeaaki aab2eacaqGtbGaaeyramaaBaaaleaacaqGWbaabeaakmaabmqabaGa bmiDayaajaWaaSbaaSqaaiaadMhaaeqaaaGccaGLOaGaayzkaaGaaG jbVlaai2dacaaMe8UaaeyramaaBaaaleaacqaH+oaEdaWgaaadbaGa aGimaaqabaaaleqaaOGaaeyramaaBaaaleaacaWGWbaabeaakmaabm aabaWaaeWabeaaceWG0bGbaKaadaWgaaWcbaGaamyEaaqabaGccaaM e8UaeyOeI0IaaGjbVlaadshadaWgaaWcbaGaamyEaaqabaaakiaawI cacaGLPaaadaahaaWcbeqaaiaaikdaaaaakiaawIcacaGLPaaacaGG UaGaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7caGGOaGaaGOmaiaac6 cacaaIZaGaaiykaaaa@7310@

We may assume that the y MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyEaaaa@35F9@ -values are realizations of the following model, denoted ξ 0 , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqOVdG3aaS baaSqaaiaaicdaaeqaaOGaaiilaaaa@385F@

Y k = f ( x k | δ 1 ) + ε k MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamywamaaBa aaleaacaWGRbaabeaakiaaysW7caaMc8UaaGypaiaaysW7caaMc8Ua amOzamaabmqabaWaaqGabeaacaWG4bWaaSbaaSqaaiaadUgaaeqaaO GaaGPaVdGaayjcSdGaaGPaVlabes7aKnaaBaaaleaacaaIXaaabeaa aOGaayjkaiaawMcaaiaaysW7cqGHRaWkcaaMe8UaeqyTdu2aaSbaaS qaaiaadUgaaeqaaaaa@5090@

with

E ξ 0 ( ε k ) = 0, V ξ 0 ( ε k ) = σ 0 2 g ( x k | δ 2 ) 2 and E ξ 0 ( ε k ε l ) = 0 k l ( 2.4 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaeyramaaBa aaleaacqaH+oaEdaWgaaadbaGaaGimaaqabaaaleqaaOWaaeWabeaa cqaH1oqzdaWgaaWcbaGaam4AaaqabaaakiaawIcacaGLPaaacaaMe8 UaaGypaiaaysW7caaIWaGaaGilaiaaywW7caGIwbWaaSbaaSqaaiab e67a4naaBaaameaacaaIWaaabeaaaSqabaGcdaqadeqaaiabew7aLn aaBaaaleaacaWGRbaabeaaaOGaayjkaiaawMcaaiaaysW7caaI9aGa aGjbVlabeo8aZnaaDaaaleaacaaIWaaabaGaaGOmaaaakiaadEgada qadeqaamaaeiqabaGaamiEamaaBaaaleaacaWGRbaabeaakiaaykW7 aiaawIa7aiaaykW7cqaH0oazdaWgaaWcbaGaaGOmaaqabaaakiaawI cacaGLPaaadaahaaWcbeqaaiaaikdaaaGccaaMf8Uaaeyyaiaab6ga caqGKbGaaGzbVlaabweadaWgaaWcbaGaeqOVdG3aaSbaaWqaaiaaic daaeqaaaWcbeaakmaabmqabaGaeqyTdu2aaSbaaSqaaiaadUgaaeqa aOGaeqyTdu2aaSbaaSqaaiaadYgaaeqaaaGccaGLOaGaayzkaaGaaG jbVlaai2dacaaMe8UaaGimaiaaysW7caaMi8UaeyiaIiIaaGjbVlaa dUgacaaMe8UaeyiyIKRaaGjbVlaadYgacaaMf8UaaGzbVlaaywW7ca aMf8UaaGzbVlaacIcacaaIYaGaaiOlaiaaisdacaGGPaaaaa@8CFC@

where δ = ( δ 1 , δ 2 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiTdqMaaG jbVlaai2dacaaMe8+aaeWabeaacqaH0oazdaWgaaWcbaGaaGymaaqa baGccaaISaGaaGjbVlabes7aKnaaBaaaleaacaaIYaaabeaaaOGaay jkaiaawMcaaaaa@437C@ is a vector of parameters, f : R J R MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOzaiaayI W7caaI6aGaaGjbVpXvP5wqonvsaeHbmv3yPrwyGmuySXwANjxyWHwE aGqbciab=jfasnaaCaaaleqabaGaamOsaaaakiaaysW7caaMc8Uaey OKH4QaaGjbVlaaykW7cqWFsbGuaaa@4EB2@ and g : R J R + . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4zaiaayI W7caaI6aGaaGjbVpXvP5wqonvsaeHbmv3yPrwyGmuySXwANjxyWHwE aGqbciab=jfasnaaCaaaleqabaGaamOsaaaakiaaysW7caaMc8Uaey OKH4QaaGjbVlaaykW7cqWFsbGudaahaaWcbeqaaiabgUcaRaaakiaa c6caaaa@507E@ The random sample s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4Caaaa@35F4@ and the errors ε k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqyTdu2aaS baaSqaaiaadUgaaeqaaaaa@37BF@ are assumed to be independent. Following Rosén (2000), the terms f ( x k | δ 1 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOzamaabm qabaWaaqGabeaacaWG4bWaaSbaaSqaaiaadUgaaeqaaOGaaGPaVdGa ayjcSdGaaGPaVlabes7aKnaaBaaaleaacaaIXaaabeaaaOGaayjkai aawMcaaaaa@40D7@ and g ( x k | δ 2 ) > 0 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4zamaabm qabaWaaqGabeaacaWG4bWaaSbaaSqaaiaadUgaaeqaaOGaaGPaVdGa ayjcSdGaaGPaVlabes7aKnaaBaaaleaacaaIYaaabeaaaOGaayjkai aawMcaaiaaysW7caaMc8UaaGOpaiaaysW7caaMc8UaaGimaaaa@488B@ will be called trend and spread, respectively. The term trend should not in general be understood in a temporal sense, rather it refers to the development of y MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyEaaaa@35FA@ -values with x . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiEaiaac6 caaaa@36AB@

Note that under ξ 0 , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqOVdG3aaS baaSqaaiaaicdaaeqaaOGaaiilaaaa@385F@ e k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyzamaaBa aaleaacaWGRbaabeaaaaa@3702@ in the difference estimator (2.1) is a random variable that represents the distance between the value of the study variable and z k , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOEamaaBa aaleaacaWGRbaabeaakiaacYcaaaa@37D1@ i.e., e k = f ( x k | δ 1 ) + ε k z k . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyzamaaBa aaleaacaWGRbaabeaakiaaysW7caaI9aGaaGjbVlaadAgadaqadeqa amaaeiqabaGaamiEamaaBaaaleaacaWGRbaabeaakiaaykW7aiaawI a7aiaaykW7cqaH0oazdaWgaaWcbaGaaGymaaqabaaakiaawIcacaGL PaaacaaMe8Uaey4kaSIaaGjbVlabew7aLnaaBaaaleaacaWGRbaabe aakiaaysW7cqGHsislcaaMe8UaamOEamaaBaaaleaacaWGRbaabeaa kiaac6caaaa@546F@ Therefore E ξ 0 e k = f ( x k | δ 1 ) z k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaeyramaaBa aaleaacqaH+oaEdaWgaaadbaGaaGimaaqabaaaleqaaOGaamyzamaa BaaaleaacaWGRbaabeaakiaaysW7caaI9aGaaGjbVlaadAgadaqade qaamaaeiqabaGaamiEamaaBaaaleaacaWGRbaabeaakiaaykW7aiaa wIa7aiaaykW7cqaH0oazdaWgaaWcbaGaaGymaaqabaaakiaawIcaca GLPaaacaaMe8UaeyOeI0IaaGjbVlaadQhadaWgaaWcbaGaam4Aaaqa baaaaa@509D@ and E ξ 0 e k 2 = ( f ( x k | δ 1 ) z k ) 2 + σ 0 2 g ( x k | δ 2 ) 2 . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaeyramaaBa aaleaacqaH+oaEdaWgaaadbaGaaGimaaqabaaaleqaaOGaamyzamaa DaaaleaacaWGRbaabaGaaGOmaaaakiaaysW7caaI9aGaaGjbVpaabm qabaGaamOzamaabmqabaWaaqGabeaacaWG4bWaaSbaaSqaaiaadUga aeqaaOGaaGPaVdGaayjcSdGaaGPaVlabes7aKnaaBaaaleaacaaIXa aabeaaaOGaayjkaiaawMcaaiaaysW7cqGHsislcaaMe8UaamOEamaa BaaaleaacaWGRbaabeaaaOGaayjkaiaawMcaamaaCaaaleqabaGaaG OmaaaakiabgUcaRiabeo8aZnaaDaaaleaacaaIWaaabaGaaGOmaaaa kiaadEgadaqadeqaamaaeiqabaGaamiEamaaBaaaleaacaWGRbaabe aakiaaykW7aiaawIa7aiaaykW7cqaH0oazdaWgaaWcbaGaaGOmaaqa baaakiaawIcacaGLPaaadaahaaWcbeqaaiaaikdaaaGccaGGUaaaaa@65B5@ With some algebra, it can be seen from (2.2) and (2.3) that the anticipated MSE of the difference estimator becomes

MSE ξ 0 p ( t ^ y ) = MSE p ( s f ( x k | δ 1 ) z k π k ) + σ 0 2 U ( 1 π k 1 ) g ( x k | δ 2 ) 2 ( 2.5 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaeytaiaabo facaqGfbWaaSbaaSqaaiabe67a4naaBaaameaacaaIWaaabeaaliaa bchaaeqaaOWaaeWabeaaceWG0bGbaKaadaWgaaWcbaGaamyEaaqaba aakiaawIcacaGLPaaacaaMe8UaaGPaVlaai2dacaaMe8UaaGPaVlaa b2eacaqGtbGaaeyramaaBaaaleaacaqGWbaabeaakmaabmaabaWaaa buaeaadaWcaaqaaiaadAgadaqadeqaamaaeiqabaGaamiEamaaBaaa leaacaWGRbaabeaakiaaykW7aiaawIa7aiaaykW7cqaH0oazdaWgaa WcbaGaaGymaaqabaaakiaawIcacaGLPaaacaaMe8UaeyOeI0IaaGjb VlaadQhadaWgaaWcbaGaam4AaaqabaaakeaacqaHapaCdaWgaaWcba Gaam4AaaqabaaaaaqaaiaadohaaeqaniabggHiLdaakiaawIcacaGL PaaacaaMe8Uaey4kaSIaaGjbVlabeo8aZnaaDaaaleaacaaIWaaaba GaaGOmaaaakmaaqafabaWaaeWaaeaadaWcaaqaaiaaigdaaeaacqaH apaCdaWgaaWcbaGaam4AaaqabaaaaOGaaGjbVlabgkHiTiaaysW7ca aIXaaacaGLOaGaayzkaaGaam4zamaabmqabaWaaqGabeaacaWG4bWa aSbaaSqaaiaadUgaaeqaaOGaaGPaVdGaayjcSdGaaGPaVlabes7aKn aaBaaaleaacaaIYaaabeaaaOGaayjkaiaawMcaamaaCaaaleqabaGa aGOmaaaaaeaacaWGvbaabeqdcqGHris5aOGaaGzbVlaaywW7caaMf8 UaaGzbVlaacIcacaaIYaGaaiOlaiaaiwdacaGGPaaaaa@8E14@

Nedyalkova and Tillé (2008) derive the anticipated MSE in a more general case.

Tillé and Wilhelm (2017) give the anticipated MSE of the Horvitz-Thompson estimator. The second term in (2.5) is the Godambe-Joshi lower bound (e.g., Särndal, Swensson and Wretman, 1992, page 453).

The anticipated MSE in (2.5) is the sum of two positive terms. It is easy to see that if

  1. the estimator is calibrated on z k = f ( x k | δ 1 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOEamaaBa aaleaacaWGRbaabeaakiaaysW7caaI9aGaaGjbVlaadAgadaqadeqa amaaeiqabaGaamiEamaaBaaaleaacaWGRbaabeaakiaaykW7aiaawI a7aiaaykW7cqaH0oazdaWgaaWcbaGaaGymaaqabaaakiaawIcacaGL Paaaaaa@46DD@ the first term vanishes and the anticipated MSE equals the Godambe-Joshi lower bound

MSE ξ 0 p ( t ^ y ) = σ 0 2 U ( 1 π k 1 ) g ( x k | δ 2 ) 2 . ( 2.6 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaeytaiaabo facaqGfbWaaSbaaSqaaiabe67a4naaBaaameaacaaIWaaabeaaliaa bchaaeqaaOWaaeWabeaaceWG0bGbaKaadaWgaaWcbaGaamyEaaqaba aakiaawIcacaGLPaaacaaMe8UaaGPaVlaai2dacaaMc8UaaGjbVlab eo8aZnaaDaaaleaacaaIWaaabaGaaGOmaaaakmaaqafabaWaaeWaae aadaWcaaqaaiaaigdaaeaacqaHapaCdaWgaaWcbaGaam4Aaaqabaaa aOGaaGjbVlabgkHiTiaaysW7caaIXaaacaGLOaGaayzkaaGaam4zam aabmqabaWaaqGabeaacaWG4bWaaSbaaSqaaiaadUgaaeqaaOGaaGPa VdGaayjcSdGaaGPaVlabes7aKnaaBaaaleaacaaIYaaabeaaaOGaay jkaiaawMcaamaaCaaaleqabaGaaGOmaaaaaeaacaWGvbaabeqdcqGH ris5aOGaaGOlaiaaywW7caaMf8UaaGzbVlaaywW7caaMf8Uaaiikai aaikdacaGGUaGaaGOnaiaacMcaaaa@6F35@

  1. Furthermore, after imposing the fixed sample size restriction U π k = n , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaabeaeaacq aHapaCdaWgaaWcbaGaam4AaaqabaGccaaMe8UaaGypaiaaysW7caWG UbaaleaacaWGvbaabeqdcqGHris5aOGaaiilaaaa@402A@ if
  2. the design is such that π k g ( x k | δ 2 ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiWda3aaS baaSqaaiaadUgaaeqaaOGaaGjbVlabg2Hi1kaaysW7caWGNbWaaeWa beaadaabceqaaiaadIhadaWgaaWcbaGaam4AaaqabaGccaaMc8oaca GLiWoacaaMc8UaeqiTdq2aaSbaaSqaaiaaikdaaeqaaaGccaGLOaGa ayzkaaGaaiilaaaa@4906@ denoted π ps ( δ 2 ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiWdaNaae iCaiaabohadaqadeqaaiabes7aKnaaBaaaleaacaaIYaaabeaaaOGa ayjkaiaawMcaaiaacYcaaaa@3D73@ the second term is minimized and we obtain

MSE ξ 0 p opt ( t ^ y ) = σ 0 2 ( 1 n ( U g ( x k | δ 2 ) ) 2 U g ( x k | δ 2 ) 2 ) . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaeytaiaabo facaqGfbWaa0baaSqaaiabe67a4naaBaaameaacaaIWaaabeaaliaa bchaaeaacaqGVbGaaeiCaiaabshaaaGcdaqadeqaaiqadshagaqcam aaBaaaleaacaWG5baabeaaaOGaayjkaiaawMcaaiaaykW7caaMe8Ua aGypaiaaykW7caaMe8Uaeq4Wdm3aa0baaSqaaiaaicdaaeaacaaIYa aaaOWaaeWaaeaadaWcaaqaaiaaigdaaeaacaWGUbaaamaabmaabaWa aabuaeaacaWGNbWaaeWabeaadaabceqaaiaadIhadaWgaaWcbaGaam 4AaaqabaGccaaMc8oacaGLiWoacaaMc8UaeqiTdq2aaSbaaSqaaiaa ikdaaeqaaaGccaGLOaGaayzkaaaaleaacaWGvbaabeqdcqGHris5aa GccaGLOaGaayzkaaWaaWbaaSqabeaacaaIYaaaaOGaaGjbVlabgkHi TiaaysW7daaeqbqaaiaadEgadaqadeqaamaaeiqabaGaamiEamaaBa aaleaacaWGRbaabeaakiaaykW7aiaawIa7aiaaykW7cqaH0oazdaWg aaWcbaGaaGOmaaqabaaakiaawIcacaGLPaaadaahaaWcbeqaaiaaik daaaaabaGaamyvaaqab0GaeyyeIuoaaOGaayjkaiaawMcaaiaai6ca aaa@757A@

Conditions 1 and 2 suggest the specific roles of the design and the estimator in the sampling strategy. The estimator should “explain” the trend in the calibration sense of condition 1. The design should “explain” the spread. A strategy that satisfies conditions 1 and 2 simultaneously will be called optimal. In the same sense, any estimator and any design satisfying, respectively, condition 1 and 2, will be called optimal. As this strategy plays an important role in this paper, we will denote it by π ps ( δ 2 ) diff ( δ 1 ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepG0lj9riW7rqqrFfpu0de9GqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vi=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiWdaNaae iCaiaabohadaqadeqaaiabes7aKnaaBaaaleaacaaIYaaabeaaaOGa ayjkaiaawMcaaiaaysW7cqGHsislcaaMe8UaaeizaiaabMgacaqGMb GaaeOzamaabmqabaGaeqiTdq2aaSbaaSqaaiaaigdaaeqaaaGccaGL OaGaayzkaaGaaiOlaaaa@4941@


Date modified: