4. Application to winsorized estimators

Cyril Favre Martinoz, David Haziza and Jean-François Beaumont

Previous | Next

Estimator (3.5) can be written in alternative forms, which can make it easier to implement in some cases. We consider the winsorized form. This form has been widely studied in the literature. As mentioned in Section 1, standard winsorization is distinguished from Dalén-Tambay winsorization.

Standard winsorization involves decreasing the value of units that are above a particular threshold, taking their weight into account. Let y ˜ i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG5bGbaG aadaWgaaWcbaGaamyAaaqabaaaaa@3A81@ be the value of variable y MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG5baaaa@3958@ for unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbaaaa@3948@ after winsorization. We have

y ˜ i ={ y i if  d i y i K K d i if  d i y i >K (4.1) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG5bGbaG aadaWgaaWcbaGaamyAaaqabaGccqGH9aqpdaGabaqaauaabaqaciaa aeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaaGcbaGaaeyAaiaabAgaca qGGaGaamizamaaBaaaleaacaWGPbaabeaakiaadMhadaWgaaWcbaGa amyAaaqabaGccqGHKjYOcaWGlbaabaWaaSaaaeaacaWGlbaabaGaam izamaaBaaaleaacaWGPbaabeaaaaaakeaacaqGPbGaaeOzaiaabcca caWGKbWaaSbaaSqaaiaadMgaaeqaaOGaamyEamaaBaaaleaacaWGPb aabeaakiaaysW7caqG+aGaaGjbVlaadUeaaaGaaGzbVlaaywW7caaM f8UaaGzbVlaaywW7caGGOaGaaGinaiaac6cacaaIXaGaaiykaaGaay 5Eaaaaaa@6190@

where K>0 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGlbGaaG jbVlaab6dacaaMe8UaaGimaaaa@3DBF@ is the winsorization threshold. The standard winsorized estimator of the total t MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG0baaaa@3997@ is given by

t ^ s = iS d i y ˜ i (4.2) = t ^ +Δ( K ), MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaafaqaaeGaca aabaGabmiDayaajaWaaSbaaSqaaiaadohaaeqaaaGcbaGaeyypa0Za aabuaeaacaWGKbWaaSbaaSqaaiaadMgaaeqaaOGabmyEayaaiaWaaS baaSqaaiaadMgaaeqaaaqaaiaadMgacqGHiiIZcaWGtbaabeqdcqGH ris5aOGaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7caGGOaGaaGinai aac6cacaaIYaGaaiykaaqaaaqaaiabg2da9iqadshagaqcaiabgUca Riabfs5aenaabmaabaGaam4saaGaayjkaiaawMcaaiaacYcaaaaaaa@57F0@

where

Δ( K )= iS max( 0, d i y i K ). MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqqHuoarda qadaqaaiaadUeaaiaawIcacaGLPaaacqGH9aqpcqGHsisldaaeqbqa bSqaaiaadMgacqGHiiIZcaWGtbaabeqdcqGHris5aOGaciyBaiaacg gacaGG4bWaaeWaaeaacaaIWaGaaGilaiaadsgadaWgaaWcbaGaamyA aaqabaGccaWG5bWaaSbaaSqaaiaadMgaaeqaaOGaeyOeI0Iaam4saa GaayjkaiaawMcaaiaai6caaaa@4FF4@

Hence, the estimator (4.2) can be written in the form (3.1). An alternative is to express t ^ s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaam4Caaqabaaaaa@3A87@ as a weighted sum of the initial values using modified weights:

t ^ s = iS d ˜ i y i , MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaam4CaaqabaGccqGH9aqpdaaeqbqaaiqadsgagaac amaaBaaaleaacaWGPbaabeaakiaadMhadaWgaaWcbaGaamyAaaqaba aabaGaamyAaiabgIGiolaadofaaeqaniabggHiLdGccaaISaaaaa@45EC@

where

d ˜ i = d i min( y i , K d i ) y i .(4.3) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGKbGbaG aadaWgaaWcbaGaamyAaaqabaGccqGH9aqpcaWGKbWaaSbaaSqaaiaa dMgaaeqaaOWaaSaaaeaacaqGTbGaaeyAaiaab6gadaqadaqaaiaadM hadaWgaaWcbaGaamyAaaqabaGccaaISaWaaSaaaeaacaWGlbaabaGa amizamaaBaaaleaacaWGPbaabeaaaaaakiaawIcacaGLPaaaaeaaca WG5bWaaSbaaSqaaiaadMgaaeqaaaaakiaai6cacaaMf8UaaGzbVlaa ywW7caaMf8UaaGzbVlaacIcacaaI0aGaaiOlaiaaiodacaGGPaaaaa@55D9@

If min( y i ,K/ d i )= y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGTbGaae yAaiaab6gadaqadaqaaiaadMhadaWgaaWcbaGaamyAaaqabaGccaaI SaWaaSGbaeaacaWGlbaabaGaamizamaaBaaaleaacaWGPbaabeaaaa aakiaawIcacaGLPaaacqGH9aqpcaWG5bWaaSbaaSqaaiaadMgaaeqa aaaa@4599@ (that is, if unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbaaaa@3948@ is not influential), then d ˜ i = d i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGKbGbaG aadaWgaaWcbaGaamyAaaqabaGccqGH9aqpcaWGKbWaaSbaaSqaaiaa dMgaaeqaaOGaaiOlaaaa@3E3B@ Thus, the weight of a non-influential unit is not modified. In contrast, the modified weight of an influential unit is less than d i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGKbWaaS baaSqaaiaadMgaaeqaaaaa@3A5D@ and may even be less than 1. It is worth noting that a unit with a value of y i =0 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaS baaSqaaiaadMgaaeqaaOGaeyypa0JaaGimaaaa@3C3C@ presents no particular problems, since its contribution to the estimated total, t ^ s , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaam4CaaqabaGccaaISaaaaa@3B47@ is zero. In this case, an arbitrary value can be assigned to the modified weight d ˜ i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGKbGbaG aadaWgaaWcbaGaamyAaaqabaGccaaIUaaaaa@3B2E@

In the case of Dalén-Tambay winsorization, the values of the variable of interest after winsorization are defined by

y ˜ i ={ y i if  d i y i K K d i + 1 d i ( y i K d i ) if  d i y i >K .(4.4) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG5bGbaG aadaWgaaWcbaGaamyAaaqabaGccqGH9aqpdaGabaqaauaabaqaciaa aeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaaGcbaGaaeyAaiaabAgaca qGGaGaamizamaaBaaaleaacaWGPbaabeaakiaadMhadaWgaaWcbaGa amyAaaqabaGccqGHKjYOcaWGlbaabaWaaSaaaeaacaWGlbaabaGaam izamaaBaaaleaacaWGPbaabeaaaaGccqGHRaWkdaWcaaqaaiaaigda aeaacaWGKbWaaSbaaSqaaiaadMgaaeqaaaaakmaabmaabaGaamyEam aaBaaaleaacaWGPbaabeaakiabgkHiTmaalaaabaGaam4saaqaaiaa dsgadaWgaaWcbaGaamyAaaqabaaaaaGccaGLOaGaayzkaaaabaGaae yAaiaabAgacaqGGaGaamizamaaBaaaleaacaWGPbaabeaakiaadMha daWgaaWcbaGaamyAaaqabaGccaaMe8UaaeOpaiaaysW7caWGlbaaaa Gaay5EaaGaaGzbVlaac6cacaaMf8UaaGzbVlaacIcacaaI0aGaaiOl aiaaisdacaGGPaaaaa@6A68@

This leads to the winsorized estimator of the total t y : MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG0bWaaS baaSqaaiaadMhaaeqaaOGaaiOoaaaa@3B89@

t ^ DT = iS d i y ˜ i (4.5) = t ^ +Δ( K ), MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaafaqaaeGaca aabaGabmiDayaajaWaaSbaaSqaaiaabseacaqGubaabeaaaOqaaiab g2da9maaqafabaGaamizamaaBaaaleaacaWGPbaabeaakiqadMhaga acamaaBaaaleaacaWGPbaabeaaaeaacaWGPbGaeyicI4Saam4uaaqa b0GaeyyeIuoakiaaywW7caaMf8UaaGzbVlaaywW7caaMf8Uaaiikai aaisdacaGGUaGaaGynaiaacMcaaeaaaeaacqGH9aqpceWG0bGbaKaa cqGHRaWkcqqHuoardaqadaqaaiaadUeaaiaawIcacaGLPaaacaaISa aaaaaa@589F@

where

Δ( K )= iS ( d i 1 ) d i max( 0, d i y i K ). MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqqHuoarda qadaqaaiaadUeaaiaawIcacaGLPaaacqGH9aqpcqGHsisldaaeqbqa bSqaaiaadMgacqGHiiIZcaWGtbaabeqdcqGHris5aOWaaSaaaeaada qadaqaaiaadsgadaWgaaWcbaGaamyAaaqabaGccqGHsislcaaIXaaa caGLOaGaayzkaaaabaGaamizamaaBaaaleaacaWGPbaabeaaaaGcci GGTbGaaiyyaiaacIhadaqadaqaaiaaicdacaaISaGaamizamaaBaaa leaacaWGPbaabeaakiaadMhadaWgaaWcbaGaamyAaaqabaGccqGHsi slcaWGlbaacaGLOaGaayzkaaGaaGOlaaaa@574F@

Estimator (4.5) can also be written in the form (3.1). As in the case of t ^ s , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaam4CaaqabaGccaGGSaaaaa@3B41@ an alternative is to express t ^ DT MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaaeiraiaabsfaaeqaaaaa@3B2D@ as a weighted sum of the initial values using modified weights:

t ^ DT = iS d ˜ i y i , MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaaeiraiaabsfaaeqaaOGaeyypa0ZaaabuaeaaceWG KbGbaGaadaWgaaWcbaGaamyAaaqabaGccaWG5bWaaSbaaSqaaiaadM gaaeqaaaqaaiaadMgacqGHiiIZcaWGtbaabeqdcqGHris5aOGaaGil aaaa@4692@

where

d ˜ i =1+( d i 1 ) min( y i , K d i ) y i .(4.6) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGKbGbaG aadaWgaaWcbaGaamyAaaqabaGccqGH9aqpcaaIXaGaey4kaSYaaeWa aeaacaWGKbWaaSbaaSqaaiaadMgaaeqaaOGaeyOeI0IaaGymaaGaay jkaiaawMcaamaalaaabaGaaeyBaiaabMgacaqGUbWaaeWaaeaacaWG 5bWaaSbaaSqaaiaadMgaaeqaaOGaaGilamaalaaabaGaam4saaqaai aadsgadaWgaaWcbaGaamyAaaqabaaaaaGccaGLOaGaayzkaaaabaGa amyEamaaBaaaleaacaWGPbaabeaaaaGccaaIUaGaaGzbVlaaywW7ca aMf8UaaGzbVlaaywW7caGGOaGaaGinaiaac6cacaaI2aGaaiykaaaa @5AAA@

As in the case of the standard winsorized estimator, the weight of a non-influential unit is not modified. Unlike standard winsorization, Dalén-Tambay winsorization guarantees that the modified weights will not be less than 1. Once again, a unit with a value of y i =0 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaS baaSqaaiaadMgaaeqaaOGaeyypa0JaaGimaaaa@3C3C@ presents no particular problems, since its contribution to the estimated total, t ^ DT , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaaeiraiaabsfaaeqaaOGaaGilaaaa@3BED@ is zero. In this case, an arbitrary value can be assigned to the modified weight d ˜ i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGKbGbaG aadaWgaaWcbaGaamyAaaqabaGccaaIUaaaaa@3B2E@

Since the standard and Dalén-Tambay winsorized estimators are of the form (3.1), the optimal constant K opt MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGlbWaaS baaSqaaiaab+gacaqGWbGaaeiDaaqabaaaaa@3C32@ that minimizes (3.2) is obtained by solving

Δ( K )= 1 2 ( B ^ min + B ^ max ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqqHuoarda qadaqaaiaadUeaaiaawIcacaGLPaaacqGH9aqpcqGHsisldaWcaaqa aiaaigdaaeaacaaIYaaaamaabmaabaGabmOqayaajaWaaSbaaSqaai aab2gacaqGPbGaaeOBaaqabaGccqGHRaWkceWGcbGbaKaadaWgaaWc baGaaeyBaiaabggacaqG4baabeaaaOGaayjkaiaawMcaaaaa@49B3@

or

jS a j max( 0, d j y j K ) = B ^ min + B ^ max 2 ,(4.7) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaadaaeqbqaai aadggadaWgaaWcbaGaamOAaaqabaGcciGGTbGaaiyyaiaacIhadaqa daqaaiaaicdacaaISaGaamizamaaBaaaleaacaWGQbaabeaakiaadM hadaWgaaWcbaGaamOAaaqabaGccqGHsislcaWGlbaacaGLOaGaayzk aaaaleaacaWGQbGaeyicI4Saam4uaaqab0GaeyyeIuoakiabg2da9m aalaaabaGabmOqayaajaWaaSbaaSqaaiaab2gacaqGPbGaaeOBaaqa baGccqGHRaWkceWGcbGbaKaadaWgaaWcbaGaaeyBaiaabggacaqG4b aabeaaaOqaaiaaikdaaaGaaGilaiaaywW7caaMf8UaaGzbVlaaywW7 caaMf8UaaiikaiaaisdacaGGUaGaaG4naiaacMcaaaa@6207@

where a j =1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGHbWaaS baaSqaaiaadQgaaeqaaOGaeyypa0JaaGymaaaa@3C26@ in the case of t ^ s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaam4Caaqabaaaaa@3A87@ and a j = ( d j 1 )/ d j MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGHbWaaS baaSqaaiaadQgaaeqaaOGaeyypa0ZaaSGbaeaadaqadaqaaiaadsga daWgaaWcbaGaamOAaaqabaGccqGHsislcaaIXaaacaGLOaGaayzkaa aabaGaamizamaaBaaaleaacaWGQbaabeaaaaaaaa@42C4@ in the case of t ^ DT . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaaeiraiaabsfaaeqaaOGaaiOlaaaa@3BE9@ It is shown in the Appendix that a solution to equation (4.7) exists under the following conditions:

  1. π i j π i π j 0 ;  and MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqaHapaCda WgaaWcbaGaamyAaiaadQgaaeqaaOGaeyOeI0IaeqiWda3aaSbaaSqa aiaadMgaaeqaaOGaeqiWda3aaSbaaSqaaiaadQgaaeqaaOGaeyizIm QaaGimaiaacUdacaqGGaGaaeyyaiaab6gacaqGKbaaaa@4967@
  2. 1 2 ( B ^ min + B ^ max ) 0. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaadaWcaaqaai aaigdaaeaacaaIYaaaamaabmaabaGabmOqayaajaWaaSbaaSqaaiaa b2gacaqGPbGaaeOBaaqabaGccqGHRaWkceWGcbGbaKaadaWgaaWcba GaaeyBaiaabggacaqG4baabeaaaOGaayjkaiaawMcaaiabgwMiZkaa icdacaGGUaaaaa@4734@

Condition 1 is satisfied for most one-stage designs used in practice, such as stratified simple random sampling and Poisson sampling. Condition 2 implies that t ^ R MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaamOuaaqabaaaaa@3A66@ must be less than or equal to t ^ , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aacaGGSaaaaa@3A13@ since by construction, a winsorized estimator cannot be greater than the Horvitz-Thompson estimator. It is generally expected that Condition 2 will be satisfied in most skewed populations encountered in business surveys and social surveys. It is also shown in the Appendix that the solution to equation (4.7) is unique if the above conditions are met and if y i 0 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaS baaSqaaiaadMgaaeqaaOGaeyyzImRaaGimaaaa@3CFC@ for i S . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbGaey icI4Saam4uaiaac6caaaa@3C56@ The Appendix contains a brief description of an algorithm for finding the solution to equation (4.7).

It should be noted that while the value K opt MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGlbWaaS baaSqaaiaab+gacaqGWbGaaeiDaaqabaaaaa@3C32@ is different for each type of winsorized estimator used, the resulting robust estimators are identical. In other words, we have

t ^ s ( K opt )= t ^ DT ( K opt )= t ^ R = t ^ B ^ min + B ^ max 2 .(4.8) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaam4CaaqabaGcdaqadaqaaiaadUeadaWgaaWcbaGa ae4BaiaabchacaqG0baabeaaaOGaayjkaiaawMcaaiabg2da9iqads hagaqcamaaBaaaleaacaqGebGaaeivaaqabaGcdaqadaqaaiaadUea daWgaaWcbaGaae4BaiaabchacaqG0baabeaaaOGaayjkaiaawMcaai abg2da9iqadshagaqcamaaBaaaleaacaWGsbaabeaakiabg2da9iqa dshagaqcaiabgkHiTmaalaaabaGabmOqayaajaWaaSbaaSqaaiaab2 gacaqGPbGaaeOBaaqabaGccqGHRaWkceWGcbGbaKaadaWgaaWcbaGa aeyBaiaabggacaqG4baabeaaaOqaaiaaikdaaaGaaGOlaiaaywW7ca aMf8UaaGzbVlaaywW7caaMf8UaaiikaiaaisdacaGGUaGaaGioaiaa cMcaaaa@64CE@

To compare the influence of each population unit with respect to the (non-robust) expansion estimator, t ^ , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aacaaISaaaaa@3A19@ and its robust version (4.8), we carried out a simulation study. For that purpose, we generated two populations, each of size N = 100. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGobGaey ypa0JaaGymaiaaicdacaaIWaGaaiOlaaaa@3D58@ One population was generated according to a normal distribution with mean 4,108 and standard deviation 1,500, and the other was generated according to a lognormal distribution with mean 4,108 and standard deviation 7,373. From each population we selected M = 500,000 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGnbGaey ypa0JaaeynaiaabcdacaqGWaGaaeilaiaabcdacaqGWaGaaeimaaaa @3F5C@ samples according to two sampling designs: (i) a simple random sampling without-replacement design of size n = 10 , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGUbGaey ypa0JaaGymaiaaicdacaGGSaaaaa@3CBC@ and (ii) a Bernoulli design of expected size n = 10. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGUbGaey ypa0JaaGymaiaaicdacaGGUaaaaa@3CBE@ First, we calculated the conditional bias of the Horvitz-Thompson estimator for a simple random sampling without-replacement design, given in (2.3) and for a Bernoulli design, given in (2.4). Note that the conditional bias of the Horvitz-Thompson estimator does not have to be approximated by simulation since all the population parameters are known. The conditional bias associated with unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbaaaa@398C@ of the robust estimator given in (3.3) was approximated as follows: Out of the 500,000 selected samples, we identified those which contained unit i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbGaai Olaaaa@3A3E@ In each of these samples, we calculated the error, t ^ R t . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaamOuaaqabaGccqGHsislcaWG0bGaaiOlaaaa@3D08@ Finally, we calculated the average value of t ^ R t MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaamOuaaqabaGccqGHsislcaWG0baaaa@3C56@ over all the samples containing unit i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0db9peuD0lXxcrpe0=1qpeea0=yrVue9 Fve9Fje8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGPbGaai Olaaaa@3A3E@

The results for the simple random sampling without-replacement design for the normal and lognormal distributions are shown in Figures 4.1 (a) and 4.1 (b) respectively. The results for the Bernoulli sampling design for the normal and lognormal distributions are shown in Figures 4.1 (c) and 4.1 (d) respectively. In each figure, the absolute value of the conditional bias of t ^ R MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaamOuaaqabaaaaa@3A66@ is shown in relation to the absolute value of the conditional bias of t ^ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aaaaa@3963@ for each population unit. The units above the first bisectrix have a conditional bias associated with t ^ R MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaamOuaaqabaaaaa@3A66@ whose absolute value is greater than that of the conditional bias associated with t ^ . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aacaGGUaaaaa@3A15@ Looking first at the results for simple random sampling without replacement, we see that the behaviour of the absolute value of the conditional bias of t ^ R MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaamOuaaqabaaaaa@3A66@ is similar to that of the absolute value of the conditional bias of t ^ , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aacaGGSaaaaa@3A13@ which indicates that the influence of the units is not altered significantly after robustification of the expansion estimator. This result is not surprising since the population does not contain any highly influential units. In the case of the lognormal distribution, we see that the influence of the values that have a high conditional bias associated with t ^ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aaaaa@3963@ has been reduced significantly. On the other hand, we note that for the majority of the data, the conditional bias of t ^ R MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaamOuaaqabaaaaa@3A66@ is slightly higher than that of t ^ . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aacaGGUaaaaa@3A15@ Turning to the results for Bernoulli sampling, we see that in the case of the normal population, the influence of most units has been reduced, since the absolute value of the conditional bias of t ^ R MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aadaWgaaWcbaGaamOuaaqabaaaaa@3A66@ is significantly lower than the absolute value of the conditional bias of t ^ . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG0bGbaK aacaGGUaaaaa@3A15@ In the case of the lognormal distribution, the results are similar to those obtained with simple random sampling without replacement for the same distribution.

Figure 4.1 Absolute value of the conditional biases of the robust and non-robust estimators

Figure 4.1 Absolute value of the conditional biases of the robust and non-robust estimators

Description for Figure 4.1

Previous | Next

Date modified: