A design effect measure for calibration weighting in single-stage samples 4. Empirical evaluation

We conducted two simulation studies using data that mimic single-stage sampling. The first utilizes publically-available data from tax returns and continuous variables of interest, while the second examines the performance of the alternative measures for a binary outcome measure in a single-stage survey.

4.1 Establishment data simulation study

Here a sample dataset of tax return data is used to mimic an establishment survey setup. The data come from the Tax Year 2007 Statistics of Income (SOI) Form 990 Exempt Organization (EO) sample. This is a stratified Bernoulli sample of 22,430 EO tax returns selected from 428,719 filed with and processed by the IRS between December 2007 and November 2009. This sample dataset, along with the population frame data, is free and electronically available online (Statistics of Income 2011). These data make a candidate “establishment-type” dataset for estimating design effects, in which Kish’s design effect may not apply.

The SOI EO sample dataset is used here as a pseudopopulation for illustration. Four variables of interest are used: Total Assets, Total Liabilities, Total Revenue, and Total Expenses. Returns that were sampled with certainty or that had “very small” assets (defined by having Total Assets less than $1,000,000, including zero) were removed, leaving 8,914 units. We then randomly replicated and perturbed the data to create a pseudopopulation of 50,000 units. We used simple random sampling with replacement to select more observations, then the additional data values were perturbed using the “jitter” (Chambers, Cleveland, Kleiner and Tukey 1983) function in R.

Figure 4.1 shows a pairwise plot of the pseudo-population, including plots of the variable values against each other in the lower left panels, histograms on the diagonal panels, and the correlations among the variables in the upper right panels. This plot mimics establishment-type data patterns. From the diagonal panels, we see that the variables of interest are all highly skewed. From the lower left panels, there exists a range of different relationships among them. The Total Assets variable is less related to Total Revenue and Total Expenses (with moderate correlations of 0.41 0.44 ) ; MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9pC0xbbf9=e0dfrpm0dXdirVu0=vr 0=vr0=fdbaqaaeGacaGaaiaabeqaamaabaabaaGcbaGaaGimaiaac6 cacaaI0aGaaGymaiabgkHiTiaaicdacaGGUaGaaGinaiaaisdacaGG PaGaai4oaaaa@407B@ Total Revenue and Total Expenses are highly correlated.

Figure 4.1 of section 4 Design of the Dutch Labour Force Survey

Description for Figure 4.1

Figure 4.1 shows a pairwise plot of the pseudo-population, including plots of the variable values against each other in the lower left panels, histograms on the diagonal panels, and the correlations among the variables in the upper right panels. The variables are Total Assets, Total Liabilities, Total Revenue and Total Expenses. From the diagonal panels, we see that the variables of interest are all highly left-skewed, the less skewed variable being Total Assets. From the lower left panels, there exists a range of different relationships among them. Correlations between the variables are given in the following table.

Data table for Figure 4.1
Table summary
This table displays the results of Data table for Figure 4.1. The information is grouped by Variables (appearing as row headers), Total Assets, Total Liabilities, Total Revenue and Total Expenses (appearing as column headers).
Variables Total Assets Total Liabilities Total Revenue Total Expenses
Total Assets 1 0.56 0.44 0.41
Total Liabilities 0.56 1 0.42 0.44
Total Revenue 0.44 0.42 1 0.99
Total Expenses 0.41 0.44 0.99 1

 

Three sizes of samples were selected ( n = 100 ; 500 ; 1,000 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaadaqadeqaai aad6gacqGH9aqpcaaIXaGaaGimaiaaicdacaGG7aGaaGynaiaaicda caaIWaGaai4oaiaabgdacaqGSaGaaeimaiaabcdacaqGWaaacaGLOa Gaayzkaaaaaa@4544@ without replacement from the pseudopopulation using the square root of Total Assets as a measure of size. This type of sampling is referred to as π ps MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqaHapaCca qGWbGaae4Caaaa@3C0B@ sampling subsequently. The HT weights were then calibrated using the “linear” method in the “calibrate” function in the “survey” package for R (corresponding to a GREG estimator, Lumley 2012) to match the totals of an intercept, Total Assets and Total Revenue. The analysis variables are Total Liabilities and Total Expenses. (Note that we follow the common practice of developing procedures in the previous sections using formulas for with-replacement sampling but empirically evaluating them in without-replacement samples, which are the type used in applications.)

Eight design effects estimates are considered:

Note that neither the Spencer nor the proposed measures account for any reduction in variances due to sampling a large fraction of the population.

We selected ten thousand samples to further understand the empirical behavior of the alternative design effect estimators. The empirical relbiases and ratio of the mean square errors (MSE’s) of the totals are

relbias ( T ^ ) = 100 × s = 1 S ( T ^ s T ) / T MSE ratio = MSE ( T ^ HT ) / MSE ( T ^ GREG ) = s = 1 S ( T ^ HT , s T ) 2 / s = 1 S ( T ^ GREG , s T ) 2 MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaafaqaae4aca aabaGaaeOCaiaabwgacaqGSbGaaeOyaiaabMgacaqGHbGaae4Camaa bmaabaGabmivayaajaaacaGLOaGaayzkaaaabaGaeyypa0ZaaSGbae aacaaIXaGaaGimaiaaicdacqGHxdaTdaaeWaqaamaabmaabaGabmiv ayaajaWaaSbaaSqaaiaadohaaeqaaOGaeyOeI0IaamivaaGaayjkai aawMcaaaWcbaGaam4Caiabg2da9iaaigdaaeaacaWGtbaaniabggHi LdaakeaacaWGubaaaaqaaiaab2eacaqGtbGaaeyraiaabccacaqGYb GaaeyyaiaabshacaqGPbGaae4Baaqaaiabg2da9maalyaabaGaaeyt aiaabofacaqGfbWaaeWaaeaaceWGubGbaKaadaWgaaWcbaGaaeisai aabsfaaeqaaaGccaGLOaGaayzkaaaabaGaaeytaiaabofacaqGfbWa aeWaaeaaceWGubGbaKaadaWgaaWcbaGaae4raiaabkfacaqGfbGaae 4raaqabaaakiaawIcacaGLPaaaaaaabaaabaGaeyypa0ZaaSGbaeaa daaeWaqaamaabmaabaGabmivayaajaWaaSbaaSqaaiaabIeacaqGub GaaiilaiaadohaaeqaaOGaeyOeI0IaamivaaGaayjkaiaawMcaaaWc baGaam4Caiabg2da9iaaigdaaeaacaWGtbaaniabggHiLdGcdaahaa WcbeqaaiaaikdaaaaakeaadaaeWaqaamaabmaabaGabmivayaajaWa aSbaaSqaaiaabEeacaqGsbGaaeyraiaabEeacaGGSaGaam4Caaqaba GccqGHsislcaWGubaacaGLOaGaayzkaaWaaWbaaSqabeaacaaIYaaa aaqaaiaadohacqGH9aqpcaaIXaaabaGaam4uaaqdcqGHris5aaaaaa aaaa@8A13@

where T ^ s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGubGbaK aadaWgaaWcbaGaam4Caaqabaaaaa@3A72@ is an estimated total from sample s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGZbaaaa@395D@ (either HT or GREG), S = 10,000 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGtbGaey ypa0JaaeymaiaabcdacaqGSaGaaeimaiaabcdacaqGWaaaaa@3E72@ is the number of samples selected, and T ^ HT , s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGubGbaK aadaWgaaWcbaGaaeisaiaabsfacaGGSaGaam4Caaqabaaaaa@3CC4@ and T ^ GREG , s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGubGbaK aadaWgaaWcbaGaae4raiaabkfacaqGfbGaae4raiaacYcacaWGZbaa beaaaaa@3E53@ are the estimated HT and GREG totals from sample s . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGZbGaai Olaaaa@3A0F@ The empirical deff MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbaaaa@3C06@ of an estimated total is computed as empdeff ( T ^ ) = S 1 s = 1 S ( T ^ s T ^ ¯ ) 2 / Var srswr ( T ^ srswr ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGLbGaae yBaiaabchacaqGKbGaaeyzaiaabAgacaqGMbWaaeWaaeaaceWGubGb aKaaaiaawIcacaGLPaaacqGH9aqpcaWGtbWaaWbaaSqabeaacqGHsi slcaaIXaaaaOWaaSGbaeaadaaeWaqaamaabmaabaGabmivayaajaWa aSbaaSqaaiaadohaaeqaaOGaeyOeI0IabmivayaajyaaraaacaGLOa GaayzkaaWaaWbaaSqabeaacaaIYaaaaaqaaiaadohacqGH9aqpcaaI XaaabaGaam4uaaqdcqGHris5aaGcbaGaaeOvaiaabggacaqGYbWaaS baaSqaaiaabohacaqGYbGaae4CaiaabEhacaqGYbaabeaakmaabmaa baGabmivayaajaWaaSbaaSqaaiaabohacaqGYbGaae4CaiaabEhaca qGYbaabeaaaOGaayjkaiaawMcaaaaaaaa@604E@ where T ^ ¯ = S 1 s = 1 S T ^ s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGubGbaK GbaebacqGH9aqpcaWGtbWaaWbaaSqabeaacqGHsislcaaIXaaaaOWa aabmaeaaceWGubGbaKaadaWgaaWcbaGaam4CaaqabaaabaGaam4Cai abg2da9iaaigdaaeaacaWGtbaaniabggHiLdaaaa@44B7@ and Var srswr ( T ^ srswr ) N 2 σ y 2 / n . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGwbGaae yyaiaabkhadaWgaaWcbaGaae4CaiaabkhacaqGZbGaae4Daiaabkha aeqaaOWaaeWaaeaaceWGubGbaKaadaWgaaWcbaGaae4Caiaabkhaca qGZbGaae4DaiaabkhaaeqaaaGccaGLOaGaayzkaaGaeSiuIi0aaSGb aeaacaWGobWaaWbaaSqabeaacaaIYaaaaOGaeq4Wdm3aa0baaSqaai aadMhaaeaacaaIYaaaaaGcbaGaamOBaaaacaGGUaaaaa@500F@

The results for relbiases and MSEs are shown in Table 4.1. Both estimators of totals are approximately unbiased. The GREG is also more precise than the HT estimator, especially for Total Expenses, as evidenced by the MSE ratios larger than one.

Table 4.1
Simulation results of HT and GREG totals, 10,000 πps MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqipu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacqaHapaCca qGWbGaae4Caaaa@3C05@ samples drawn from the SOI 2007 pseudopopulation EO data
Table summary
This table displays the results of Simulation results of HT and GREG totals Variable of Interest, Total Liabilities
(weakly correlated with X) and Total Expenses
(strongly correlated with X) , calculated using XXXXX units of measure (appearing as column headers).
  Variable of Interest
Total Liabilities
(weakly correlated with X)
Total Expenses
(strongly correlated with X)
n=100 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaGymaiaaicdacaaIWaaaaa@3EBA@ n=500 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaGynaiaaicdacaaIWaaaaa@3EBE@ n=1,000 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaeymaiaabYcacaqGWaGaaeimaiaabcdaaaa@4007@ n=100 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaGymaiaaicdacaaIWaaaaa@3EBA@ n=500 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaGynaiaaicdacaaIWaaaaa@3EBE@ n=1,000 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaeymaiaabYcacaqGWaGaaeimaiaabcdaaaa@4007@
Estimates  
Percent relbias(HT) -0.13 0.07 0.03 -0.64 0.05 0.07
Percent relbias(GREG) 0.37 0.27 0.14 -0.12 -0.01 0.00
MSE ratio 1.17 1.20 1.19 34.89 50.11 48.26

 

We also computed the biases of the various estimated design effects across the 10,000 samples. The relbiases of the Kish, Spencer, and proposed design effect estimates are computed as

relbias( deff K )=100× ( deff ¯ K edeff( T ^ HTy ) )/ edeff( T ^ HTy ) , MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGYbGaae yzaiaabYgacaqGIbGaaeyAaiaabggacaqGZbWaaeWaaeaacaqGKbGa aeyzaiaabAgacaqGMbWaaSbaaSqaaiaadUeaaeqaaaGccaGLOaGaay zkaaGaeyypa0JaaGymaiaaicdacaaIWaGaey41aq7aaSGbaeaadaqa daqaamaanaaabaGaaeizaiaabwgacaqGMbGaaeOzaaaadaWgaaWcba Gaam4saaqabaGccqGHsislcaqGLbGaaeizaiaabwgacaqGMbGaaeOz amaabmaabaGabmivayaajaWaaSbaaSqaaiaabIeacaqGubGaamyEaa qabaaakiaawIcacaGLPaaaaiaawIcacaGLPaaaaeaacaqGLbGaaeiz aiaabwgacaqGMbGaaeOzamaabmaabaGabmivayaajaWaaSbaaSqaai aabIeacaqGubGaamyEaaqabaaakiaawIcacaGLPaaaaaGaaiilaaaa @65ED@

relbias( deff S )=100× ( deff ¯ S edeff( T ^ HTy ) )/ edeff( T ^ HTy ) , MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGYbGaae yzaiaabYgacaqGIbGaaeyAaiaabggacaqGZbWaaeWaaeaacaqGKbGa aeyzaiaabAgacaqGMbWaaSbaaSqaaiaadofaaeqaaaGccaGLOaGaay zkaaGaeyypa0JaaGymaiaaicdacaaIWaGaey41aq7aaSGbaeaadaqa daqaamaanaaabaGaaeizaiaabwgacaqGMbGaaeOzaaaadaWgaaWcba Gaam4uaaqabaGccqGHsislcaqGLbGaaeizaiaabwgacaqGMbGaaeOz amaabmaabaGabmivayaajaWaaSbaaSqaaiaabIeacaqGubGaamyEaa qabaaakiaawIcacaGLPaaaaiaawIcacaGLPaaaaeaacaqGLbGaaeiz aiaabwgacaqGMbGaaeOzamaabmaabaGabmivayaajaWaaSbaaSqaai aabIeacaqGubGaamyEaaqabaaakiaawIcacaGLPaaaaaGaaiilaaaa @65FD@

and

relbias( deff H )=100× ( deff ¯ H edeff( T ^ GREG ) )/ edeff( T ^ GREG ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGYbGaae yzaiaabYgacaqGIbGaaeyAaiaabggacaqGZbWaaeWaaeaacaqGKbGa aeyzaiaabAgacaqGMbWaaSbaaSqaaiaadIeaaeqaaaGccaGLOaGaay zkaaGaeyypa0JaaGymaiaaicdacaaIWaGaey41aq7aaSGbaeaadaqa daqaamaanaaabaGaaeizaiaabwgacaqGMbGaaeOzaaaadaWgaaWcba GaamisaaqabaGccqGHsislcaqGLbGaaeizaiaabwgacaqGMbGaaeOz amaabmaabaGabmivayaajaWaaSbaaSqaaiaabEeacaqGsbGaaeyrai aabEeaaeqaaaGccaGLOaGaayzkaaaacaGLOaGaayzkaaaabaGaaeyz aiaabsgacaqGLbGaaeOzaiaabAgadaqadaqaaiqadsfagaqcamaaBa aaleaacaqGhbGaaeOuaiaabweacaqGhbaabeaaaOGaayjkaiaawMca aaaaaaa@6659@

where deff ¯ K , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaadaqdaaqaai aabsgacaqGLbGaaeOzaiaabAgaaaWaaSbaaSqaaiaadUeaaeqaaOGa aiilaaaa@3DCD@ deff ¯ S , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaadaqdaaqaai aabsgacaqGLbGaaeOzaiaabAgaaaWaaSbaaSqaaiaadofaaeqaaOGa aiilaaaa@3DD5@ and deff ¯ H MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaadaqdaaqaai aabsgacaqGLbGaaeOzaiaabAgaaaWaaSbaaSqaaiaadIeaaeqaaaaa @3D10@ are the average Kish, Spencer, and proposed deff s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbacbaGaa8xgGiaabohaaaa@3DBF@ over all samples. The terms edeff ( T ^ HT y ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGLbGaae izaiaabwgacaqGMbGaaeOzamaabmaabaGabmivayaajaWaaSbaaSqa aiaabIeacaqGubGaamyEaaqabaaakiaawIcacaGLPaaaaaa@4236@ and edeff ( T ^ GREG ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGLbGaae izaiaabwgacaqGMbGaaeOzamaabmaabaGabmivayaajaWaaSbaaSqa aiaabEeacaqGsbGaaeyraiaabEeaaeqaaaGccaGLOaGaayzkaaaaaa@42C7@ are computed in two ways: (1) as the simulation empdeff MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGLbGaae yBaiaabchacaqGKbGaaeyzaiaabAgacaqGMbaaaa@3ED1@ of T ^ HT y ( or  T ^ GREG ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGubGbaK aadaWgaaWcbaGaaeisaiaabsfacaWG5baabeaakmaabmaabaGaae4B aiaabkhacaqGGaGabmivayaajaWaaSbaaSqaaiaabEeacaqGsbGaae yraiaabEeaaeqaaaGccaGLOaGaayzkaaGaaiilaaaa@4537@ and (2) as the average over all samples of the deff s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbacbaGaa8xgGiaabohaaaa@3DBF@ of T ^ HT y MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWGubGbaK aadaWgaaWcbaGaaeisaiaabsfacaWG5baabeaaaaa@3C1A@ computed from the “survey” package. The “survey” package’s default method of estimating the deff MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbaaaa@3C06@ from a particular sample uses a with-replacement variance estimate in the numerator. This corresponds to the sample design used to derive deff H . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbWaaSbaaSqaaiaadIeaaeqaaOGaaiOlaaaa@3DBB@ Results are displayed in Table 4.2.

For both variables of interest, we see large positive biases for the Kish design effect, and the design effects involving approximations. Thus, ignoring correlation components accounted for in the ‘exact’ Spencer and proposed design effects would lead to over-estimating the design effects.

The proposed estimator is closer to the “survey” package design effects than to the empirical simulation deff s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbacbaGaa8xgGiaabohaaaa@3DBF@ of the GREG. Although the relbiases of deff H MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbWaaSbaaSqaaiaadIeaaeqaaaaa@3CFF@ are fairly large for Total Expenses when computed with respect to edeff, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGLbGaae izaiaabwgacaqGMbGaaeOzaiaabYcaaaa@3D9D@ the empirical deff s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbacbaGaa8xgGiaabohaaaa@3DBF@ themselves are small. We highlight the small magnitude of the Total Expenses ( y 2 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaadaqadaqaai aadMhadaWgaaWcbaGaaGOmaaqabaaakiaawIcacaGLPaaaaaa@3BDE@ variable deff MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbaaaa@3C06@ of 0.02 to put the relbiases into context. For example, the relbias of 12.9% for the exact version of our proposed estimator for n = 500 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGUbGaey ypa0JaaGynaiaaicdacaaIWaaaaa@3C91@ for y 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaS baaSqaaiaaikdaaeqaaaaa@3A4B@ corresponds to a difference in the third decimal place. Specifically, in this scenario, on average we over-estimate the deff MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbaaaa@3C06@ by 0.003.

We can understand why calibration is more efficient for Expenses than for Liabilities by examining the distributions of y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaS baaSqaaiaadMgaaeqaaaaa@3A7D@ and u i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG1bWaaS baaSqaaiaadMgaaeqaaaaa@3A79@ in one particular sample. Figures 4.2 and 4.3 show boxplots of u i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG1bWaaS baaSqaaiaadMgaaeqaaaaa@3A79@ and y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaS baaSqaaiaadMgaaeqaaaaa@3A7D@ for each variable and sample size.

Table 4.2
Relative bias of design effect estimates, 10,000 πps MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqipu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacqaHapaCca qGWbGaae4Caaaa@3C05@ samples drawn from the SOI 2007 pseudopopulation EO data
Table summary
This table displays the results of Relative bias of design effect estimates Variable of Interest, Total Liabilities
(weakly correlated with X) and Total Expenses
(strongly correlated with X) , calculated using Relative biases w.r.t. empirical deff’s and Relative biases w.r.t. average of survey package deff’s units of measure (appearing as column headers).
  Variable of Interest
Total Liabilities
(weakly correlated with X)
Total Expenses
(strongly correlated with X)
n=100 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaGymaiaaicdacaaIWaaaaa@3EBA@ n=500 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaGynaiaaicdacaaIWaaaaa@3EBE@ n=1,000 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaeymaiaabYcacaqGWaGaaeimaiaabcdaaaa@4007@ n=100 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaGymaiaaicdacaaIWaaaaa@3EBA@ n=500 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaGynaiaaicdacaaIWaaaaa@3EBE@ n=1,000 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaeymaiaabYcacaqGWaGaaeimaiaabcdaaaa@4007@
Empirical deff’s*  
HT 0.51 0.50 0.50 0.56 0.65 0.64
GREG 0.43 0.42 0.42 0.02 0.02 0.02
  Relative biases w.r.t. empirical deff’s
Kish** 158.7 158.3 158.3 132.8 101.7 104.7
Spencer**  
Exact 2.6 2.0 1.8 9.9 -4.5 -2.2
Zero-corr. approx. 96.1 98.0 98.4 91.2 70.1 73.7
Large N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqGHsislca WGobaaaa@3C48@ approx. 96.7 98.9 99.3 101.7 78.1 81.7
Proposed***  
Exact -6.3 -1.6 0.2 25.3 12.9 8.1
Zero-corr. approx. 83.4 94.0 98.2 129.9 116.6 108.7
  Relative biases w.r.t. average of “survey” package deff’s
Kish** 219.7 211.3 209.4 6400.5 7786.2 8287.2
Spencer**  
Exact 3.1 0.8 0.5 3.5 -1.0 -1.5
Zero-corr. approx. 97.1 95.8 95.8 80.1 76.2 74.8
Large N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacqGHsislca WGobaaaa@3C48@ approx. 97.7 96.7 96.7 90.0 84.5 82.8
Proposed***  
Exact -0.9 -0.2 -0.1 11.3 -0.4 -0.1
Zero-corr. approx. 94.0 96.8 97.6 104.2 91.0 93.0

 

Figure 4.2 of section 4 Design of the Dutch Labour Force Survey

Description for Figure 4.2

Boxplots of y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFCI8FfYJH8YrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbb a9q8WqFfea0=yr0RYxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9 Ff0dmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaSbaaS qaaiaadMgaaeqaaaaa@378C@ and u i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFCI8FfYJH8YrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbb a9q8WqFfea0=yr0RYxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9 Ff0dmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWG1bWaaSbaaS qaaiaadMgaaeqaaOGaeyOeI0caaa@387F@ values for sample sizes n=100, 500 and 1,000, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFCI8FfYJH8YrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbb a9q8WqFfea0=yr0RYxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9 Ff0dmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGUbGaeyypa0 JaaGymaiaaicdacaaIWaGaaiilaiaabccacaaI1aGaaGimaiaaicda caqGGaGaaeyyaiaab6gacaqGKbGaaeiiaiaabgdacaqGSaGaaeimai aabcdacaqGWaGaaiilaaaa@4550@ for total liabilities variable. The u i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFCI8FfYJH8YrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbb a9q8WqFfea0=yr0RYxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9 Ff0dmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWG1bWaaSbaaS qaaiaadMgaaeqaaOGaeyOeI0caaa@387F@ values in all of these samples have shorter ranges of values, less extreme values and less variation than y i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFCI8FfYJH8YrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbb a9q8WqFfea0=yr0RYxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9 Ff0dmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaSbaaS qaaiaadMgaaeqaaOGaaiOlaaaa@3848@

 

Figure 4.3 of section 4 Design of the Dutch Labour Force Survey

Description for Figure 4.3

Boxplots of y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFCI8FfYJH8YrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbb a9q8WqFfea0=yr0RYxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9 Ff0dmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaSbaaS qaaiaadMgaaeqaaaaa@378C@ and u i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFCI8FfYJH8YrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbb a9q8WqFfea0=yr0RYxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9 Ff0dmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWG1bWaaSbaaS qaaiaadMgaaeqaaOGaeyOeI0caaa@387F@ values for sample sizes n=100, 500 and 1,000, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFCI8FfYJH8YrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbb a9q8WqFfea0=yr0RYxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9 Ff0dmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGUbGaeyypa0 JaaGymaiaaicdacaaIWaGaaiilaiaabccacaaI1aGaaGimaiaaicda caqGGaGaaeyyaiaab6gacaqGKbGaaeiiaiaabgdacaqGSaGaaeimai aabcdacaqGWaGaaiilaaaa@4550@ for total expenses variable. The u i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFCI8FfYJH8YrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbb a9q8WqFfea0=yr0RYxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9 Ff0dmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWG1bWaaSbaaS qaaiaadMgaaeqaaOGaeyOeI0caaa@387F@ values in all of these samples have shorter ranges of values, less extreme values and less variation than y i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFCI8FfYJH8YrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbb a9q8WqFfea0=yr0RYxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9 Ff0dmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaSbaaS qaaiaadMgaaeqaaOGaaiOlaaaa@3848@

 

The u i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9pC0xbbf9=e0dfrpm0dXdirVu0=vr 0=vr0=fdbaqaaeGacaGaaiaabeqaamaabaabaaGcbaGaamyDamaaBa aaleaacaWGPbaabeaakiabgkHiTaaa@3B60@ values in all of these samples have shorter ranges of values and less variation than y i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaS baaSqaaiaadMgaaeqaaOGaaiilaaaa@3B37@ particularly for the Total Expenses variable. This occurs since the Total Expenses variable is highly correlated with the calibration variable Total Revenue (see Figure 4.1) and explains why the direct and proposed design effect measures are so much smaller for Total Expenses.

4.2 Simulation study with a binary variable

The second simulation study illustrates the performance of the proposed estimator when estimating the total of a binary variable in a single-stage survey that uses poststratification.

We use the “nhis.large” population, which has N = 21,588 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGobGaey ypa0JaaeOmaiaabgdacaqGSaGaaeynaiaabIdacaqG4aaaaa@3E84@ units, from the “PracTools” R package (Valliant, Dever and Kreuter 2015) to gauge the impact of poststratification weighting adjustments. The binary variable used is whether or not a person received Medicaid or not. Receipt of Medicaid, which is a social welfare program in the US, is an example of a variable that is collected in some telephone surveys. Missing values of Medicaid recipiency were recoded to be “no” responses. There is a fairly strong relationship between race-ethnicity, age, and whether Medicaid is received, as shown in Table 4.3 or Table 14.1 in Valliant, Dever and Kreuter (2013). The 15 age × race-ethnicity cells in the table will be used as poststrata, which is a typical procedure in telephone surveys.

Table 4.3
Population percentages of persons receiving medicaid, by age group and Hispanic status
Table summary
This table displays the results of Population percentages of persons receiving medicaid. The information is grouped by Age Group (appearing as row headers), Hispanic Status (appearing as column headers).
Age Group Hispanic Status
Hispanic Non-Hispanic White Non-Hispanic Black or Other
< 18 years 31.8 12.9 30.9
18-24 10.5 6.5 12.2
25-44 7.5 3.8 8.6
45-64 2.4 3.0 6.2
65+ 26.8 3.7 16.2

 

In our simulation, we selected 10,000 simple random samples without replacement from the NHIS population. The HT estimator for the total number of persons receiving Medicaid is N y ¯ s , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGobGabm yEayaaraWaaSbaaSqaaiaadohaaeqaaOGaaiilaaaa@3C2C@ where y ¯ s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaaceWG5bGbae badaWgaaWcbaGaam4Caaqabaaaaa@3A9F@ is the proportion in sample s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGZbaaaa@395D@ that receives Medicaid. Due to the relatively large number of poststrata and varying number of persons receiving Medicaid by poststratum, we include results only for samples of size n = 500  and 1,000 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGUbGaey ypa0JaaGynaiaaicdacaaIWaGaaeiiaiaabggacaqGUbGaaeizaiaa bccacaqGXaGaaeilaiaabcdacaqGWaGaaeimaaaa@440F@ since no collapsing of poststrata within a given particular sample was needed for these sample sizes.

The base weights for the HT MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9pC0xbbf9=e0dfrpm0dXdirVu0=vr 0=vr0=fdbaqaaeGacaGaaiaabeqaamaabaabaaGcbaGaaeisaiaabs facqGHsislaaa@3AE4@ estimator are simply w i = N / n . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWG3bWaaS baaSqaaiaadMgaaeqaaOGaeyypa0ZaaSGbaeaacaWGobaabaGaamOB aaaacaGGUaaaaa@3E19@ The variance of the poststratified estimator is 91% of that of N y ¯ s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGobGabm yEayaaraWaaSbaaSqaaiaadohaaeqaaaaa@3B72@ in samples of n = 500 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGUbGaey ypa0JaaGynaiaaicdacaaIWaaaaa@3C91@ and 88% in samples of n = 1,000 . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWGUbGaey ypa0JaaeymaiaabYcacaqGWaGaaeimaiaabcdacaGGUaaaaa@3E8C@ Since the base weights are constant, Spencer’s design effects are not computable in this example. Therefore, only results for the Kish and proposed design effects are shown in Table 4.4.

Table 4.4
Relative bias of design effect estimates, 10,000 pps MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqipu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaWHWbGaaC iCaiaahohaaaa@3B43@ samples drawn from the NHIS pseudopopulation data
Table summary
This table displays the results of Relative bias of design effect estimates Number of Persons Receiving Medicaid, calculated using XXXXX units of measure (appearing as column headers).
  Number of Persons Receiving Medicaid
n=500 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaGynaiaaicdacaaIWaaaaa@3EBE@ n=1,000 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqk0Jf9crFfpeea0xh9v8qiW7rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meqabeqadiqaceGabeqabeWabeqaeeaakeaacaWGUbGaey ypa0JaaeymaiaabYcacaqGWaGaaeimaiaabcdaaaa@4007@
Empirical deff’s*  
HT 0.97 0.95
GREG 0.91 0.88
  w.r.t.
empirical deff
w.r.t.
“survey” deff
w.r.t.
empirical deff
w.r.t.
“survey” deff
Relative biases (percent)  
Kish** 6.0 17.5 7.0 17.6
Proposed***  
Exact -1.4 3.2 -0.9 5.0
Zero-corr. approx. -1.5 2.9 -1.2 4.7

 

The Kish design effect has positive biases of 17.5% and 17.6% when computed with respect to the empirical deff s . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbacbaGaa8xgGiaabohacaqGUaaaaa@3E70@ The exact proposed design effects are positively biased with respect to the “survey” deff MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpipeea0xe9LqFf0x e9q8qqvqFr0dXdbrVc=b0P0xb9peuD0xXddrpe0=1qpeea0=yrVue9 Fve9Fve8meaabaqaciaacaGaaeqabaWaaeaaeaaakeaacaqGKbGaae yzaiaabAgacaqGMbaaaa@3C06@ (3.2 and 5.0%), but much less so than the Kish estimator. In this example, the zero-correlation approximation is very similar to the exact version of the proposed estimator. The correlation components were negligible for these weighting adjustments within three decimal places.

Date modified: