Model-assisted calibration of non-probability sample survey data using adaptive LASSO
Section 6. Conclusion

In this manuscript, we developed the LASSO calibration estimator of population totals, T ^ y LASSO , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWGubGbaKaadaqhaaWcbaGaamyEaa qaaiaabYeacaqGbbGaae4uaiaabofacaqGpbaaaOGaaGzaVlaacYca aaa@3A5A@ given population auxiliary data. We also derived closed-form variance estimates for T ^ y LASSO . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWGubGbaKaadaqhaaWcbaGaamyEaa qaaiaabYeacaqGbbGaae4uaiaabofacaqGpbaaaOGaaGzaVlaac6ca aaa@3A5C@ Simulation results show that the point estimates are approximately unbiased under simple-random sampling and informative sampling. For sample selections that are related to analysis variables, LASSO was able to significantly reduce sample bias even without the correct design weights. LASSO tends to outperform stepwise-selected working models when covariates are highly collinear. For analysis with many categorical variables, where there are natural correlations between the categories, LASSO calibration estimator can perform better than traditional calibration estimators, even if the effect sizes are small. The improvement is modest in the continuous variable setting, but substantial when the outcome of interest is binary, as shown in simulations and in the NHIS data example. We have demonstrated theoretically and through simulations that LASSO calibration holds great promise in making unbiased inference of population totals from non-probability samples. Although asymptotic closed-form variance estimates did not produce very accurate nominal coverage, the naive bootstrap is a viable alternative approach. In an application to estimate population total of individuals diagnosed with cancer, without correct design weights, the LASSO calibration estimator was able to produce an estimate that is the closest to the estimate based on correct survey weights. LASSO calibration estimator also has the smallest standard error of all the estimators considered, although the bootstrap variance estimate that was used did not fully account for the clustering in the NHIS, which generally increases standard errors. The application shows that LASSO calibration can generate inference to the population for a specific outcome variable, and the inference is both more accurate and precise than traditional calibration estimators.

The question arises when use of LASSO model-assisted calibration should be used instead of traditional calibration methods such as GREG. Both theoretical and empirical results in this paper suggest that there is little to be lost in terms of statistical efficiency to use LASSO model-assisted calibration, it does require additional effort on the part of the analyst to implement. While we cannot give specific cutoffs, our analysis suggests that this effort will be worthwhile when a) there are large numbers of potential calibration variables, b) many of these calibration variables are likely to be highly correlated, and c) the outcome is binary rather than continuous. We believe that conditions a) and b), at least, are increasing likely to be encountered in non-probability settings, where administrative datasets might provide these types of calibration variables and subsets of data obtained through various means will contain the core variable of interest.

While LASSO provides particularly convenient and rapid implementation, there are, of course, other modern regression methods that could be considered in addition to LASSO to develop penalized regression models for high-dimensional model-assisted regression, including approaches such as ridge regression, principle components, or Bayesian additive regression trees (Chipman, George and McCulloch, 2010). These approaches provide opportunity for further research in this area.

Finally, we note that this work is only a part of a larger and rapidly expanding literature on inference from non-probability surveys. In addition to the work of McConville et al. (2017), the “Mr. P” (multi-level regression and poststratification or MRP) approach of Wang et al. (2015) also uses high dimensional covariates to adjust non-probabilities samples, by use of a hierarchical model rather than penalized regression. Quasi-randomization (Elliott, 2009; Elliott, Resler, Flannagan and Rupp, 2010; Elliott and Valliant, 2017) and sample matching (Rivers, 2007; Vavreck and Rivers, 2008) also provide alternatives that use data from either known population quantities or probability sampling estimates to deal with selection bias issues in non-probability samples. Each have their strengths and weaknesses relative to each other and to model-assisted LASSO. The MRP approach makes distributional assumptions that might improve efficiency, but might reduce robustness, and is non-trivial to implement in its fully Bayesian form. Quasi-randomization forfeits the link to a particular outcome variable, making the weights it develops general purpose but likely less effective, while sample matching requires intervention at the design stage to sample elements from the non-probability frame that match elements from the population, ala quota sampling. The decision to use model-assisted LASSO calibration should be made in the context of these tradeoffs.

Acknowledgements

The authors thank Professor Fred M. Feinberg and Associate Research Scientist Sunghee Lee for their helpful review and suggestions, as well as the Editor, Associate Editor, and three referees whose suggestions greatly contributed to improving this paper.

Appendix

Determining estimates for adaptive LASSO

In practice, we do not observe the theoretical rate of growth of λ n , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaWgaaWcbaGaamOBaaqaba GccaGGSaaaaa@357E@ which optimize model fit measures such as AIC or BIC, unless we have obtained many samples of the same population with various sample sizes. Given a sample, the choices of λ n MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaWgaaWcbaGaamOBaaqaba aaaa@34C4@ and γ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaHZoWzaaa@3398@ depend on the modeler. In R g l m n e t MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGNbGaamiBaiaad2gacaWGUbGaam yzaiaadshaaaa@3796@ implementation (Friedman et al., 2010), a range of λ n MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaWgaaWcbaGaamOBaaqaba aaaa@34C4@ is determined by the following scheme: 

  1. Set γ = 0. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaHZoWzcaaI9aGaaGimaiaac6caaa a@35CB@
  2. Determine λ n max MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaqhaaWcbaGaamOBaaqaai aab2gacaqGHbGaaeiEaaaaaaa@3794@ by finding the smallest λ n MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaWgaaWcbaGaamOBaaqaba aaaa@34C4@ that sets all coefficients to 0.
  3. If sample size n MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGUbaaaa@32E4@ is larger than the number of parameters in the regression model, set λ n min = 0 .0001 λ n max . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaqhaaWcbaGaamOBaaqaai aab2gacaqGPbGaaeOBaaaakiaai2dacaqGWaGaaeOlaiaabcdacaqG WaGaaeimaiaabgdacqaH7oaBdaqhaaWcbaGaamOBaaqaaiaab2gaca qGHbGaaeiEaaaakiaaygW7caGGUaaaaa@447D@ If sample size n MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGUbaaaa@32E4@ is smaller than the number of parameters, set λ n min = 0 .01 λ n max MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaqhaaWcbaGaamOBaaqaai aab2gacaqGPbGaaeOBaaaakiaai2dacaqGWaGaaeOlaiaabcdacaqG XaGaeq4UdW2aa0baaSqaaiaad6gaaeaacaqGTbGaaeyyaiaabIhaaa aaaa@40D1@ (to set parameters to 0 sooner).
  4. Generate a grid of λ n , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaWgaaWcbaGaamOBaaqaba GccaGGSaaaaa@357E@ typically 100 equally spaced points between λ n min MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaqhaaWcbaGaamOBaaqaai aab2gacaqGPbGaaeOBaaaaaaa@3792@ and λ n max . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaqhaaWcbaGaamOBaaqaai aab2gacaqGHbGaaeiEaaaakiaaygW7caGGUaaaaa@39DA@

The initial range of values of λ n MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaWgaaWcbaGaamOBaaqaba aaaa@34C4@ is determined independently of γ . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaHZoWzcaGGUaaaaa@344A@ Choices of γ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaHZoWzaaa@3398@ are less data-driven. Some modelers choose one of γ = 0 .1 , 0 .5 , 1, 2. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaHZoWzcaaI9aGaaeimaiaab6caca qGXaGaaGilaiaaysW7caqGWaGaaeOlaiaabwdacaaISaGaaGjbVlaa igdacaaISaGaaGjbVlaaikdacaGGUaaaaa@4185@ Here we determine ( λ n , γ ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaadaqadaqaaiabeU7aSnaaBaaaleaaca WGUbaabeaakiaaiYcacaaMe8Uaeq4SdCgacaGLOaGaayzkaaaaaa@3A41@ through cross-validation as follows: 

  1. Obtain α j = 1 / | β ^ j MLE | . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaHXoqydaWgaaWcbaGaamOAaaqaba GccqGH9aqpdaWcgaqaaiaaigdacaaMc8oabaGaaGPaVpaaemaabaGa aGPaVlqbek7aIzaajaWaa0baaSqaaiaadQgaaeaacaqGnbGaaeitai aabweaaaGccaaMc8oacaGLhWUaayjcSdaaaiaac6caaaa@45CA@
  2. Determine 100 equally spaced values of λ n MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaWgaaWcbaGaamOBaaqaba aaaa@34C4@ based on R g l m n e t MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGNbGaamiBaiaad2gacaWGUbGaam yzaiaadshaaaa@3796@ ’s implementation.
  3. For each pair ( λ n , γ ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaadaqadaqaaiabeU7aSnaaBaaaleaaca WGUbaabeaakiaaiYcacaaMe8Uaeq4SdCgacaGLOaGaayzkaaGaaiil aaaa@3AF1@ λ n MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH7oaBdaWgaaWcbaGaamOBaaqaba aaaa@34C4@ from Step 2, and γ = 0 .1 , 0 .5 , 1 , 2 , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaHZoWzcaaI9aGaaeimaiaab6caca qGXaGaaGilaiaaysW7caaMc8Uaaeimaiaab6cacaqG1aGaaGilaiaa ysW7caaMc8UaaeymaiaaiYcacaaMe8UaaGPaVlaabkdacaGGSaaaaa@4616@ split data into 5 folds. Use 4 folds to obtain β ^ . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWHYoGbaKaacaGIUaaaaa@33F9@
  4. Apply β ^ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWHYoGbaKaaaaa@333F@ to the last fold not used to estimate β ^ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWHYoGbaKaaaaa@333F@ and calculate a metric. For continuous y , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWH5bGaaiilaaaa@33A3@ we calculate the mean-absolute-error (MAE), i s A ( k ) | μ ^ i y i | . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaadaaeqaqabSqaaiaadMgacqGHiiIZca WGZbWaaSbaaWqaaiaadgeadaqadaqaaiaadUgaaiaawIcacaGLPaaa aeqaaaWcbeqdcqGHris5aOWaaqWaaeaacaaMc8UafqiVd0MbaKaada WgaaWcbaGaamyAaaqabaGccqGHsislcaWG5bWaaSbaaSqaaiaadMga aeqaaOGaaGPaVdGaay5bSlaawIa7aiaac6caaaa@47A3@ For binary y , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWH5bGaaiilaaaa@33A3@ we calculate the area under curve (AUC) (calculated through R g l m n e t : : a u c MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGNbGaamiBaiaad2gacaWGUbGaam yzaiaadshacaaI6aGaaGOoaiaadggacaWG1bGaam4yaaaa@3BE6@ function).
  5. Average the 5 metrics for each pair of ( λ n , γ ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaadaqadaqaaiabeU7aSnaaBaaaleaaca WGUbaabeaakiaaiYcacaaMe8Uaeq4SdCgacaGLOaGaayzkaaGaaiil aaaa@3AF1@ and choose the pair with the best average metric: minimum MAE for continuous y , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWH5bGaaiilaaaa@33A3@ maximum AUC for binary y . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWH5bGaaiOlaaaa@33A5@

The adaptive LASSO coefficient estimates are then obtained by solving equations (3.1) or (3.2) in Section 3.1 given the selected ( λ n , γ ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaadaqadaqaaiabeU7aSnaaBaaaleaaca WGUbaabeaakiaaiYcacaaMe8Uaeq4SdCgacaGLOaGaayzkaaGaaiOl aaaa@3AF3@ The R code used to perform cross-validation is provided in the on-line supplemental material.

Asymptotic unbiasedness and variance of model-assisted LASSO calibration estimator of a population total

Lemma 1: Assume the superpopulation model:

E ξ ( y k | x k ) = μ ( x k , β ) , V ξ ( y k | x k ) = ν k 2 σ 2 . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGfbWaaSbaaSqaaiabe67a4bqaba GcdaqadaqaamaaeiaabaGaamyEamaaBaaaleaacaWGRbaabeaakiaa ykW7aiaawIa7aiaaykW7caWH4bWaaSbaaSqaaiaadUgaaeqaaaGcca GLOaGaayzkaaGaaGjbVlaaykW7caaI9aGaaGjbVlaaykW7cqaH8oqB daqadaqaaiaahIhadaWgaaWcbaGaam4AaaqabaGccaaISaGaaGjbVl aahk7aaiaawIcacaGLPaaacaaISaGaamOvamaaBaaaleaacqaH+oaE aeqaaOWaaeWaaeaadaabcaqaaiaadMhadaWgaaWcbaGaam4Aaaqaba GccaaMc8oacaGLiWoacaaMc8UaaCiEamaaBaaaleaacaWGRbaabeaa aOGaayjkaiaawMcaaiaai2dacqaH9oGBdaqhaaWcbaGaam4Aaaqaai aaikdaaaGccqaHdpWCdaahaaWcbeqaaiaaikdaaaGccaGGUaaaaa@64E5@

Let B MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHcbaaaa@32BC@  be the finite-population quasilikelihood estimate of β , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHYoGaaOilaaaa@33E7@   B β . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHcbGaeyOKH4QaaCOSdiaak6caaa a@36A1@  Under conditions (1)-(5) in Section 3.2, the model-assisted asymptotic estimator of population total is:

T ^ y M C = i s A d i A ( y i μ i B M C ) + i = 1 N μ i B M C + o p ( N n ) ( A .1 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWGubGbaKaadaqhaaWcbaGaamyEaa qaaiaad2eacaWGdbaaaOGaaGypaiaaysW7caaMe8+aaabuaeqaleaa caWGPbGaeyicI4Saam4CamaaBaaameaacaWGbbaabeaaaSqab0Gaey yeIuoakiaaysW7caWGKbWaa0baaSqaaiaadMgaaeaacaWGbbaaaOWa aeWaaeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaOGaeyOeI0IaeqiVd0 2aaSbaaSqaaiaadMgaaeqaaOGaamOqamaaCaaaleqabaGaamytaiaa doeaaaaakiaawIcacaGLPaaacqGHRaWkdaaeWbqabSqaaiaadMgaca aI9aGaaGymaaqaaiaad6eaa0GaeyyeIuoakiaaysW7cqaH8oqBdaWg aaWcbaGaamyAaaqabaGccaWGcbWaaWbaaSqabeaacaWGnbGaam4qaa aakiabgUcaRiaad+gadaWgaaWcbaGaamiCaaqabaGcdaqadaqaamaa laaabaGaamOtaaqaamaakaaabaGaamOBaaWcbeaaaaaakiaawIcaca GLPaaacaaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaacIcacaqGbbGa aeOlaiaabgdacaGGPaaaaa@6DC8@

where

μ i = μ ( x i , B ) B M C = i = 1 N ( μ i μ ¯ ) ( y i y ¯ ) i = 1 N ( μ i μ ¯ ) 2 . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeGacaaabaGaeqiVd02aaSbaaS qaaiaadMgaaeqaaaGcbaGaaGypaiaaysW7caaMe8UaeqiVd02aaeWa aeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGilaiaaysW7caWHcb aacaGLOaGaayzkaaaabaGaamOqamaaCaaaleqabaGaamytaiaadoea aaaakeaacaaI9aGaaGjbVlaaysW7daWcaaqaamaaqadabaWaaeWaae aacqaH8oqBdaWgaaWcbaGaamyAaaqabaGccqGHsislcuaH8oqBgaqe aaGaayjkaiaawMcaamaabmaabaGaamyEamaaBaaaleaacaWGPbaabe aakiabgkHiTiqadMhagaqeaaGaayjkaiaawMcaaaWcbaGaamyAaiaa i2dacaaIXaaabaGaamOtaaqdcqGHris5aaGcbaWaaabmaeaadaqada qaaiabeY7aTnaaBaaaleaacaWGPbaabeaakiabgkHiTiqbeY7aTzaa raaacaGLOaGaayzkaaWaaWbaaSqabeaacaaIYaaaaaqaaiaadMgaca aI9aGaaGymaaqaaiaad6eaa0GaeyyeIuoaaaGccaGGUaaaaaaa@67CD@

Proof. The proof is adapted and expanded from the proof of Theorem 1 in Wu and Sitter (2001), with slight modifications in notations to be consistent with this paper. We begin by deriving the asymptotic model-assisted estimator for a population mean, y ¯ ^ MC = N 1 T ^ y MC MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWG5bGbaeHbaKaadaahaaWcbeqaai aab2eacaqGdbaaaOGaaGypaiaad6eadaahaaWcbeqaaiabgkHiTiaa igdaaaGcceWGubGbaKaadaqhaaWcbaGaamyEaaqaaiaab2eacaqGdb aaaaaa@3C06@  (see equation (2.7)). By conditions (2) and (3), the second order Taylor series expansion of μ ( x i , B ^ ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH8oqBdaqadaqaaiaahIhadaWgaa WcbaGaamyAaaqabaGccaaISaGaaGjbVlqahkeagaqcaaGaayjkaiaa wMcaaaaa@3A73@  around B MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHcbaaaa@32BC@  is:

μ( x i , B ^ )=μ( x i , B )+ { μ( x i ,t ) t | t=B } T ( B ^ B )+ ( B ^ B ) T { 2 μ( x i ,t ) t t T | t= B * }( B ^ B )(A.2) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH8oqBdaqadaqaaiaahIhadaWgaa WcbaGaamyAaaqabaGccaaISaGabCOqayaajaaacaGLOaGaayzkaaGa aGypaiabeY7aTnaabmaabaGaaCiEamaaBaaaleaacaWGPbaabeaaki aaiYcacaWHcbaacaGLOaGaayzkaaGaey4kaSYaaiWaaeaadaabcaqa amaalaaabaGaeyOaIyRaeqiVd02aaeWaaeaacaWH4bWaaSbaaSqaai aadMgaaeqaaOGaaGilaiaaysW7caWH0baacaGLOaGaayzkaaaabaGa eyOaIyRaaCiDaaaaaiaawIa7amaaBaaaleaacaWH0bGaaGypaiaahk eaaeqaaaGccaGL7bGaayzFaaWaaWbaaSqabeaacaWGubaaaOWaaeWa aeaaceWHcbGbaKaacqGHsislcaWHcbaacaGLOaGaayzkaaGaey4kaS YaaeWaaeaaceWHcbGbaKaacqGHsislcaWHcbaacaGLOaGaayzkaaWa aWbaaSqabeaacaWGubaaaOWaaiWaaeaadaWcaaqaaiabgkGi2oaaCa aaleqabaGaaGOmaaaakiabeY7aTnaabmaabaGaaCiEamaaBaaaleaa caWGPbaabeaakiaaiYcacaaMe8UaaCiDaaGaayjkaiaawMcaaaqaai abgkGi2kaahshacqGHciITcaWH0bWaaWbaaSqabeaacaWGubaaaaaa kiaaiYhadaWgaaWcbaGaaCiDaiaai2dacaWHcbWaaWbaaWqabeaaca GGQaaaaaWcbeaaaOGaay5Eaiaaw2haamaabmaabaGabCOqayaajaGa eyOeI0IaaCOqaaGaayjkaiaawMcaaiaaywW7caaMf8UaaGzbVlaacI cacaqGbbGaaeOlaiaabkdacaGGPaaaaa@83C5@

for B * ( B ^ , B ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHcbWaaWbaaSqabeaacaGGQaaaaO GaeyicI48aaeWaaeaaceWHcbGbaKaacaaISaGaaCOqaaGaayjkaiaa wMcaaaaa@390A@ or ( B , B ^ ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaadaqadaqaaiaahkeacaaISaGaaGjbVl qahkeagaqcaaGaayjkaiaawMcaaiaac6caaaa@3815@ Let

h ( x i , B ) = μ ( x i , t ) t | t = B k ( x i , Β * ) = 2 μ ( x i , t ) t t T | t = B * MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9w8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeGacaaabaGaaCiAamaabmaaba GaaCiEamaaBaaaleaacaWGPbaabeaakiaaiYcacaaMe8UaaCOqaaGa ayjkaiaawMcaaaqaaiaai2dadaWcaaqaaiabgkGi2kabeY7aTnaabm aabaGaaCiEamaaBaaaleaacaWGPbaabeaakiaaiYcacaaMe8UaaCiD aaGaayjkaiaawMcaaaqaaiabgkGi2kaahshaaaGaaGiFamaaBaaale aacaWH0bGaaGypaiaahkeaaeqaaaGcbaGaaC4AamaabmaabaGaaCiE amaaBaaaleaacaWGPbaabeaakiaaiYcacaaMe8UaaCOKdmaaCaaale qabaGaaiOkaaaaaOGaayjkaiaawMcaaaqaaiaai2dadaabcaqaamaa laaabaGaeyOaIy7aaWbaaSqabeaacaaIYaaaaOGaeqiVd02aaeWaae aacaWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGilaiaaysW7caWH0baa caGLOaGaayzkaaaabaGaeyOaIyRaaCiDaiabgkGi2kaahshadaahaa WcbeqaaiaadsfaaaaaaaGccaGLiWoadaWgaaWcbaGaaCiDaiaai2da caWHcbWaaWbaaWqabeaacaGGQaaaaaWcbeaaaaaaaa@6A83@

Note that h MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHObaaaa@32E2@  is a vector of length m MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGTbaaaa@32E3@  and k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHRbaaaa@32E5@  is a matrix of size m × m , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGTbGaey41aqRaamyBaiaacYcaaa a@369C@  where m MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGTbaaaa@32E3@  is the number of parameters in β . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHYoGaaOOlaaaa@33E9@  By conditions (2) and (3),

max i | h ( x i , B ) | h ( x i , B ) ( A .3 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaqGTbGaaeyyaiaabIhadaWgaaWcba GaamyAaaqabaGcdaabdaqaaiaaykW7caWHObWaaeWaaeaacaWH4bWa aSbaaSqaaiaadMgaaeqaaOGaaGilaiaaysW7caWHcbaacaGLOaGaay zkaaGaaGPaVdGaay5bSlaawIa7aiaaysW7caaMe8UaeyizImQaaGjb VlaaysW7caWGObWaaeWaaeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaO GaaGilaiaaysW7caWHcbaacaGLOaGaayzkaaGaaGzbVlaaywW7caaM f8UaaGzbVlaaywW7caGGOaGaaeyqaiaab6cacaqGZaGaaiykaaaa@5EA4@

max k , j | k ( x i , B * ) | k ( x i , B * ) . ( A .4 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaqGTbGaaeyyaiaabIhadaWgaaWcba Gaam4AaiaaygW7caaISaGaaGjbVlaadQgaaeqaaOWaaqWaaeaacaaM c8UaaC4AaiaaiIcacaWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGilai aaysW7caWHcbWaaWbaaSqabeaacaGGQaaaaOGaaGykaiaaykW7aiaa wEa7caGLiWoacaaMe8UaaGjbVlabgsMiJkaaysW7caaMe8Uaam4Aam aabmaabaGaaCiEamaaBaaaleaacaWGPbaabeaakiaaiYcacaaMe8Ua aCOqamaaCaaaleqabaGaaiOkaaaaaOGaayjkaiaawMcaaiaac6caca aMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaacIcacaqGbbGaaeOlaiaa bsdacaGGPaaaaa@65C1@

Conditions (1) and (3) imply that

μ ( x i , B ^ ) = μ ( x i , B ) + O p ( 1 / n ) ( A .5 ) μ i + O p ( 1 / n ) . ( A .6 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9W8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeGadaaabaGaeqiVd02aaeWaae aacaWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGilaiaaysW7ceWHcbGb aKaaaiaawIcacaGLPaaaaeaacaaI9aGaeqiVd02aaeWaaeaacaWH4b WaaSbaaSqaaiaadMgaaeqaaOGaaGilaiaaysW7caWHcbaacaGLOaGa ayzkaaGaey4kaSIaam4tamaaBaaaleaacaWGWbaabeaakmaabmaaba WaaSGbaeaacaaIXaaabaWaaOaaaeaacaWGUbaaleqaaaaaaOGaayjk aiaawMcaaaqaaiaaywW7caaMf8UaaGzbVlaaywW7caGGOaGaaeyqai aab6cacaqG1aGaaiykaaqaaaqaaiabggMi6kabeY7aTnaaBaaaleaa caWGPbaabeaakiabgUcaRiaad+eadaWgaaWcbaGaamiCaaqabaGcda qadaqaamaalyaabaGaaGymaaqaamaakaaabaGaamOBaaWcbeaaaaaa kiaawIcacaGLPaaacaGGUaaabaGaaGzbVlaaywW7caaMf8UaaGzbVl aacIcacaqGbbGaaeOlaiaabAdacaGGPaaaaaaa@694D@

By equation (2.2) in Section 2.1 and the boundedness conditions of (2) and (3) in Section 3.2.2 imply

N 1 i s A d i A μ ( x i , B ^ ) = N 1 i s A d i A μ ( x i , B ) + N 1 ( i s A d i A h ( x i , B ) ) T ( B ^ B ) + ( B ^ B ) T N 1 ( i s A d i A k ( x i , B * ) ) ( B ^ B ) = N 1 i s A d i A μ ( x i , B ) + N 1 ( i s A d i A h ( x i , B ) ) T ( B ^ B ) + O p ( 1 n ) . ( A .7 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeWacaaabaGaamOtamaaCaaale qabaGaeyOeI0IaaGymaaaakmaaqafabeWcbaGaamyAaiabgIGiolaa dohadaWgaaadbaGaamyqaaqabaaaleqaniabggHiLdGccaaMc8Uaam izamaaDaaaleaacaWGPbaabaGaamyqaaaakiabeY7aTnaabmaabaGa aCiEamaaBaaaleaacaWGPbaabeaakiaaiYcacaaMe8UabCOqayaaja aacaGLOaGaayzkaaaabaGaaGypaiaad6eadaahaaWcbeqaaiabgkHi TiaaigdaaaGcdaaeqbqabSqaaiaadMgacqGHiiIZcaWGZbWaaSbaaW qaaiaadgeaaeqaaaWcbeqdcqGHris5aOGaaGPaVlaadsgadaqhaaWc baGaamyAaaqaaiaadgeaaaGccqaH8oqBdaqadaqaaiaahIhadaWgaa WcbaGaamyAaaqabaGccaaISaGaaGjbVlaahkeaaiaawIcacaGLPaaa cqGHRaWkcaWGobWaaWbaaSqabeaacqGHsislcaaIXaaaaOWaaeWaae aadaaeqbqabSqaaiaadMgacqGHiiIZcaWGZbWaaSbaaWqaaiaadgea aeqaaaWcbeqdcqGHris5aOGaaGPaVlaadsgadaqhaaWcbaGaamyAaa qaaiaadgeaaaGccaWHObWaaeWaaeaacaWH4bWaaSbaaSqaaiaadMga aeqaaOGaaGilaiaaysW7caWHcbaacaGLOaGaayzkaaaacaGLOaGaay zkaaWaaWbaaSqabeaacaWGubaaaOWaaeWaaeaaceWHcbGbaKaacqGH sislcaaMe8UaaCOqaaGaayjkaiaawMcaaaqaaaqaaiaaykW7caaMc8 Uaey4kaSIaaGjbVlaaysW7daqadaqaaiqahkeagaqcaiabgkHiTiaa hkeaaiaawIcacaGLPaaadaahaaWcbeqaaiaadsfaaaGccaWGobWaaW baaSqabeaacqGHsislcaaIXaaaaOWaaeWaaeaadaaeqbqabSqaaiaa dMgacqGHiiIZcaWGZbWaaSbaaWqaaiaadgeaaeqaaaWcbeqdcqGHri s5aOGaaGPaVlaadsgadaqhaaWcbaGaamyAaaqaaiaadgeaaaGccaWH RbWaaeWaaeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGilaiaays W7caWHcbWaaWbaaSqabeaacaGGQaaaaaGccaGLOaGaayzkaaaacaGL OaGaayzkaaWaaeWaaeaaceWHcbGbaKaacqGHsislcaWHcbaacaGLOa GaayzkaaaabaaabaGaaGypaiaad6eadaahaaWcbeqaaiabgkHiTiaa igdaaaGcdaaeqbqabSqaaiaadMgacqGHiiIZcaWGZbWaaSbaaWqaai aadgeaaeqaaaWcbeqdcqGHris5aOGaaGPaVlaadsgadaqhaaWcbaGa amyAaaqaaiaadgeaaaGccqaH8oqBdaqadaqaaiaahIhadaWgaaWcba GaamyAaaqabaGccaaISaGaaGjbVlaahkeaaiaawIcacaGLPaaacqGH RaWkcaWGobWaaWbaaSqabeaacqGHsislcaaIXaaaaOWaaeWaaeaada aeqbqabSqaaiaadMgacqGHiiIZcaWGZbWaaSbaaWqaaiaadgeaaeqa aaWcbeqdcqGHris5aOGaaGPaVlaadsgadaqhaaWcbaGaamyAaaqaai aadgeaaaGccaWHObWaaeWaaeaacaWH4bWaaSbaaSqaaiaadMgaaeqa aOGaaGilaiaaysW7caWHcbaacaGLOaGaayzkaaaacaGLOaGaayzkaa WaaWbaaSqabeaacaWGubaaaOWaaeWaaeaaceWHcbGbaKaacqGHsisl caaMe8UaaCOqaaGaayjkaiaawMcaaiabgUcaRiaad+eadaWgaaWcba GaamiCaaqabaGcdaqadaqaamaalaaabaGaaGymaaqaaiaad6gaaaaa caGLOaGaayzkaaGaaiOlaiaaywW7caaMf8UaaGzbVlaacIcacaqGbb GaaeOlaiaabEdacaGGPaaaaaaa@EA06@

By conditions (1), (4), and equation (A.7):

N 1 k = 1 N μ ( x k , B ^ ) N 1 i s A d i A μ ( x i , B ^ ) = N 1 k = 1 N μ ( x i , B ) N 1 i s A d i A μ ( x i , B ) + O p ( 1 n ) . ( A .8 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8qrpq0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeGacaaabaGaamOtamaaCaaale qabaGaeyOeI0IaaGymaaaakmaaqahabeWcbaGaam4Aaiaai2dacaaI XaaabaGaamOtaaqdcqGHris5aOGaaGPaVlabeY7aTnaabmaabaGaaC iEamaaBaaaleaacaWGRbaabeaakiaaiYcacaaMe8UabCOqayaajaaa caGLOaGaayzkaaaabaGaeyOeI0IaaGjbVlaaykW7caWGobWaaWbaaS qabeaacqGHsislcaaIXaaaaOWaaabuaeqaleaacaWGPbGaeyicI4Sa am4CamaaBaaameaacaWGbbaabeaaaSqab0GaeyyeIuoakiaaykW7ca WGKbWaa0baaSqaaiaadMgaaeaacaWGbbaaaOGaeqiVd02aaeWaaeaa caWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGilaiaaysW7ceWHcbGbaK aaaiaawIcacaGLPaaaaeaaaeaacaaI9aGaaGPaVlaaysW7caWGobWa aWbaaSqabeaacqGHsislcaaIXaaaaOWaaabCaeqaleaacaWGRbGaaG ypaiaaigdaaeaacaWGobaaniabggHiLdGccaaMc8UaeqiVd02aaeWa aeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGilaiaaysW7caWHcb aacaGLOaGaayzkaaGaeyOeI0IaamOtamaaCaaaleqabaGaeyOeI0Ia aGymaaaakmaaqafabeWcbaGaamyAaiabgIGiolaadohadaWgaaadba GaamyqaaqabaaaleqaniabggHiLdGccaaMc8UaamizamaaDaaaleaa caWGPbaabaGaamyqaaaakiabeY7aTnaabmaabaGaaCiEamaaBaaale aacaWGPbaabeaakiaaiYcacaaMe8UaaCOqaaGaayjkaiaawMcaaiab gUcaRiaad+eadaWgaaWcbaGaamiCaaqabaGcdaqadaqaamaalaaaba GaaGymaaqaamaakaaabaGaamOBaaWcbeaaaaaakiaawIcacaGLPaaa caGGUaGaaGzbVlaaywW7caaMf8UaaGzbVlaacIcacaqGbbGaaeOlai aabIdacaGGPaaaaaaa@9C14@

Using conditions (1) and (3),

μ ^ ¯ = i s A d i A μ ( x i , B ^ ) / i s A d i A = ( i s A d i A ) 1 i s A d i A ( μ ( x i , B ) + h T ( x i , B ) ( B ^ B ) + ( B ^ B ) T k ( x i , B * ) ( B ^ B ) ) = ( i s A d i A ) 1 i s A d i A ( μ ( x i , B ) + h T ( x i , B ) ( B ^ B ) ) + O p ( 1 / n ) = μ ¯ + ( i s A d i A ) 1 i s A d i A h T ( x i , B ) ( B ^ B ) + O p ( 1 / n ) ( b y c o n d i t i o n ( 1 ) a n d ( 18 ) ) = μ ¯ + O p ( 1 / n ) + O p ( 1 / n ) = μ ¯ + O p ( 1 / n ) ( A .9 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeWbcaaaaeaacuaH8oqBgaqcga qeaaqaaiaai2dadaWcgaqaamaaqafabeWcbaGaamyAaiabgIGiolaa dohadaWgaaadbaGaamyqaaqabaaaleqaniabggHiLdGccaaMc8Uaam izamaaDaaaleaacaWGPbaabaGaamyqaaaakiabeY7aTnaabmaabaGa aCiEamaaBaaaleaacaWGPbaabeaakiaaiYcacaaMe8UabCOqayaaja aacaGLOaGaayzkaaaabaWaaabuaeqaleaacaWGPbGaeyicI4Saam4C amaaBaaameaacaWGbbaabeaaaSqab0GaeyyeIuoakiaaykW7caWGKb Waa0baaSqaaiaadMgaaeaacaWGbbaaaaaaaOqaaaqaaiaai2dadaqa daqaamaaqafabeWcbaGaamyAaiabgIGiolaadohadaWgaaadbaGaam yqaaqabaaaleqaniabggHiLdGccaaMc8UaamizamaaDaaaleaacaWG PbaabaGaamyqaaaaaOGaayjkaiaawMcaamaaCaaaleqabaGaeyOeI0 IaaGymaaaakmaaqafabeWcbaGaamyAaiabgIGiolaadohadaWgaaad baGaamyqaaqabaaaleqaniabggHiLdGccaaMc8UaamizamaaDaaale aacaWGPbaabaGaamyqaaaakmaabmaabaGaeqiVd02aaeWaaeaacaWH 4bWaaSbaaSqaaiaadMgaaeqaaOGaaGilaiaaysW7caWHcbaacaGLOa GaayzkaaGaey4kaSIaaCiAamaaCaaaleqabaGaamivaaaakmaabmaa baGaaCiEamaaBaaaleaacaWGPbaabeaakiaaiYcacaaMe8UaaCOqaa GaayjkaiaawMcaamaabmaabaGabCOqayaajaGaeyOeI0IaaCOqaaGa ayjkaiaawMcaaiabgUcaRmaabmaabaGabCOqayaajaGaeyOeI0IaaC OqaaGaayjkaiaawMcaamaaCaaaleqabaGaamivaaaakiaahUgadaqa daqaaiaahIhadaWgaaWcbaGaamyAaaqabaGccaaISaGaaGjbVlaahk eadaahaaWcbeqaaiaacQcaaaaakiaawIcacaGLPaaadaqadaqaaiqa hkeagaqcaiabgkHiTiaahkeaaiaawIcacaGLPaaaaiaawIcacaGLPa aaaeaaaeaacaaI9aWaaeWaaeaadaaeqbqabSqaaiaadMgacqGHiiIZ caWGZbWaaSbaaWqaaiaadgeaaeqaaaWcbeqdcqGHris5aOGaaGPaVl aadsgadaqhaaWcbaGaamyAaaqaaiaadgeaaaaakiaawIcacaGLPaaa daahaaWcbeqaaiabgkHiTiaaigdaaaGcdaaeqbqabSqaaiaadMgacq GHiiIZcaWGZbWaaSbaaWqaaiaadgeaaeqaaaWcbeqdcqGHris5aOGa aGPaVlaadsgadaqhaaWcbaGaamyAaaqaaiaadgeaaaGcdaqadaqaai abeY7aTnaabmaabaGaaCiEamaaBaaaleaacaWGPbaabeaakiaaiYca caaMe8UaaCOqaaGaayjkaiaawMcaaiabgUcaRiaahIgadaahaaWcbe qaaiaadsfaaaGcdaqadaqaaiaahIhadaWgaaWcbaGaamyAaaqabaGc caaISaGaaGjbVlaahkeaaiaawIcacaGLPaaadaqadaqaaiqahkeaga qcaiabgkHiTiaahkeaaiaawIcacaGLPaaaaiaawIcacaGLPaaacqGH RaWkcaWGpbWaaSbaaSqaaiaadchaaeqaaOWaaeWaaeaadaWcgaqaai aaigdaaeaacaWGUbaaaaGaayjkaiaawMcaaaqaaaqaaiaai2dacuaH 8oqBgaqeaiabgUcaRmaabmaabaWaaabuaeqaleaacaWGPbGaeyicI4 Saam4CamaaBaaameaacaWGbbaabeaaaSqab0GaeyyeIuoakiaaykW7 caWGKbWaa0baaSqaaiaadMgaaeaacaWGbbaaaaGccaGLOaGaayzkaa WaaWbaaSqabeaacqGHsislcaaIXaaaaOWaaabuaeqaleaacaWGPbGa eyicI4Saam4CamaaBaaameaacaWGbbaabeaaaSqab0GaeyyeIuoaki aaykW7caWGKbWaa0baaSqaaiaadMgaaeaacaWGbbaaaOGaaCiAamaa CaaaleqabaGaamivaaaakmaabmaabaGaaCiEamaaBaaaleaacaWGPb aabeaakiaaiYcacaaMe8UaaCOqaaGaayjkaiaawMcaamaabmaabaGa bCOqayaajaGaeyOeI0IaaCOqaaGaayjkaiaawMcaaiabgUcaRiaad+ eadaWgaaWcbaGaamiCaaqabaGcdaqadaqaamaalyaabaGaaGymaaqa aiaad6gaaaaacaGLOaGaayzkaaaabaaabaWaaeWaaeaacaWGIbGaam yEaiaaysW7caWGJbGaam4Baiaad6gacaWGKbGaamyAaiaadshacaWG PbGaam4Baiaad6gacaaMe8+aaeWaaeaacaaIXaaacaGLOaGaayzkaa GaaGjbVlaadggacaWGUbGaamizaiaaysW7daqadaqaaiaaigdacaaI 4aaacaGLOaGaayzkaaaacaGLOaGaayzkaaaabaaabaGaaGypaiqbeY 7aTzaaraGaey4kaSIaam4tamaaBaaaleaacaWGWbaabeaakmaabmaa baWaaSGbaeaacaaIXaaabaWaaOaaaeaacaWGUbaaleqaaaaaaOGaay jkaiaawMcaaiabgUcaRiaad+eadaWgaaWcbaGaamiCaaqabaGcdaqa daqaamaalyaabaGaaGymaaqaaiaad6gaaaaacaGLOaGaayzkaaaaba aabaGaaGypaiqbeY7aTzaaraGaey4kaSIaam4tamaaBaaaleaacaWG WbaabeaakmaabmaabaWaaSGbaeaacaaIXaaabaWaaOaaaeaacaWGUb aaleqaaaaaaOGaayjkaiaawMcaaiaaywW7caaMf8UaaGzbVlaaywW7 caaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7caaMf8UaaGzbVl aaywW7caaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7caaMf8Ua aGzbVlaaywW7caaMf8UaaGzbVlaacIcacaqGbbGaaeOlaiaabMdaca GGPaaaaaaa@5A89@

for μ ¯ = i s A d i A μ i / i s A d i A . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacuaH8oqBgaqeaiaai2dadaWcgaqaam aaqababeWcbaGaamyAaiabgIGiolaadohadaWgaaadbaGaamyqaaqa baaaleqaniabggHiLdGccaaMe8UaamizamaaDaaaleaacaWGPbaaba GaamyqaaaakiabeY7aTnaaBaaaleaacaWGPbaabeaaaOqaamaaqaba beWcbaGaamyAaiabgIGiolaadohadaWgaaadbaGaamyqaaqabaaale qaniabggHiLdGccaWGKbWaa0baaSqaaiaadMgaaeaacaWGbbaaaaaa kiaaygW7caGGUaaaaa@4D92@

Then from (A.2) and (A.9) and using conditions (1)-(3), we have

N 1 i s A d i A ( μ ^ i μ ¯ ^ ) = N 1 i s A d i A ( μ ( x i , B ) + h T ( x i , B ) ( B ^ B ) + ( B ^ B ) T k ( x i , B * ) ( B ^ B ) μ ¯ ) = N 1 i s A d i A ( μ i μ ¯ ) + N 1 i s A d i A h T ( x i , B ) ( B ^ B ) + N 1 i s A d i A ( B ^ B ) T k ( x i , B * ) ( B ^ B ) O p ( 1 / n ) = N 1 i s A d i A ( μ i μ ¯ ) + O p ( 1 / n ) + O p ( 1 / n ) O p ( 1 / n ) = N 1 i s A d i A ( μ i μ ¯ ) + O p ( 1 / n ) . ( A .10 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9w8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeqbcaaaaeaacaWGobWaaWbaaS qabeaacqGHsislcaaIXaaaaOWaaabuaeqaleaacaWGPbGaeyicI4Sa am4CamaaBaaameaacaWGbbaabeaaaSqab0GaeyyeIuoakiaaykW7ca WGKbWaa0baaSqaaiaadMgaaeaacaWGbbaaaOWaaeWaaeaacuaH8oqB gaqcamaaBaaaleaacaWGPbaabeaakiabgkHiTiqbeY7aTzaaryaaja aacaGLOaGaayzkaaaabaGaaGypaiaad6eadaahaaWcbeqaaiabgkHi TiaaigdaaaGcdaaeqbqabSqaaiaadMgacqGHiiIZcaWGZbWaaSbaaW qaaiaadgeaaeqaaaWcbeqdcqGHris5aOGaaGPaVlaadsgadaqhaaWc baGaamyAaaqaaiaadgeaaaGcdaqadaqaaiabeY7aTnaabmaabaGaaC iEamaaBaaaleaacaWGPbaabeaakiaaiYcacaWHcbaacaGLOaGaayzk aaGaey4kaSIaaCiAamaaCaaaleqabaGaamivaaaakmaabmaabaGaaC iEamaaBaaaleaacaWGPbaabeaakiaaiYcacaWHcbaacaGLOaGaayzk aaWaaeWaaeaaceWHcbGbaKaacqGHsislcaWHcbaacaGLOaGaayzkaa Gaey4kaSYaaeWaaeaaceWHcbGbaKaacqGHsislcaWHcbaacaGLOaGa ayzkaaWaaWbaaSqabeaacaWGubaaaOGaaC4AamaabmaabaGaaCiEam aaBaaaleaacaWGPbaabeaakiaaiYcacaWHcbWaaWbaaSqabeaacaGG QaaaaaGccaGLOaGaayzkaaWaaeWaaeaaceWHcbGbaKaacqGHsislca WHcbaacaGLOaGaayzkaaGaeyOeI0IafqiVd0MbaebaaiaawIcacaGL PaaaaeaaaeaacaaI9aGaamOtamaaCaaaleqabaGaeyOeI0IaaGymaa aakmaaqafabeWcbaGaamyAaiabgIGiolaadohadaWgaaadbaGaamyq aaqabaaaleqaniabggHiLdGccaaMc8UaamizamaaDaaaleaacaWGPb aabaGaamyqaaaakmaabmaabaGaeqiVd02aaSbaaSqaaiaadMgaaeqa aOGaeyOeI0IafqiVd0MbaebaaiaawIcacaGLPaaacqGHRaWkcaWGob WaaWbaaSqabeaacqGHsislcaaIXaaaaOWaaabuaeqaleaacaWGPbGa eyicI4Saam4CamaaBaaameaacaWGbbaabeaaaSqab0GaeyyeIuoaki aaykW7caWGKbWaa0baaSqaaiaadMgaaeaacaWGbbaaaOGaaCiAamaa CaaaleqabaGaamivaaaakmaabmaabaGaaCiEamaaBaaaleaacaWGPb aabeaakiaaiYcacaWHcbaacaGLOaGaayzkaaWaaeWaaeaaceWHcbGb aKaacqGHsislcaWHcbaacaGLOaGaayzkaaaabaaabaGaaGPaVlaayk W7cqGHRaWkcaaMc8UaaGPaVlaad6eadaahaaWcbeqaaiabgkHiTiaa igdaaaGcdaaeqbqabSqaaiaadMgacqGHiiIZcaWGZbWaaSbaaWqaai aadgeaaeqaaaWcbeqdcqGHris5aOGaaGPaVlaadsgadaqhaaWcbaGa amyAaaqaaiaadgeaaaGcdaqadaqaaiqahkeagaqcaiabgkHiTiaahk eaaiaawIcacaGLPaaadaahaaWcbeqaaiaadsfaaaGccaWHRbWaaeWa aeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGilaiaahkeadaahaa WcbeqaaiaacQcaaaaakiaawIcacaGLPaaadaqadaqaaiqahkeagaqc aiabgkHiTiaahkeaaiaawIcacaGLPaaacqGHsislcaWGpbWaaSbaaS qaaiaadchaaeqaaOWaaeWaaeaadaWcgaqaaiaaigdaaeaadaGcaaqa aiaad6gaaSqabaaaaaGccaGLOaGaayzkaaaabaaabaGaaGypaiaad6 eadaahaaWcbeqaaiabgkHiTiaaigdaaaGcdaaeqbqabSqaaiaadMga cqGHiiIZcaWGZbWaaSbaaWqaaiaadgeaaeqaaaWcbeqdcqGHris5aO GaaGPaVlaadsgadaqhaaWcbaGaamyAaaqaaiaadgeaaaGcdaqadaqa aiabeY7aTnaaBaaaleaacaWGPbaabeaakiabgkHiTiqbeY7aTzaara aacaGLOaGaayzkaaGaey4kaSIaam4tamaaBaaaleaacaWGWbaabeaa kmaabmaabaWaaSGbaeaacaaIXaaabaWaaOaaaeaacaWGUbaaleqaaa aaaOGaayjkaiaawMcaaiabgUcaRiaad+eadaWgaaWcbaGaamiCaaqa baGcdaqadaqaamaalyaabaGaaGymaaqaaiaad6gaaaaacaGLOaGaay zkaaGaeyOeI0Iaam4tamaaBaaaleaacaWGWbaabeaakmaabmaabaWa aSGbaeaacaaIXaaabaWaaOaaaeaacaWGUbaaleqaaaaaaOGaayjkai aawMcaaaqaaaqaaiaai2dacaWGobWaaWbaaSqabeaacqGHsislcaaI XaaaaOWaaabuaeqaleaacaWGPbGaeyicI4Saam4CamaaBaaameaaca WGbbaabeaaaSqab0GaeyyeIuoakiaaykW7caWGKbWaa0baaSqaaiaa dMgaaeaacaWGbbaaaOWaaeWaaeaacqaH8oqBdaWgaaWcbaGaamyAaa qabaGccqGHsislcuaH8oqBgaqeaaGaayjkaiaawMcaaiabgUcaRiaa d+eadaWgaaWcbaGaamiCaaqabaGcdaqadaqaamaalyaabaGaaGymaa qaamaakaaabaGaamOBaaWcbeaaaaaakiaawIcacaGLPaaacaGGUaGa aGzbVlaaywW7caaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7ca aMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaacIcacaqGbbGaaeOlaiaa bgdacaqGWaGaaiykaaaaaaa@360C@

Similarly,

N 1 i s A d i A ( μ ^ i μ ^ ¯ ) 2 = N 1 i s A d i A ( μ i μ ¯ ) 2 + O p ( 1 / n ) . ( A .11 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGobWaaWbaaSqabeaacqGHsislca aIXaaaaOWaaabuaeqaleaacaWGPbGaeyicI4Saam4CamaaBaaameaa caWGbbaabeaaaSqab0GaeyyeIuoakiaaykW7caWGKbWaa0baaSqaai aadMgaaeaacaWGbbaaaOWaaeWaaeaacuaH8oqBgaqcamaaBaaaleaa caWGPbaabeaakiabgkHiTiqbeY7aTzaajyaaraaacaGLOaGaayzkaa WaaWbaaSqabeaacaaIYaaaaOGaaGypaiaad6eadaahaaWcbeqaaiab gkHiTiaaigdaaaGcdaaeqbqabSqaaiaadMgacqGHiiIZcaWGZbWaaS baaWqaaiaadgeaaeqaaaWcbeqdcqGHris5aOGaaGPaVlaadsgadaqh aaWcbaGaamyAaaqaaiaadgeaaaGcdaqadaqaaiabeY7aTnaaBaaale aacaWGPbaabeaakiabgkHiTiqbeY7aTzaaraaacaGLOaGaayzkaaWa aWbaaSqabeaacaaIYaaaaOGaey4kaSIaam4tamaaBaaaleaacaWGWb aabeaakmaabmaabaWaaSGbaeaacaaIXaaabaGaamOBaaaaaiaawIca caGLPaaacaGGUaGaaGzbVlaaywW7caaMf8UaaGzbVlaacIcacaqGbb GaaeOlaiaabgdacaqGXaGaaiykaaaa@6F94@

From (A.10) and (A.11) we have:

B ^ M C = i s A d i A ( μ ^ i μ ¯ ^ ) ( y i y ¯ ) i s A d i A ( μ ^ i μ ¯ ^ ) 2 = N 1 i s A d i A ( μ ^ i μ ¯ ^ ) ( y i y ¯ ) N 1 i s A d i A ( μ ^ i μ ¯ ^ ) 2 = i s A d i A ( μ i μ ¯ ) ( y i y ¯ ) + O p ( 1/ n ) i s A d i A ( μ i μ ¯ ) 2 + O p ( 1/ n ) B M C a s n . ( A .12 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9w8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeWacaaabaGabmOqayaajaWaaW baaSqabeaacaWGnbGaam4qaaaaaOqaaiaai2dadaWcaaqaamaaqaba baGaamizamaaDaaaleaacaWGPbaabaGaamyqaaaakmaabmaabaGafq iVd0MbaKaadaWgaaWcbaGaamyAaaqabaGccqGHsislcuaH8oqBgaqe gaqcaaGaayjkaiaawMcaamaabmaabaGaamyEamaaBaaaleaacaWGPb aabeaakiabgkHiTiqadMhagaqeaaGaayjkaiaawMcaaaWcbaGaamyA aiabgIGiolaadohadaWgaaadbaGaamyqaaqabaaaleqaniabggHiLd aakeaadaaeqaqaaiaadsgadaqhaaWcbaGaamyAaaqaaiaadgeaaaGc daqadaqaaiqbeY7aTzaajaWaaSbaaSqaaiaadMgaaeqaaOGaeyOeI0 IafqiVd0MbaeHbaKaaaiaawIcacaGLPaaadaahaaWcbeqaaiaaikda aaaabaGaamyAaiabgIGiolaadohadaWgaaadbaGaamyqaaqabaaale qaniabggHiLdaaaOGaaGypamaalaaabaGaamOtamaaCaaaleqabaGa eyOeI0IaaGymaaaakmaaqababaGaamizamaaDaaaleaacaWGPbaaba GaamyqaaaakmaabmaabaGafqiVd0MbaKaadaWgaaWcbaGaamyAaaqa baGccqGHsislcuaH8oqBgaqegaqcaaGaayjkaiaawMcaamaabmaaba GaamyEamaaBaaaleaacaWGPbaabeaakiabgkHiTiqadMhagaqeaaGa ayjkaiaawMcaaaWcbaGaamyAaiabgIGiolaadohadaWgaaadbaGaam yqaaqabaaaleqaniabggHiLdaakeaacaWGobWaaWbaaSqabeaacqGH sislcaaIXaaaaOWaaabeaeaacaWGKbWaa0baaSqaaiaadMgaaeaaca WGbbaaaOWaaeWaaeaacuaH8oqBgaqcamaaBaaaleaacaWGPbaabeaa kiabgkHiTiqbeY7aTzaaryaajaaacaGLOaGaayzkaaWaaWbaaSqabe aacaaIYaaaaaqaaiaadMgacqGHiiIZcaWGZbWaaSbaaWqaaiaadgea aeqaaaWcbeqdcqGHris5aaaaaOqaaaqaaiaai2dadaWcaaqaamaaqa babaGaamizamaaDaaaleaacaWGPbaabaGaamyqaaaakmaabmaabaGa eqiVd02aaSbaaSqaaiaadMgaaeqaaOGaeyOeI0IafqiVd0Mbaebaai aawIcacaGLPaaadaqadaqaaiaadMhadaWgaaWcbaGaamyAaaqabaGc cqGHsislceWG5bGbaebaaiaawIcacaGLPaaacqGHRaWkcaWGpbWaaS baaSqaaiaadchaaeqaaOWaaeWaaeaadaWcaaqaaiaaigdaaeaadaGc aaqaaiaad6gaaSqabaaaaaGccaGLOaGaayzkaaaaleaacaWGPbGaey icI4Saam4CamaaBaaameaacaWGbbaabeaaaSqab0GaeyyeIuoaaOqa amaaqababaGaamizamaaDaaaleaacaWGPbaabaGaamyqaaaakmaabm aabaGaeqiVd02aaSbaaSqaaiaadMgaaeqaaOGaeyOeI0IafqiVd0Mb aebaaiaawIcacaGLPaaadaahaaWcbeqaaiaaikdaaaGccqGHRaWkca WGpbWaaSbaaSqaaiaadchaaeqaaOWaaeWaaeaadaWcaaqaaiaaigda aeaacaWGUbaaaaGaayjkaiaawMcaaaWcbaGaamyAaiabgIGiolaado hadaWgaaadbaGaamyqaaqabaaaleqaniabggHiLdaaaaGcbaaabaGa eyOKH4QaamOqamaaCaaaleqabaGaamytaiaadoeaaaGccaaMf8UaaG jcVlaadggacaWGZbGaaGjbVlaad6gacaaMe8UaeyOKH4QaaGjbVlab g6HiLkaac6cacaaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7ca aMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaa ywW7caaMf8UaaiikaiaabgeacaqGUaGaaeymaiaabkdacaGGPaaaaa aa@EF28@

Thus B ^ M C = B M C + o p ( 1 ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWGcbGbaKaadaahaaWcbeqaaiaad2 eacaWGdbaaaOGaaGypaiaadkeadaahaaWcbeqaaiaad2eacaWGdbaa aOGaey4kaSIaam4BamaaBaaaleaacaWGWbaabeaakmaabmaabaGaaG ymaaGaayjkaiaawMcaaiaacYcaaaa@3DED@ and we have:

y ¯ ^ M C = N 1 T ^ y M C = N 1 d A y + ( N 1 k = 1 N μ ( x k , B ^ ) + i s A N 1 d i A μ ( x i , B ^ ) ) B ^ M C = N 1 d A y + ( N 1 k = 1 N μ ( x k , B ) N 1 i s A d i A μ ( x i , B ) + O p ( 1 n ) ) ( B M C + o p ( 1 ) ) = N 1 d A y + ( N 1 k = 1 N μ ( x k , B ) N 1 i s A d i A μ ( x i , B ) ) B M C + o p ( 1 n ) . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9w8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeabcaaaaeaaceWG5bGbaeHbaK aadaahaaWcbeqaaiaad2eacaWGdbaaaaGcbaGaaGypaiaad6eadaah aaWcbeqaaiabgkHiTiaaigdaaaGcceWGubGbaKaadaqhaaWcbaGaam yEaaqaaiaad2eacaWGdbaaaaGcbaaabaGaaGypaiaad6eadaahaaWc beqaaiabgkHiTiaaigdaaaGccaWHKbWaaWbaaSqabeaacaWGbbaaaO GaaCyEaiabgUcaRmaabmaabaGaamOtamaaCaaaleqabaGaeyOeI0Ia aGymaaaakmaaqahabeWcbaGaam4Aaiaai2dacaaIXaaabaGaamOtaa qdcqGHris5aOGaaGPaVlabeY7aTnaabmaabaGaaCiEamaaBaaaleaa caWGRbaabeaakiaaiYcaceWHcbGbaKaaaiaawIcacaGLPaaacqGHRa WkdaaeqbqabSqaaiaadMgacqGHiiIZcaWGZbWaaSbaaWqaaiaadgea aeqaaaWcbeqdcqGHris5aOGaaGPaVlaad6eadaahaaWcbeqaaiabgk HiTiaaigdaaaGccaWGKbWaa0baaSqaaiaadMgaaeaacaWGbbaaaOGa eqiVd02aaeWaaeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGilai qahkeagaqcaaGaayjkaiaawMcaaaGaayjkaiaawMcaaiqadkeagaqc amaaCaaaleqabaGaamytaiaadoeaaaaakeaaaeaacaaI9aGaamOtam aaCaaaleqabaGaeyOeI0IaaGymaaaakiaahsgadaahaaWcbeqaaiaa dgeaaaGccaWH5bGaey4kaSYaaeWaaeaacaWGobWaaWbaaSqabeaacq GHsislcaaIXaaaaOWaaabCaeqaleaacaWGRbGaaGypaiaaigdaaeaa caWGobaaniabggHiLdGccaaMc8UaeqiVd02aaeWaaeaacaWH4bWaaS baaSqaaiaadUgaaeqaaOGaaGilaiaahkeaaiaawIcacaGLPaaacqGH sislcaWGobWaaWbaaSqabeaacqGHsislcaaIXaaaaOWaaabuaeqale aacaWGPbGaeyicI4Saam4CamaaBaaameaacaWGbbaabeaaaSqab0Ga eyyeIuoakiaaykW7caWGKbWaa0baaSqaaiaadMgaaeaacaWGbbaaaO GaeqiVd02aaeWaaeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGil aiaahkeaaiaawIcacaGLPaaacqGHRaWkcaWGpbWaaSbaaSqaaiaadc haaeqaaOWaaeWaaeaadaWcaaqaaiaaigdaaeaadaGcaaqaaiaad6ga aSqabaaaaaGccaGLOaGaayzkaaaacaGLOaGaayzkaaWaaeWaaeaaca WGcbWaaWbaaSqabeaacaWGnbGaam4qaaaakiabgUcaRiaad+gadaWg aaWcbaGaamiCaaqabaGcdaqadaqaaiaaigdaaiaawIcacaGLPaaaai aawIcacaGLPaaaaeaaaeaacaaI9aGaamOtamaaCaaaleqabaGaeyOe I0IaaGymaaaakiaahsgadaahaaWcbeqaaiaadgeaaaGccaWH5bGaey 4kaSYaaeWaaeaacaWGobWaaWbaaSqabeaacqGHsislcaaIXaaaaOWa aabCaeqaleaacaWGRbGaaGypaiaaigdaaeaacaWGobaaniabggHiLd GccaaMc8UaeqiVd02aaeWaaeaacaWH4bWaaSbaaSqaaiaadUgaaeqa aOGaaGilaiaahkeaaiaawIcacaGLPaaacqGHsislcaWGobWaaWbaaS qabeaacqGHsislcaaIXaaaaOWaaabuaeqaleaacaWGPbGaeyicI4Sa am4CamaaBaaameaacaWGbbaabeaaaSqab0GaeyyeIuoakiaaykW7ca WGKbWaa0baaSqaaiaadMgaaeaacaWGbbaaaOGaeqiVd02aaeWaaeaa caWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGilaiaahkeaaiaawIcaca GLPaaaaiaawIcacaGLPaaacaWGcbWaaWbaaSqabeaacaWGnbGaam4q aaaakiabgUcaRiaad+gadaWgaaWcbaGaamiCaaqabaGcdaqadaqaam aalaaabaGaaGymaaqaamaakaaabaGaamOBaaWcbeaaaaaakiaawIca caGLPaaacaGGUaaaaaaa@E54E@

Since N = O p ( N ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGobGaaGypaiaad+eadaWgaaWcba GaamiCaaqabaGcdaqadaqaaiaad6eaaiaawIcacaGLPaaacaGGSaaa aa@3896@ we have N o P ( 1 / n ) = O p ( N ) o p ( 1 / n ) = o p ( N / n ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGobGaeyyXICTaam4BamaaBaaale aacaWGqbaabeaakmaabmaabaWaaSGbaeaacaaIXaaabaWaaOaaaeaa caWGUbaaleqaaaaaaOGaayjkaiaawMcaaiabg2da9iaad+eadaWgaa WcbaGaamiCaaqabaGcdaqadaqaaiaad6eaaiaawIcacaGLPaaacaWG VbWaaSbaaSqaaiaadchaaeqaaOWaaeWaaeaadaWcgaqaaiaaigdaae aadaGcaaqaaiaad6gaaSqabaaaaaGccaGLOaGaayzkaaGaaGypaiaa d+gadaWgaaWcbaGaamiCaaqabaGcdaqadaqaamaalyaabaGaamOtaa qaamaakaaabaGaamOBaaWcbeaaaaaakiaawIcacaGLPaaacaGGUaaa aa@4C93@ Thus,

T ^ y M C = N y ¯ ^ M C = N ( N 1 d A y + ( N 1 k = 1 N μ ( x k , B ) N 1 i s A μ ( x i , B ) ) B M C + o p ( 1 n ) ) = d A y + ( k = 1 N μ ( x k , B ) i s A μ ( x i , B ) ) B M C + o p ( N n ) = i i n s A d i A ( y i μ i B M C ) + i = 1 N μ i B M C + o p ( N n ) . ( A .13 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9w8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeWacaaabaGabmivayaajaWaa0 baaSqaaiaadMhaaeaacaWGnbGaam4qaaaaaOqaaiaai2dacaWGobGa aGjcVlqadMhagaqegaqcamaaCaaaleqabaGaamytaiaadoeaaaGcca aI9aGaamOtamaabmaabaGaamOtamaaCaaaleqabaGaeyOeI0IaaGym aaaakiaahsgadaahaaWcbeqaaiaadgeaaaGccaWH5bGaey4kaSYaae WaaeaacaWGobWaaWbaaSqabeaacqGHsislcaaIXaaaaOWaaabCaeqa leaacaWGRbGaaGypaiaaigdaaeaacaWGobaaniabggHiLdGccaaMc8 UaeqiVd02aaeWaaeaacaWH4bWaaSbaaSqaaiaadUgaaeqaaOGaaGil aiaahkeaaiaawIcacaGLPaaacqGHsislcaWGobWaaWbaaSqabeaacq GHsislcaaIXaaaaOWaaabuaeqaleaacaWGPbGaeyicI4Saam4Camaa BaaameaacaWGbbaabeaaaSqab0GaeyyeIuoakiaaykW7cqaH8oqBda qadaqaaiaahIhadaWgaaWcbaGaamyAaaqabaGccaaISaGaaCOqaaGa ayjkaiaawMcaaaGaayjkaiaawMcaaiaadkeadaahaaWcbeqaaiaad2 eacaWGdbaaaOGaey4kaSIaam4BamaaBaaaleaacaWGWbaabeaakmaa bmaabaWaaSaaaeaacaaIXaaabaWaaOaaaeaacaWGUbaaleqaaaaaaO GaayjkaiaawMcaaaGaayjkaiaawMcaaaqaaaqaaiaai2dacaWHKbWa aWbaaSqabeaacaWGbbaaaOGaaCyEaiabgUcaRmaabmaabaWaaabCae qaleaacaWGRbGaaGypaiaaigdaaeaacaWGobaaniabggHiLdGccaaM c8UaeqiVd02aaeWaaeaacaWH4bWaaSbaaSqaaiaadUgaaeqaaOGaaG ilaiaahkeaaiaawIcacaGLPaaacqGHsisldaaeqbqabSqaaiaadMga cqGHiiIZcaWGZbWaaSbaaWqaaiaadgeaaeqaaaWcbeqdcqGHris5aO GaaGPaVlabeY7aTnaabmaabaGaaCiEamaaBaaaleaacaWGPbaabeaa kiaaiYcacaWHcbaacaGLOaGaayzkaaaacaGLOaGaayzkaaGaamOqam aaCaaaleqabaGaamytaiaadoeaaaGccqGHRaWkcaWGVbWaaSbaaSqa aiaadchaaeqaaOWaaeWaaeaadaWcaaqaaiaad6eaaeaadaGcaaqaai aad6gaaSqabaaaaaGccaGLOaGaayzkaaaabaaabaGaaGypamaaqafa beWcbaGaamyAaiaadMgacaWGUbGaam4CamaaBaaameaacaWGbbaabe aaaSqab0GaeyyeIuoakiaaykW7caWGKbWaa0baaSqaaiaadMgaaeaa caWGbbaaaOWaaeWaaeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaOGaey OeI0IaeqiVd02aaSbaaSqaaiaadMgaaeqaaOGaamOqamaaCaaaleqa baGaamytaiaadoeaaaaakiaawIcacaGLPaaacqGHRaWkdaaeWbqabS qaaiaadMgacaaI9aGaaGymaaqaaiaad6eaa0GaeyyeIuoakiaaykW7 cqaH8oqBdaWgaaWcbaGaamyAaaqabaGccaWGcbWaaWbaaSqabeaaca WGnbGaam4qaaaakiabgUcaRiaad+gadaWgaaWcbaGaamiCaaqabaGc daqadaqaamaalyaabaGaamOtaaqaamaakaaabaGaamOBaaWcbeaaaa aakiaawIcacaGLPaaacaGGUaGaaGzbVlaaywW7caaMf8UaaGzbVlaa ywW7caaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7caGGOaGaae yqaiaab6cacaqGXaGaae4maiaacMcaaaaaaa@E0B8@

Theorem 2: Suppose the parameters in a full regression model have both zero and non-zero components, without loss of generality, let the first p MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGWbaaaa@32E6@  be non-zero and the last q MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGXbaaaa@32E7@  be zero:

β F = ( β ( p × 1 ) ( 1 ) β ( q × 1 ) ( 2 ) ) , β ( 1 ) = β a n d β ( 2 ) = 0 ( q × 1 ) . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHYoWaaWbaaSqabeaacaWGgbaaaO GaaGypamaabmaabaqbaeqabiqaaaqaaiaahk7adaqhaaWcbaWaaeWa aeaacaWGWbGaey41aqRaaGymaaGaayjkaiaawMcaaaqaamaabmaaba GaaGymaaGaayjkaiaawMcaaaaaaOqaaiaahk7adaqhaaWcbaWaaeWa aeaacaWGXbGaey41aqRaaGymaaGaayjkaiaawMcaaaqaamaabmaaba GaaGOmaaGaayjkaiaawMcaaaaaaaaakiaawIcacaGLPaaacaGGSaGa aGjbVlaaykW7caWHYoWaaWbaaSqabeaadaqadaqaaiaaigdaaiaawI cacaGLPaaaaaGccqGH9aqpcaWHYoGaaGjbVlaaysW7caWGHbGaamOB aiaadsgacaaMe8UaaGjbVlaahk7adaahaaWcbeqaamaabmaabaGaaG OmaaGaayjkaiaawMcaaaaakiabg2da9iaahcdadaWgaaWcbaWaaeWa aeaacaWGXbGaey41aqRaaGymaaGaayjkaiaawMcaaaqabaGccaGGUa aaaa@670E@

Under conditions (1)-(5), the asymptotic LASSO calibration estimator of total is:

T ^ y L A S S O = i s A d i A ( y i μ i B M C ) + i = 1 N μ i B M C + o p ( N n ) . ( A .14 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWGubGbaKaadaqhaaWcbaGaamyEaa qaaiaadYeacaWGbbGaam4uaiaadofacaWGpbaaaOGaaGypamaaqafa beWcbaGaamyAaiabgIGiolaadohadaWgaaadbaGaamyqaaqabaaale qaniabggHiLdGccaaMc8UaamizamaaDaaaleaacaWGPbaabaGaamyq aaaakmaabmaabaGaamyEamaaBaaaleaacaWGPbaabeaakiabgkHiTi abeY7aTnaaBaaaleaacaWGPbaabeaakiaadkeadaahaaWcbeqaaiaa d2eacaWGdbaaaaGccaGLOaGaayzkaaGaey4kaSYaaabCaeqaleaaca WGPbGaaGypaiaaigdaaeaacaWGobaaniabggHiLdGccaaMc8UaeqiV d02aaSbaaSqaaiaadMgaaeqaaOGaamOqamaaCaaaleqabaGaamytai aadoeaaaGccqGHRaWkcaWGVbWaaSbaaSqaaiaadchaaeqaaOWaaeWa aeaadaWcaaqaaiaad6eaaeaadaGcaaqaaiaad6gaaSqabaaaaaGcca GLOaGaayzkaaGaaiOlaiaaywW7caaMf8UaaGzbVlaaywW7caaMf8Ua aiikaiaabgeacaqGUaGaaeymaiaabsdacaGGPaaaaa@6E94@

Proof. Under condition (5), the adaptive LASSO regression satisfies the oracle property through Theorems 1 and 4 in Zou (2006):

P r ( B ( 2 ) = 0 ) 1 n ( B ^ ( 1 ) B ) N ( 0 , C ) B β MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeWacaaabaGaaGPaVlaaykW7ca WGqbGaamOCamaabmaabaGaaCOqamaaCaaaleqabaWaaeWaaeaacaaI YaaacaGLOaGaayzkaaaaaOGaaGypaiaahcdaaiaawIcacaGLPaaaae aacqGHsgIRcaaMe8UaaGjbVlaaigdaaeaadaGcaaqaaiaad6gaaSqa baGcdaqadaqaaiqahkeagaqcamaaCaaaleqabaWaaeWaaeaacaaIXa aacaGLOaGaayzkaaaaaOGaeyOeI0IaaCOqaaGaayjkaiaawMcaaaqa aiabgkziUkaaysW7caaMe8UaamOtamaabmaabaGaaCimaiaaiYcaca WHdbaacaGLOaGaayzkaaaabaGaaGPaVlaaykW7caaMc8UaaGPaVlaa ykW7caaMc8UaaGPaVlaaykW7caaMc8UaaGPaVlaaykW7caaMc8UaaG PaVlaaykW7caaMc8UaaGPaVlaaykW7caaMc8UaaGPaVlaaykW7caaM c8UaaGPaVlaaykW7caaMc8UaaCOqaaqaaiabgkziUkaaysW7caaMe8 UaaCOSdaaaaaa@805B@

where C = Σ ( B ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHdbGaaGypaiabfo6atnaabmaaba GaaCOqaaGaayjkaiaawMcaaaaa@375C@  is the covariance matrix of B ( 1 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHcbWaaWbaaSqabeaadaqadaqaai aaigdaaiaawIcacaGLPaaaaaaaaa@352D@  under the linear model, and C = I 1 ( B ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHdbGaaGypaiaadMeadaahaaWcbe qaaiabgkHiTiaaigdaaaGcdaqadaqaaiaahkeaaiaawIcacaGLPaaa aaa@3885@  is the inverse of Fisher information matrix of B ( 1 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWHcbWaaWbaaSqabeaadaqadaqaai aaigdaaiaawIcacaGLPaaaaaaaaa@352D@  under generalized linear model. By Slutsky’s theorem, the oracle property implies B ^ ( 1 ) = B + O p ( 1 / n ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWHcbGbaKaadaahaaWcbeqaamaabm aabaGaaGymaaGaayjkaiaawMcaaaaakiaai2dacaWHcbGaey4kaSIa am4tamaaBaaaleaacaWGWbaabeaakmaabmaabaWaaSGbaeaacaaIXa aabaWaaOaaaeaacaWGUbaaleqaaaaaaOGaayjkaiaawMcaaiaac6ca aaa@3DDE@  By condition (1) and Lemma 1:

T ^ y L A S S O T ^ y M C = i i n s A d i A ( y i μ i B M C ) + i = 1 N μ i B M C + o p ( N n ) . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeGacaaabaGabmivayaajaWaa0 baaSqaaiaadMhaaeaacaWGmbGaamyqaiaadofacaWGtbGaam4taaaa aOqaaiabgIKi7kqadsfagaqcamaaDaaaleaacaWG5baabaGaamytai aadoeaaaaakeaaaeaacaaI9aWaaabuaeqaleaacaWGPbGaamyAaiaa d6gacaWGZbWaaSbaaWqaaiaadgeaaeqaaaWcbeqdcqGHris5aOGaaG PaVlaadsgadaqhaaWcbaGaamyAaaqaaiaadgeaaaGcdaqadaqaaiaa dMhadaWgaaWcbaGaamyAaaqabaGccqGHsislcqaH8oqBdaWgaaWcba GaamyAaaqabaGccaWGcbWaaWbaaSqabeaacaWGnbGaam4qaaaaaOGa ayjkaiaawMcaaiabgUcaRmaaqahabeWcbaGaamyAaiaai2dacaaIXa aabaGaamOtaaqdcqGHris5aOGaaGPaVlabeY7aTnaaBaaaleaacaWG PbaabeaakiaadkeadaahaaWcbeqaaiaad2eacaWGdbaaaOGaey4kaS Iaam4BamaaBaaaleaacaWGWbaabeaakmaabmaabaWaaSaaaeaacaWG obaabaWaaOaaaeaacaWGUbaaleqaaaaaaOGaayjkaiaawMcaaiaac6 caaaaaaa@686A@

Theorem 3: T ^ y L A S S O MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWGubGbaKaadaqhaaWcbaGaamyEaa qaaiaadYeacaWGbbGaam4uaiaadofacaWGpbaaaaaa@3820@  is model-unbiased.  

Proof. Under the assumption of our theoretical framework, the superpopulation parameters are a subset of the full LASSO regression parameters, we can prove the model-unbiasedness of T ^ y L A S S O MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWGubGbaKaadaqhaaWcbaGaamyEaa qaaiaadYeacaWGbbGaam4uaiaadofacaWGpbaaaaaa@3820@  by taking expectations with respect to model ξ . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacqaH+oaEcaGGUaaaaa@3466@  First note that:

E ξ [ B M C ] = E ξ [ i = 1 N ( μ i μ ¯ ) ( y i y ¯ ) i = 1 N ( μ i μ ¯ ) 2 ] = i = 1 N ( μ i μ ¯ ) ( μ i μ ¯ ) i = 1 N ( μ i μ ¯ ) 2 = 1. MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaacaWGfbWaaSbaaSqaaiabe67a4bqaba GcdaWadaqaaiaadkeadaahaaWcbeqaaiaad2eacaWGdbaaaaGccaGL BbGaayzxaaGaaGypaiaadweadaWgaaWcbaGaeqOVdGhabeaakmaadm aabaWaaSaaaeaadaaeWaqaamaabmaabaGaeqiVd02aaSbaaSqaaiaa dMgaaeqaaOGaeyOeI0IafqiVd0MbaebaaiaawIcacaGLPaaadaqada qaaiaadMhadaWgaaWcbaGaamyAaaqabaGccqGHsislceWG5bGbaeba aiaawIcacaGLPaaaaSqaaiaadMgacaaI9aGaaGymaaqaaiaad6eaa0 GaeyyeIuoaaOqaamaaqadabaWaaeWaaeaacqaH8oqBdaWgaaWcbaGa amyAaaqabaGccqGHsislcuaH8oqBgaqeaaGaayjkaiaawMcaamaaCa aaleqabaGaaGOmaaaaaeaacaWGPbGaaGypaiaaigdaaeaacaWGobaa niabggHiLdaaaaGccaGLBbGaayzxaaGaaGypamaalaaabaWaaabmae aadaqadaqaaiabeY7aTnaaBaaaleaacaWGPbaabeaakiabgkHiTiqb eY7aTzaaraaacaGLOaGaayzkaaWaaeWaaeaacqaH8oqBdaWgaaWcba GaamyAaaqabaGccqGHsislcuaH8oqBgaqeaaGaayjkaiaawMcaaaWc baGaamyAaiaai2dacaaIXaaabaGaamOtaaqdcqGHris5aaGcbaWaaa bmaeaadaqadaqaaiabeY7aTnaaBaaaleaacaWGPbaabeaakiabgkHi TiqbeY7aTzaaraaacaGLOaGaayzkaaWaaWbaaSqabeaacaaIYaaaaa qaaiaadMgacaaI9aGaaGymaaqaaiaad6eaa0GaeyyeIuoaaaGccaaI 9aGaaGymaiaac6caaaa@8210@

Thus

E ξ [ T ^ y L A S S O T ] E ξ [ i s A d i A ( y i μ i B M C ) + i = 1 N μ i B M C i = 1 N y i ] = i s A d i A ( μ i μ i ) + i = 1 N μ i i = 1 N μ i ( s i n c e E ξ [ B M C ] = 1 ) = 0. MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaafaqaaeWacaaabaGaamyramaaBaaale aacqaH+oaEaeqaaOWaamWaaeaaceWGubGbaKaadaqhaaWcbaGaamyE aaqaaiaadYeacaWGbbGaam4uaiaadofacaWGpbaaaOGaeyOeI0Iaam ivaaGaay5waiaaw2faaaqaaiabgIKi7kaadweadaWgaaWcbaGaeqOV dGhabeaakmaadmaabaWaaabuaeqaleaacaWGPbGaeyicI4Saam4Cam aaBaaameaacaWGbbaabeaaaSqab0GaeyyeIuoakiaaykW7caWGKbWa a0baaSqaaiaadMgaaeaacaWGbbaaaOWaaeWaaeaacaWG5bWaaSbaaS qaaiaadMgaaeqaaOGaeyOeI0IaeqiVd02aaSbaaSqaaiaadMgaaeqa aOGaamOqamaaCaaaleqabaGaamytaiaadoeaaaaakiaawIcacaGLPa aacqGHRaWkdaaeWbqabSqaaiaadMgacaaI9aGaaGymaaqaaiaad6ea a0GaeyyeIuoakiaaykW7cqaH8oqBdaWgaaWcbaGaamyAaaqabaGcca WGcbWaaWbaaSqabeaacaWGnbGaam4qaaaakiabgkHiTmaaqahabeWc baGaamyAaiaai2dacaaIXaaabaGaamOtaaqdcqGHris5aOGaaGPaVl aadMhadaWgaaWcbaGaamyAaaqabaaakiaawUfacaGLDbaaaeaaaeaa caaI9aWaaabuaeqaleaacaWGPbGaeyicI4Saam4CamaaBaaameaaca WGbbaabeaaaSqab0GaeyyeIuoakiaaykW7caWGKbWaa0baaSqaaiaa dMgaaeaacaWGbbaaaOWaaeWaaeaacqaH8oqBdaWgaaWcbaGaamyAaa qabaGccqGHsislcqaH8oqBdaWgaaWcbaGaamyAaaqabaaakiaawIca caGLPaaacqGHRaWkdaaeWbqabSqaaiaadMgacaaI9aGaaGymaaqaai aad6eaa0GaeyyeIuoakiaaykW7cqaH8oqBdaWgaaWcbaGaamyAaaqa baGccqGHsisldaaeWbqabSqaaiaadMgacaaI9aGaaGymaaqaaiaad6 eaa0GaeyyeIuoakiaaykW7cqaH8oqBdaWgaaWcbaGaamyAaaqabaGc caaMf8UaaGjcVpaabmaabaGaam4CaiaadMgacaWGUbGaam4yaiaadw gacaaMe8UaamyramaaBaaaleaacqaH+oaEaeqaaOWaamWaaeaacaWG cbWaaWbaaSqabeaacaWGnbGaam4qaaaaaOGaay5waiaaw2faaiaays W7caaI9aGaaGjbVlaaigdaaiaawIcacaGLPaaaaeaaaeaacaaI9aGa aGimaiaac6caaaaaaa@B44F@

Thus, as long as LASSO regression parameters include the superpopulation parameters, T ^ y LASSO MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9y8WrFr0xc9vqFj0db9qqvqFr0dXdHiVc=b YP0xH8peuj0lXxdrpe0db9Wqpepic9qr=xfr=xfr=tmeaabaqaciGa caGaaeqabaqaaeaadaaakeaaceWGubGbaKaadaqhaaWcbaGaamyEaa qaaiaabYeacaqGbbGaae4uaiaabofacaqGpbaaaaaa@3816@ is model-unbiased regardless of design weights. This property is essential in non-probability samples, where there are no initial design weights to guarantee unbiasedness.

References

Baker, R., Brick, J.M., Bates, N.A., Battaglia, M., Couper, M.P., Dever, J.A., Gile, K. and Tourangeau, R. (2013). Summary report of the AAPOR task force on non-probability sampling. Journal of Survey Statistics and Methodology, 1, 90-143.

Chipman, H.A., George, E.I. and McCulloch, R.E. (2010). BART: Bayesian additive regression trees. Annals of Applied Statistics, 4, 266-298.

Cardot, H., Goga, C. and Shehzad, M.-A. (2017). Calibration and partial calibration on principal components when the number of auxiliary variables is large. Statistica Sinica, 27, 243-260.

Centers for Disease Control and Prevention (2005). 2004 National Health Interview Survey (NHIS) Public Use Data Release: NHIS Survey Description. National Center for Health Statistics: Hyattsville, Maryland. www.cdc.gov/nchs/data/nhis/srvydesc.pdf.

Czanner, G., Sarma, S.V., Eden, U.T. and Brown, E.N. (2008). A signal-to-noise ratio estimator for generalized linear model systems. Proceedings of the World Congress on Engineering, vol. 2.

Deville, J.-C., and Särndal, C.-E. (1992). Calibration estimators in survey sampling. Journal of the American Statistical Association, 87, 376-382.

Dormann, C.F., Elith, J., Bacher, S., Buchmann, C., Carl, G., Carre, G., Marquez, J.R.G., Gruber, B., Lafourcade, B., Leitao, P.J. and Mnkemller, T. (2013). Collinearity: A review of methods to deal with it and a simulation study evaluating their performance. Ecology, 36, 27-46.

Elliott, M.R. (2009). Combining data from probability and nonprobability samples using pseudo-weights. Survey Practice, 2(6).

Elliott, M.R, Resler, A., Flannagan, C. and Rupp, J. (2010). Combining data from probability and non-probability samples using pseudo-weights.  Accident Analysis and Prevention, 42, 530-539.

Elliott, M.R., and Valliant, R. (2017). Inference for non-probability samples. Statistical Science, 32, 249-264.

Fan, J., and Li, R. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association, 96, 1348-1360.

Friedman, J., Hastie, T. and Tibshirani, R. (2010). Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 33, 1-22.

Frankel, M.R., and Frankel, L.R. (1987). Fifty years of survey sampling in the United States. Public Opinion Quarterly, S127-S138.

Fuller, W.A. (2009). Sampling Statistics. New York: John Wiley & Sons, Inc.

Goga, C., Muhammad-Shehzad, A. and Vanheuverzwyn, A. (2011). Principal component regression with survey data: Application on the French media audience. Proceedings of the 58th World Statistics Congress of the International Statistical Institute, 3847-3852.

Groves, R.M. (2006). Nonresponse rates and nonresponse bias in household surveys. Public Opinion Quarterly, 70, 646-675.

Kamarianakis, Y., Shen, W. and Wynter, L. (2012). Real-time road traffic forecasting using regime-switching space-time models and adaptive LASSO. Applied Stochastic Models in Business and Industry, 28, 297-315.

Kohannim, O., Hibar, D.P., Stein, J.L., Jahanshad, N., Hua, X., Rajagopalan, P., Toga, A., Jack Jr, C.R., Weiner, M.W., de Zubicaray, G.I. and McMahon, K.L. (2012). Discovery and replication of gene influences on brain structure using LASSO regression. Frontiers in Neuroscience, 6, 115.

Kohut, A., Keeter, S., Doherty, C., Dimock, M. and Christian, L. (2012). Assessing the representativeness of public opinion surveys. Pew Research Center for The People & The Press. http://www.people-press.org/2012/05/15/assessing-the-representativeness-of-public-opinion-surveys/.

Mosteller, F. (1949). The Pre-Election Polls of 1948: The Report to the Committee on Analysis of Pre-Election Polls and Forecasts, vol. 60, Social Science Research Council.

McConville, K. (2011). Improved Estimation for Complex Surveys Using Modern Regression Techniques. Unpublished PhD Thesis, Colorado State University.

McConville, K., Breidt, F.J., Lee, T.M. and Moisen, G.G. (2017). Model-assisted survey regression estimation with the LASSO. Journal of Survey Statistics and Methodology, 5, 131-158.

Park, M., and Yang, M. (2008). Ridge regression estimation for survey samples. Communication in Statistics - Theory and Methods, 37, 532-543.

Rivers, D. (2007). Sampling for web surveys. Proceedings of the Joint Statistical Meetings, American Statistical Association.

Särndal, C.-E., Swensson, B. and Wretman, J. (1989). The weighted residual technique for estimating the variance of the general regression estimator of the finite population total. Biometrika, 76, 527-537.

Särndal, C.-E., Swensson, B. and Wretman, J. (1992). Model Assisted Survey Sampling. New York: Springer.

Skinner, C., and Silva, P. (1997). Variable selection for regression estimation in the presence of nonresponse. Proceedings of the Survey Research Methods Section, American Statistical Association, 76-81.

Stephan, F.F. (1948). History of the uses of modern sampling procedures. Journal of the American Statistical Association, 43, 12-39.

Terhanian, G., and Bremer, J. (2012). A smarter way to select respondents for surveys? International Journal of Market Research, 54, 751-780.

Tibshirani, R. (1996). Regression shrinkage and selection via the LASSO. Journal of the Royal Statistical Society, 58, 267-288.

Tourangeau, R., Conrad, F.G. and Couper, M.P. (2013). The Science of Web Surveys. Oxford University Press, Oxford, UK.

Vavreck, L., and Rivers, D. (2008). The 2006 Cooperative Congressional Election Study. Journal of Elections, Public Opinion, and Parties, 355-366.

Wang, W., Rothschild, D., Goel, S. and Gelman, A. (2015). Forecasting elections with non-representative Polls. International Journal of Forecasting, 31, 980-991.

Wu, C., and Sitter, R.R. (2001). A model-calibration approach to using complete auxiliary information from survey data. Journal of the American Statistical Association, 96, 185-193.

Wu, T.T., Chen, Y.F., Hastie, T., Sobel, E. and Lange, K. (2009). Genome-wide association analysis by LASSO penalized logistic regression. Bioinformatics, 25, 714-721.

Zou, H. (2006). The adaptive LASSO and its oracle properties. Journal of the American Statistical Association, 101, 1418-1429.


Date modified: