Comments on “Statistical inference with non-probability survey samples”
Section 2. Additional approaches to combining data from probability and non-probability surveys

Dr. Wu’s paper follows the general prescription of 1) using model estimation and subsequent calibration to probability-sample-estimated covariate distributions, 2) developing propensity score estimates based on discrepancies between the probability- and non-probability sample data, and 3) doubly-robust methods that combine 1) and 2) in a manner such that only one of the two underlying models needs to be correct.

2.1  Propensity score estimators

Rivers (2007) appears to have been the first to suggest estimating propensity score using logistic regression with membership in the non-probability sample as the outcome and taking the reciprocal of the resulting propensity scores to use as inclusion weights. This approach was formalized further in Valliant and Dever (2011). Separately, using simple results from Bayes’ theorem and discriminant analysis first described in Elliott and Davis (2005), Elliott, Resler, Flannagan and Rupp (2010) and Elliott (2013) developed a somewhat different estimator of the form

π ^ i A ( x i ,α )= P ^ ( i S A )P( i S B ) P ^ ( i S A |i S A ori S B , x i ,α ) P ^ ( i S B |i S A ori S B , x i ,α ) .(2.1) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacuaHapaCgaqcamaaDaaaleaacaWGPb aabaGaamyqaaaakiaaykW7daqadeqaaiaahIhadaWgaaWcbaGaamyA aaqabaGccaaISaGaaGjbVlaahg7aaiaawIcacaGLPaaacaaMe8UaaG jbVlaai2dacaaMe8UaaGjbVlqadcfagaqcaiaaykW7daqadeqaaiaa dMgacqGHiiIZcaWGtbWaaSbaaSqaaiaadgeaaeqaaaGccaGLOaGaay zkaaGaaGjbVlabg2Hi1kaaysW7caWGqbGaaGPaVpaabmqabaGaamyA aiaaysW7cqGHiiIZcaaMe8Uaam4uamaaBaaaleaacaWGcbaabeaaaO GaayjkaiaawMcaaiaaysW7daWcaaqaaiqadcfagaqcamaabmqabaWa aqGabeaacaWGPbGaaGjbVlabgIGiolaaysW7caWGtbWaaSbaaSqaai aadgeaaeqaaOGaaGPaVdGaayjcSdGaaGjbVlaadMgacaaMe8Uaeyic I4SaaGjbVlaadofadaWgaaWcbaGaamyqaaqabaGccaaMe8UaaGjbVl aab+gacaqGYbGaaGjbVlaaysW7caWGPbGaaGjbVlabgIGiolaaysW7 caWGtbWaaSbaaSqaaiaadkeaaeqaaOGaaGilaiaaysW7caWH4bWaaS baaSqaaiaadMgaaeqaaOGaaGilaiaaysW7caWHXoaacaGLOaGaayzk aaaabaGabmiuayaajaWaaeWabeaadaabceqaaiaadMgacaaMe8Uaey icI4SaaGjbVlaadofadaWgaaWcbaGaamOqaaqabaGccaaMc8oacaGL iWoacaaMe8UaamyAaiaaysW7cqGHiiIZcaaMe8Uaam4uamaaBaaale aacaWGbbaabeaakiaaysW7caaMe8Uaae4BaiaabkhacaaMe8UaaGjb VlaadMgacaaMe8UaeyicI4SaaGjbVlaadofadaWgaaWcbaGaamOqaa qabaGccaaISaGaaGjbVlaahIhadaWgaaWcbaGaamyAaaqabaGccaaI SaGaaGjbVlaahg7aaiaawIcacaGLPaaaaaGaaGOlaiaaywW7caaMf8 UaaGzbVlaaywW7caaMf8UaaiikaiaaikdacaGGUaGaaGymaiaacMca aaa@C560@

P ^ ( i S A |i S A ori S B , x i ,α ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaaceWGqbGbaKaacaaMc8+aaeWabeaada abceqaaiaadMgacaaMe8UaeyicI4SaaGjbVlaadofadaWgaaWcbaGa amyqaaqabaGccaaMc8oacaGLiWoacaaMe8UaamyAaiaaysW7cqGHii IZcaaMe8Uaam4uamaaBaaaleaacaWGbbaabeaakiaaysW7caaMe8Ua ae4BaiaabkhacaaMe8UaaGjbVlaadMgacaaMe8UaeyicI4SaaGjbVl aadofadaWgaaWcbaGaamOqaaqabaGccaaISaGaaGjbVlaahIhadaWg aaWcbaGaamyAaaqabaGccaaISaGaaGjbVlaahg7aaiaawIcacaGLPa aaaaa@607A@  can be obtained using logistic regression, or using one of the suite of machine learning-type approaches such as support vector machines (Soentpiet, 1999), targeted maximum likelihood estimation (Van Der Laan and Rubin, 2006), or Bayesian Additive Regression Trees (BART) (Chipman, George and McCulloch, 2010), and P ^ ( i S A |i S B ori S B , x i ,α ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaaceWGqbGbaKaacaaMc8+aaeWabeaada abceqaaiaadMgacaaMe8UaeyicI4SaaGjbVlaadofadaWgaaWcbaGa amyqaaqabaGccaaMc8oacaGLiWoacaaMe8UaamyAaiaaysW7cqGHii IZcaaMe8Uaam4uamaaBaaaleaacaWGcbaabeaakiaaysW7caaMe8Ua ae4BaiaabkhacaaMe8UaaGjbVlaadMgacaaMe8UaeyicI4SaaGjbVl aadofadaWgaaWcbaGaamOqaaqabaGccaaISaGaaGjbVlaahIhadaWg aaWcbaGaamyAaaqabaGccaaISaGaaGjbVlaahg7aaiaawIcacaGLPa aaaaa@607B@  obtained as 1 P ^ ( i S A |i S A ori S B , x i ,α ). MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaaIXaGaaGjbVlabgkHiTiaaysW7ce WGqbGbaKaacaaMc8+aaeWabeaadaabceqaaiaadMgacaaMe8Uaeyic I4SaaGjbVlaadofadaWgaaWcbaGaamyqaaqabaGccaaMc8oacaGLiW oacaaMe8UaamyAaiaaysW7cqGHiiIZcaaMe8Uaam4uamaaBaaaleaa caWGbbaabeaakiaaysW7caaMe8Uaae4BaiaabkhacaaMe8UaaGjbVl aadMgacaaMe8UaeyicI4SaaGjbVlaadofadaWgaaWcbaGaamOqaaqa baGccaaISaGaaGjbVlaahIhadaWgaaWcbaGaamyAaaqabaGccaaISa GaaGjbVlaahg7aaiaawIcacaGLPaaacaGGUaaaaa@65EE@  In principle P( i S B )= 1/ d i B MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGqbGaaGPaVpaabmqabaGaamyAai aaysW7cqGHiiIZcaaMe8Uaam4uamaaBaaaleaacaWGcbaabeaaaOGa ayjkaiaawMcaaiaaysW7caaI9aGaaGjbVpaalyaabaGaaGymaiaayI W7aeaacaWGKbWaa0baaSqaaiaadMgaaeaacaWGcbaaaaaaaaa@4606@  is known since sampling probabilities are known for all elements of the population, including those in the non-probability sample, but in practice analysts with access only to public use data may have to estimate this as well. (In addition, d i B MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGKbWaa0baaSqaaiaadMgaaeaaca WGcbaaaaaa@3478@  may include calibration and non-response adjustments that are not known for the non-probability sample elements.) This last point is critical as use of the probability sample to develop propensity scores using only the discrepancies between the non-probability sample and the probability sample will be biased unless the probability sample used an equal probability (epsem) design, as noted by Wu.

In contrast, Chen, Li and Wu (2020) shows that using a pseudo-likelihood approach to estimating π ^ i A ( x i ,α ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacuaHapaCgaqcamaaDaaaleaacaWGPb aabaGaamyqaaaakiaaykW7daqadeqaaiaahIhadaWgaaWcbaGaamyA aaqabaGccaaISaGaaGjbVlaahg7aaiaawIcacaGLPaaaaaa@3E1F@  directly from the population likelihood for the indicators I( i S A ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGjbGaaGPaVpaabmqabaGaamyAai aaysW7cqGHiiIZcaaMe8Uaam4uamaaBaaaleaacaWGbbaabeaaaOGa ayjkaiaawMcaaaaa@3CF0@  as a function of x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaa aa@33C8@  yields an estimator that does not require P( i S B ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGqbGaaGPaVpaabmqabaGaamyAai aaysW7cqGHiiIZcaaMe8Uaam4uamaaBaaaleaacaWGcbaabeaaaOGa ayjkaiaawMcaaaaa@3CF8@  for elements in the non-probability sample under the restriction that π i A ( x i ,α ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacqaHapaCdaqhaaWcbaGaamyAaaqaai aadgeaaaGccaaMc8+aaeWabeaacaWH4bWaaSbaaSqaaiaadMgaaeqa aOGaaGilaiaaysW7caWHXoaacaGLOaGaayzkaaaaaa@3E0F@  follows a generalized linear model with a canonical link, i.e., logistic regression.

(None of these approaches actually has the correct intercept to obtain a true propensity score; however, as noted in Wu, weighted estimation usually uses Hájek-type estimators [using weights to estimate a population total for denominators; Hájek, 1971] so that propensity scores estimated up to a normalizing constant are sufficient.)

2.2  Doubly-robust estimators

If inference is focused on a particular variable Y MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGzbaaaa@328B@  available only in the non-probability sample, we can return to the model-assisted estimators that date back to Cassel, Särndal and Wretman (1976), which posit a model for the expectation E( y i | x i )= m i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGfbGaaGPaVpaabmqabaWaaqGabe aacaWG5bWaaSbaaSqaaiaadMgaaeqaaOGaaGPaVdGaayjcSdGaaGjb VlaahIhadaWgaaWcbaGaamyAaaqabaaakiaawIcacaGLPaaacaaMe8 UaaGypaiaaysW7caWGTbWaaSbaaSqaaiaadMgaaeqaaOGaaiOlaaaa @452B@  Combining this with propensity score estimates of the probability of being in the non-probability sample (which we are treating as an “unknown probability sample” MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaacbaqcLbwaqa aaaaaaaaWdbiaa=nbiaaa@37A3@  more about this under Assumptions below) yields estimators of the form

1 N ^ A i S A y i m ^ i π ^ i A + 1 N ^ B i S B d i B m ^ i (2.2) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaadaWcaaqaaiaaigdaaeaaceWGobGbaK aadaahaaWcbeqaaiaadgeaaaaaaOGaaGjbVpaaqafabeWcbaGaamyA aiaaykW7cqGHiiIZcaaMc8Uaam4uamaaBaaameaacaWGbbaabeaaaS qab0GaeyyeIuoakiaaysW7daWcaaqaaiaadMhadaWgaaWcbaGaamyA aaqabaGccaaMe8UaeyOeI0IaaGjbVlqad2gagaqcamaaBaaaleaaca WGPbaabeaaaOqaaiqbec8aWzaajaWaa0baaSqaaiaadMgaaeaacaWG bbaaaaaakiaaysW7caaMe8Uaey4kaSIaaGjbVlaaysW7daWcaaqaai aaigdaaeaaceWGobGbaKaadaahaaWcbeqaaiaadkeaaaaaaOWaaabu aeqaleaacaWGPbGaaGPaVlabgIGiolaaykW7caWGtbWaaSbaaWqaai aadkeaaeqaaaWcbeqdcqGHris5aOGaaGPaVlaadsgadaqhaaWcbaGa amyAaaqaaiaadkeaaaGcceWGTbGbaKaadaWgaaWcbaGaamyAaaqaba GccaaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaacIcacaaIYaGaaiOl aiaaikdacaGGPaaaaa@7232@

corresponding to μ ^ DR2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacuaH8oqBgaqcamaaBaaaleaacaqGeb GaaeOuaiaaikdaaeqaaaaa@35F7@  of (4.11) in Wu. The intuition is that any bias due to the model misspecification in estimation of m i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGTbWaaSbaaSqaaiaadMgaaeqaaa aa@33B9@  in 1 N ^ B i S B d i B m ^ i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaadaWcbaWcbaGaaGymaaqaaiqad6eaga qcamaaCaaameqabaGaamOqaaaaaaGccaaMe8+aaabeaeqaleaacaWG PbGaeyicI4Saam4uamaaBaaameaacaWGcbaabeaaaSqab0GaeyyeIu oakiaaykW7caWGKbWaa0baaSqaaiaadMgaaeaacaWGcbaaaOGabmyB ayaajaWaaSbaaSqaaiaadMgaaeqaaaaa@42A6@  will be equal to and opposite in sign of 1 N ^ A i S A y i m ^ i π ^ i A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaadaWcbaWcbaGaaGymaaqaaiqad6eaga qcamaaCaaameqabaGaamyqaaaaaaGcdaaeqaqabSqaaiaadMgacaaM c8UaeyicI4SaaGPaVlaadofadaWgaaadbaGaamyqaaqabaaaleqani abggHiLdGccaaMe8+aaSqaaSqaaiaadMhadaWgaaadbaGaamyAaaqa baWccaaMe8UaeyOeI0IaaGjbVlqad2gagaqcamaaBaaameaacaWGPb aabeaaaSqaaiqbec8aWzaajaWaa0baaWqaaiaadMgaaeaacaWGbbaa aaaaaaa@4B5C@  if the model for π i A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacqaHapaCdaqhaaWcbaGaamyAaaqaai aadgeaaaaaaa@354B@  is correctly specified. Conversely, if the model for π i A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacqaHapaCdaqhaaWcbaGaamyAaaqaai aadgeaaaaaaa@354B@  is misspecified but m i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGTbWaaSbaaSqaaiaadMgaaeqaaa aa@33B9@  is correctly specified, y i m ^ i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaO GaaGjbVlabgkHiTiaaysW7ceWGTbGbaKaadaWgaaWcbaGaamyAaaqa baaaaa@39F2@  will be iid with mean zero and consequently 1 N ^ A i S A y i m ^ i π ^ i A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaadaWcbaWcbaGaaGymaaqaaiqad6eaga qcamaaCaaameqabaGaamyqaaaaaaGcdaaeqaqabSqaaiaadMgacaaM c8UaeyicI4SaaGPaVlaadofadaWgaaadbaGaamyqaaqabaaaleqani abggHiLdGcdaWcbaWcbaGaamyEamaaBaaameaacaWGPbaabeaaliaa ysW7cqGHsislcaaMe8UabmyBayaajaWaaSbaaWqaaiaadMgaaeqaaa WcbaGafqiWdaNbaKaadaqhaaadbaGaamyAaaqaaiaadgeaaaaaaaaa @49CF@  will also have mean 0, yielding an unbiased estimator. Chen, Valliant and Elliott (2019) used LASSO for prediction in combination with generalized regression estimators (McConville, Breidt, Lee and Moisen, 2017) when X MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWHybaaaa@328E@  is of high dimension. As Wu notes, Wu and Sitter (2001) show the equivalence between GREG applied to predicted values and DR estimators of the form in (2.2), which indicates that the Chen et al. (2019) approach was equivalent to (2.2) with LASSO estimation for m i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGTbWaaSbaaSqaaiaadMgaaeqaaa aa@33B9@  and an assumption of simple random sampling for the non-probability sample.

A disadvantage of using (2.1) as opposed to Chen et al. (2020) as the estimator of π i A , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacqaHapaCdaqhaaWcbaGaamyAaaqaai aadgeaaaGccaGGSaaaaa@3605@  and thus of d i A , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGKbWaa0baaSqaaiaadMgaaeaaca WGbbaaaOGaaiilaaaa@3531@  is the requirement that the probability sample weights d i B MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGKbWaa0baaSqaaiaadMgaaeaaca WGcbaaaaaa@3478@  be known or at least estimated for the non-probability sample. An advantage of using (2.1), is that non-linear models and machine learning methods can be used in estimation. Rafei, Flannagan and Elliott (2020) uses BART to estimate both m i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGTbWaaSbaaSqaaiaadMgaaeqaaa aa@33B9@  and π i A , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacqaHapaCdaqhaaWcbaGaamyAaaqaai aadgeaaaGccaGGSaaaaa@3605@  reducing the impact of potential model misspecification. Simulations showed considerable improvement in bias and variance reduction over the method of Chen et al. (2020) when the linear models is misspecified. Variance estimation can proceed by adapting Rubin’s multiple imputation rules: from M MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGnbaaaa@327F@  independent draws from BART, the mean of the variances computed treating the draw of d i A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGKbWaa0baaSqaaiaadMgaaeaaca WGbbaaaaaa@3477@  as known using standard complex sample design estimators and added to M+1 M MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaadaWcbaWcbaGaamytaiaaysW7cqGHRa WkcaaMe8UaaGymaaqaaiaad2eaaaaaaa@3824@  times the variance of the point estimates computed across the draws of d i A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGKbWaa0baaSqaaiaadMgaaeaaca WGbbaaaaaa@3477@  yield an approximately unbiased variance estimator.

An alternative approach to doubly-robust estimation uses the fact that the propensity score is the coarsest possible “balancing score” that contains all of the information about the association between the sampling indicator and the outcome of interest. This has led to the development of mean estimators that use smooth functions of weights to produce consistent estimators that can be more efficient when weights are highly variable or only weakly related to the outcome (Elliott and Little, 2000; Zheng and Little, 2005). Zhou, Elliott and Little (2019) extended this idea into the causal inference setting in non-randomized settings, in which probability of assignment to a treatment or exposure (propensity score) is estimated as a function of covariates P Z ( x i ,α ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGqbWaaSbaaSqaaiaadQfaaeqaaO GaaGPaVpaabmqabaGaaCiEamaaBaaaleaacaWGPbaabeaakiaaiYca caaMe8UaaCySdaGaayjkaiaawMcaaaaa@3C51@  using logistic regression, and then non-observed potential outcomes Y z MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGzbWaaWbaaSqabeaacaWG6baaaa aa@33B7@  under treatment arm z i z i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaq=dc9fs0xc9q8qqaqFn0dXdir=xcv k9pIe9q8qqaq=dir=f0=yqaqVeLsFr0=vr0=vr0db8meaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWG6bWaa0baaSqaaiaadMgaaeaakm aaCaaameqabaqcLbwacWaGyBOmGikaaaaakiaaysW7cqGHGjsUcaaM e8UaamOEamaaBaaaleaacaWGPbaabeaaaaa@412E@ for observed treatment z i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWG6bWaaSbaaSqaaiaadMgaaeqaaa aa@33C6@  are imputed from

Y i Z ~N( s( P ^ Z * ( x i , α ^ )| θ Z ) )+ g Z ( P ^ * ( x i , α ^ ), x i | β Z ), σ 2 (2.3) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGzbWaa0baaSqaaiaadMgaaeaaca WGAbaaaOGaaGjbVJqaaiaa=5hacaaMe8UaamOtaiaaykW7daqadeqa aiaadohacaaMc8+aaeWabeaaceWGqbGbaKaadaqhaaWcbaGaamOwaa qaaiaacQcaaaGccaaMc8UaaGikaiaahIhadaWgaaWcbaGaamyAaaqa baGccaaISaGaaGjbVlqahg7agaqcaiaaiMcacaaMe8+aaqqabeaaca aMc8UaaCiUdmaaBaaaleaacaWGAbaabeaaaOGaay5bSdaacaGLOaGa ayzkaaaacaGLOaGaayzkaaGaaGjbVlabgUcaRiaaysW7caWGNbWaaS baaSqaaiaadQfaaeqaaOGaaGPaVpaabmqabaGabmiuayaajaWaaWba aSqabeaacaGGQaaaaOGaaGPaVlaaiIcacaWH4bWaaSbaaSqaaiaadM gaaeqaaOGaaGilaiaaysW7ceWHXoGbaKaacaaIPaGaaGilaiaaysW7 caWH4bWaaSbaaSqaaiaadMgaaeqaaOGaaGjbVpaaeeqabaGaaGjbVl aahk7adaWgaaWcbaGaamOwaaqabaaakiaawEa7aaGaayjkaiaawMca aiaaiYcacaaMe8Uaeq4Wdm3aaWbaaSqabeaacaaIYaaaaOGaaGzbVl aaywW7caaMf8UaaGzbVlaaywW7caGGOaGaaGOmaiaac6cacaaIZaGa aiykaaaa@8227@

where P * MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGqbWaaWbaaSqabeaacaGGQaaaaa aa@335D@  is the logit transformation of P, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGqbGaaiilaaaa@3332@   s( P ^ Z * | θ Z ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGZbGaaGjbVlaacIcadaabceqaai qadcfagaqcamaaDaaaleaacaWGAbaabaGaaiOkaaaakiaaykW7aiaa wIa7aiaaysW7caWH4oWaaSbaaSqaaiaadQfaaeqaaOGaaiykaaaa@3F3C@  denotes a penalized spline with fixed knots (Eilers and Marx, 1996) of propensity, and g Z ( P ^ * , x i | β Z ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGNbWaaSbaaSqaaiaadQfaaeqaaO GaaGPaVlaacIcaceWGqbGbaKaadaahaaWcbeqaaiaacQcaaaGccaaI SaGaaGjbVlaahIhadaWgaaWcbaGaamyAaaqabaGcdaabbeqaaiaays W7caWHYoWaaSbaaSqaaiaadQfaaeqaaaGccaGLhWoacaGGPaaaaa@4239@  is a general function of covariates including the propensity scores. The resulting estimator is doubly robust in the sense that if either P Z ( x i ,α ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGqbWaaSbaaSqaaiaadQfaaeqaaO GaaGPaVpaabmqabaGaaCiEamaaBaaaleaacaWGPbaabeaakiaaiYca caaMe8UaaCySdaGaayjkaiaawMcaaaaa@3C51@  or E( Y z )= g Z ( P ^ * , x i | β Z ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGfbGaaGPaVlaaiIcacaWGzbWaaW baaSqabeaacaWG6baaaOGaaGykaiaaysW7caaI9aGaaGjbVlaadEga daWgaaWcbaGaamOwaaqabaGccaaMc8UaaGikaiqadcfagaqcamaaCa aaleqabaGaaiOkaaaakiaaygW7caaISaGaaGjbVlaahIhadaWgaaWc baGaamyAaaqabaGcdaabbeqaaiaaysW7caWHYoWaaSbaaSqaaiaadQ faaeqaaaGccaGLhWoacaaIPaaaaa@4D7E@  is correctly specified, Y (z) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGzbWaaWbaaSqabeaacaaIOaGaam OEaiaaiMcaaaaaaa@351C@  will be approximately unbiased; see Zhang and Little (2009). This can be implemented in the non-probability setting by replacing P ^ Z ( x i ,α) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaaceWGqbGbaKaadaWgaaWcbaGaamOwaa qabaGccaaMc8UaaGikaiaahIhadaWgaaWcbaGaamyAaaqabaGccaaI SaGaaGjbVlaahg7acaaIPaaaaa@3C3C@  in the mean model for (2.3) with π ^ i A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacuaHapaCgaqcamaaDaaaleaacaWGPb aabaGaamyqaaaaaaa@355B@  estimated using (2.1) to obtain a draw of Y i (b) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGzbWaa0baaSqaaiaadMgaaeaaca aIOaGaamOyaiaaiMcaaaGccaGGUaaaaa@36AE@  (Note this requires obtaining π ^ i A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacuaHapaCgaqcamaaDaaaleaacaWGPb aabaGaamyqaaaaaaa@355B@  for the probability sample elements requiring prediction.) Inference can proceed by obtaining b=1,,B MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGIbGaaGjbVlaai2dacaaMe8UaaG ymaiaaiYcacaaMe8UaeSOjGSKaaiilaiaaysW7caWGcbaaaa@3D99@  draws from the posterior distribution of the estimated population quantity of interest, e.g., for the population mean

Y (b) = i S R N i (b) Y i (b) + i S A ( y i Y i (b) ) N MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGzbWaaWbaaSqabeaacaaIOaGaam OyaiaaiMcaaaGccaaMe8UaaGjbVlaai2dacaaMe8UaaGjbVpaalaaa baWaaabeaeaacaWGobWaa0baaSqaaiaadMgaaeaacaaIOaGaamOyai aaiMcaaaGccaWGzbWaa0baaSqaaiaadMgaaeaacaaIOaGaamOyaiaa iMcaaaaabaGaamyAaiaaykW7cqGHiiIZcaaMc8Uaam4uamaaBaaame aacaWGsbaabeaaaSqab0GaeyyeIuoakiaaysW7cqGHRaWkcaaMe8+a aabeaeaadaqadeqaaiaadMhadaWgaaWcbaGaamyAaaqabaGccaaMe8 UaeyOeI0IaaGjbVlaadMfadaqhaaWcbaGaamyAaaqaaiaaiIcacaWG IbGaaGykaaaaaOGaayjkaiaawMcaaaWcbaGaamyAaiaaykW7cqGHii IZcaaMc8Uaam4uamaaBaaameaacaWGbbaabeaaaSqab0GaeyyeIuoa aOqaaiaad6eaaaaaaa@680E@

where now N i (b) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGobWaa0baaSqaaiaadMgaaeaaca aIOaGaamOyaiaaiMcaaaaaaa@35E7@  is a estimate of the population represented by the weight d i R MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGKbWaa0baaSqaaiaadMgaaeaaca WGsbaaaaaa@3488@  obtained from a finite population Bayesian bootstrap (Little and Zheng, 2007); more complete FBPP extensions to complex sample designs that include clustering and stratification are available in Dong, Elliott and Raghunathan (2014).

As in the estimation of (2.1), the non-parametric (spline) component of (2.3) can be replaced with other machine-learning estimators; see Chapter 4 of Rafei (2021) for implementation using Gaussian processes. Also, extensions to non-normal models are direct, although not necessarily computational easy.

2.3  Poststratified estimators

Wu also describes the use of poststratified estimators in the context of quota sampling, which is not only a very old form of non-probability sampling but indeed the standard before Neyman made the case for stratified random sampling (Neyman, 1934). Wu’s Section 5 suggests a robust alternative to the propensity score estimates obtained by ordering observations in the probability sample by π ^ i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacuaHapaCgaqcamaaBaaaleaacaWGPb aabeaakiaacYcaaaa@354E@  stratifying into K MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGlbaaaa@327D@  strata based on this ordering, and computing the predicted proportion of the population belonging to the k th MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGRbWaaWbaaSqabeaacaqG0bGaae iAaaaaaaa@34AC@  stratum as proportion of the sample weights W k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGxbWaaSbaaSqaaiaadUgaaeqaaa aa@33A5@  in this stratum using the probability sample, with

μ ^ PST = k W ^ k y ¯ k (2.4) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacuaH8oqBgaqcamaaBaaaleaacaqGqb Gaae4uaiaabsfaaeqaaOGaaGjbVlaaysW7caaI9aGaaGjbVlaaysW7 daaeqbqabSqaaiaadUgaaeqaniabggHiLdGccaaMc8Uabm4vayaaja WaaSbaaSqaaiaadUgaaeqaaOGabmyEayaaraWaaSbaaSqaaiaadUga aeqaaOGaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7caGGOaGaaGOmai aac6cacaaI0aGaaiykaaaa@5165@

where y ¯ k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaaceWG5bGbaebadaWgaaWcbaGaam4Aaa qabaaaaa@33DF@  is the mean within the k th MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGRbWaaWbaaSqabeaacaqG0bGaae iAaaaaaaa@34AC@  stratum in the non-probability sample. Wu notes the tradeoff between choosing K MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGlbaaaa@327D@  to be large enough to retain homogeneity within units but small enough to obtain stable estimates of y ¯ k , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaaceWG5bGbaebadaWgaaWcbaGaam4Aaa qabaGccaGGSaaaaa@3499@  suggesting 30 as the old “rule of thumb” for “large [enough] sample sizes”. I would add that a more formal approach discussed in Little (1986) suggests a method to generate strata (there in the context of non-response adjustment) that minimizes mean square error by maximizing the between-stratum-to-within-stratum variance. It would seem such an approach would be appropriate to consider in the non-probability post-stratified estimator as well.

A more direct approach to obtain estimates using a post-stratified type estimator is multilevel regression and poststratification (Wang, Rothschild, Goel and Gelman, 2015; Downes and Carlin, 2020). Here only data from the non-probability sample is used in the outcome model:

E( Y k[i] )= β 0 + x k T β+ j a l[k] j (2.5) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGfbWaaeWabeaacaWGzbWaaSbaaS qaaiaadUgacaaMc8UaaG4waiaadMgacaaIDbaabeaaaOGaayjkaiaa wMcaaiaaysW7caaMe8UaaGypaiaaysW7caaMe8UaeqOSdi2aaSbaaS qaaiaaicdaaeqaaOGaaGjbVlabgUcaRiaaysW7caWH4bWaa0baaSqa aiaadUgaaeaacaWGubaaaOGaaCOSdiabgUcaRmaaqafabeWcbaGaam OAaaqab0GaeyyeIuoakiaaysW7caWGHbWaa0baaSqaaiaadYgacaaM c8UaaG4waiaadUgacaaIDbaabaGaamOAaaaakiaaywW7caaMf8UaaG zbVlaaywW7caaMf8UaaiikaiaaikdacaGGUaGaaGynaiaacMcaaaa@642C@

where k=1,,K MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGRbGaaGjbVlaai2dacaaMe8UaaG ymaiaaiYcacaaMe8UaeSOjGSKaaiilaiaaysW7caWGlbaaaa@3DAB@  indexes the poststratum developed from j=1,,J MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGQbGaaGjbVlaai2dacaaMe8UaaG ymaiaaiYcacaaMe8UaeSOjGSKaaiilaiaaysW7caWGkbaaaa@3DA9@  variables, a l[k] j ~N(0, σ j 2 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGHbWaa0baaSqaaiaadYgacaaMc8 UaaG4waiaadUgacaaIDbaabaGaamOAaaaakiaaysW7caaMe8ocbaGa a8NFaiaaysW7caaMe8UaamOtaiaaykW7caaIOaGaaGimaiaaiYcaca aMe8Uaeq4Wdm3aa0baaSqaaiaadQgaaeaacaaIYaaaaOGaaGykaaaa @4A92@  for l=1,, L j MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGSbGaaGjbVlaai2dacaaMe8UaaG ymaiaaiYcacaaMe8UaeSOjGSKaaiilaiaaysW7caWGmbWaaSbaaSqa aiaadQgaaeqaaaaa@3EC8@  and l[k] MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGSbGaaGPaVlaaiUfacaWGRbGaaG yxaaaa@36E5@  maps the postratum cell k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGRbaaaa@329D@  to the appropriate category l MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGSbaaaa@329E@  of variable j. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGQbGaaiOlaaaa@334E@  The poststratifed estimator is still given by (2.4) with W ^ k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaaceWGxbGbaKaadaWgaaWcbaGaam4Aaa qabaaaaa@33B5@  now replaced with known population totals W k ; MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGxbWaaSbaaSqaaiaadUgaaeqaaO Gaai4oaaaa@346E@  posterior inference is obtained though posterior draws of β 0 , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacqaHYoGydaWgaaWcbaGaaGimaaqaba GccaGGSaaaaa@34EE@   β, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWHYoGaaiilaaaa@339B@  and a l[k] j MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGHbWaa0baaSqaaiaadYgacaaMc8 UaaG4waiaadUgacaaIDbaabaGaamOAaaaaaaa@38E7@  to obtain a draw of

μ ^ PST (b) = k W k [ 1 n k ik ( β 0 (b) + x k T β (b) + j a l[k] j(b) ) ]. MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacuaH8oqBgaqcamaaDaaaleaacaqGqb Gaae4uaiaabsfaaeaacaGGOaGaamOyaiaacMcaaaGccaaMe8UaaGjb Vlaai2dacaaMe8UaaGjbVpaaqafabeWcbaGaam4Aaaqab0GaeyyeIu oakiaaykW7caWGxbWaaSbaaSqaaiaadUgaaeqaaOGaaGjbVpaadmaa baWaaSaaaeaacaaIXaaabaGaamOBamaaBaaaleaacaWGRbaabeaaaa GccaaMe8+aaabuaeqaleaacaWGPbGaaGPaVlabgIGiolaaykW7caWG RbaabeqdcqGHris5aOGaaGjbVpaabmaabaGaeqOSdi2aa0baaSqaai aaicdaaeaacaaIOaGaamOyaiaaiMcaaaGccaaMe8Uaey4kaSIaaGjb VlaahIhadaqhaaWcbaGaam4AaaqaaiaadsfaaaGccaWHYoWaaWbaaS qabeaacaaIOaGaamOyaiaaiMcaaaGccaaMe8Uaey4kaSIaaGjbVpaa qafabeWcbaGaamOAaaqab0GaeyyeIuoakiaaykW7caWGHbWaa0baaS qaaiaadYgacaaMc8UaaG4waiaadUgacaaIDbaabaGaamOAaiaaykW7 caaIOaGaamOyaiaaiMcaaaaakiaawIcacaGLPaaaaiaawUfacaGLDb aacaaMc8UaaiOlaaaa@7F53@

Though not technically doubly-robust, it has been shown to work well in some applications where J MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGkbaaaa@327C@  is large enough to capture all of the important discrepancies between the probability and non-probability sample, and the non-probability sample is sufficiently large to allow reasonably accurate estimation of a l[k] j . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGHbWaa0baaSqaaiaadYgacaaMc8 UaaG4waiaadUgacaaIDbaabaGaamOAaaaakiaac6caaaa@39A3@  In the absence of known joint distributions of a high dimensional X, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWHybGaaiilaaaa@333E@  this approach has the weakness of relying on estimated distributions, which are unstable. A possible alternative might be replace the simple y ¯ k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaaceWG5bGbaebadaWgaaWcbaGaam4Aaa qabaaaaa@33DF@  with (2.5) in Wu’s poststratified estimator (2.4), using the fact that the sampling weights d i R MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWGKbWaa0baaSqaaiaadMgaaeaaca WGsbaaaaaa@3488@  summarize the information about X MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8rrps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGaaiaadaaakeaacaWHybaaaa@328E@  in the probability sample similar to that of the propensity score for non-probability sample.


Date modified: