Comments on “Statistical inference with non-probability survey samples” – Miniaturizing data defect correlation: A versatile strategy for handling non-probability samples
Section 4. Quasi-randomization or super-population implementations

In a nutshell, the quasi-randomization approach focuses on making W I π I MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGxbWaaSbaaSqaaiaadMeaaeqaaO GaeqiWda3aaSbaaSqaaiaadMeaaeqaaaaa@3658@  a constant variable (induced by FPI I). MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGjbGaaiykaiaac6caaaa@33EE@  When our sample is genuinely selected by a probabilistic scheme by design, then π i =Pr( R i =1| x i ), MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacqaHapaCdaWgaaWcbaGaamyAaaqaba GccaaMe8Uaeyypa0JaaGjbVlGaccfacaGGYbGaaGPaVlaaiIcacaWG sbWaaSbaaSqaaiaadMgaaeqaaOGaaGjbVlabg2da9iaaysW7caaIXa GaaGjbVpaaeeqabaGaaGPaVlaahIhadaWgaaWcbaGaamyAaaqabaaa kiaawEa7aiaaiMcacaGGSaaaaa@4BD6@  for iN, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGPbGaaGjbVlabgIGiolaaysW7tC vAUfKttLearyat1nwAKfgidfgBSL2zYfgCOLhaiqGacqWFobGtcaGG Saaaaa@4291@  is a design probability, free of y i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaO Gaaiilaaaa@3493@  but it can depend on x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaa aa@33DC@  for example when x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaa aa@33DC@  includes a stratifying variable. When the design probability is unavailable, we first need to invoke a divine probability. This could be a natural one given by the finite population, such as the propensity π i = Pr I ( R I =1| A I = A i ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacqaHapaCdaWgaaWcbaGaamyAaaqaba GccaaMe8Uaeyypa0JaaGjbVlGaccfacaGGYbWaaSbaaSqaaiaadMea aeqaaOGaaGPaVlaaiIcacaWGsbWaaSbaaSqaaiaadMeaaeqaaOGaaG jbVlabg2da9iaaysW7caaIXaGaaGjbVpaaeeqabaGaaGPaVlaadgea daWgaaWcbaGaamysaaqabaaakiaawEa7aiaaysW7cqGH9aqpcaaMe8 UaamyqamaaBaaaleaacaWGPbaabeaakiaaiMcaaaa@51B9@  induced by FPI, where A i ={ y i , x i }, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGbbWaaSbaaSqaaiaadMgaaeqaaO GaaGjbVlabg2da9iaaysW7caaI7bGaamyEamaaBaaaleaacaWGPbaa beaakiaaiYcacaaMe8UaaCiEamaaBaaaleaacaWGPbaabeaakiaai2 hacaGGSaaaaa@4111@  or an imagined super-population one such as the R i s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGsbWaaSbaaSqaaiaadMgaaeqaaG qaaOGaa8xgGiaabohaaaa@3575@  being generated independently from Ber( π i ), MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaqGcbGaaeyzaiaabkhacaaMc8UaaG ikaiabec8aWnaaBaaaleaacaWGPbaabeaakiaaiMcacaGGSaaaaa@3AE4@  where π i =Pr( R i =1| A i )>0. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacqaHapaCdaWgaaWcbaGaamyAaaqaba GccaaMe8Uaeyypa0JaaGjbVlGaccfacaGGYbGaaGPaVlaaiIcacaWG sbWaaSbaaSqaaiaadMgaaeqaaOGaaGjbVlabg2da9iaaysW7caaIXa GaaGjbVpaaeeqabaGaaGPaVlaadgeadaWgaaWcbaGaamyAaaqabaaa kiaawEa7aiaaiMcacaaMe8UaeyOpa4JaaGjbVlaaicdacaGGUaaaaa@5079@  This positivity assumption is necessary if the finite population is pre-specified, or its imposition defines the finite population that can be studied. (This is a practically rather relevant consideration, such as in election polling, where the finite population may not be always pre-specified even theoretically.) Since these divine probabilities are unknown and serve as our estimand, we need to assume some device probabilities, such as via a generalized linear model π i =g( y i , x i ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacqaHapaCdaWgaaWcbaGaamyAaaqaba GccaaMe8Uaeyypa0JaaGjbVlaadEgacaaMc8UaaGikaiaadMhadaWg aaWcbaGaamyAaaqabaGccaaISaGaaGjbVlaahIhadaWgaaWcbaGaam yAaaqabaGccaaIPaaaaa@4328@  to proceed, even though we don’t really believe in any particular choice of g. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGNbGaaiOlaaaa@335F@

For our current discussion, suppose our divine probability is given by the super-population Bernoulli model. Let n R = i=1 N R i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGUbWaaSbaaSqaaiaadkfaaeqaaO GaaGjbVlabg2da9iaaysW7daaeWaqabSqaaiaadMgacaaMc8UaaGyp aiaaykW7caaIXaaabaGaamOtaaqdcqGHris5aOGaaGPaVlaadkfada WgaaWcbaGaamyAaaqabaGccaGGSaaaaa@447D@  and p ˜ (A)=Pr( n R >0| A)=1 Π iN (1 π i ), MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaaceWGWbGbaGaacaaMc8UaaGikaiaahg eacaaIPaGaaGjbVlabg2da9iaaysW7ciGGqbGaaiOCaiaaykW7caaI OaGaamOBamaaBaaaleaacaWGsbaabeaakiaaysW7cqGH+aGpcaaMe8 UaaGimaiaaysW7daabbeqaaiaaykW7caWHbbaacaGLhWoacaaIPaGa aGjbVlabg2da9iaaysW7caaIXaGaaGjbVlabgkHiTiaaysW7cqqHGo audaWgaaWcbaGaamyAaiaaykW7cqGHiiIZcaaMc8UaamOtaaqabaGc caaMc8UaaGikaiaaigdacaaMe8UaeyOeI0IaaGjbVlabec8aWnaaBa aaleaacaWGPbaabeaakiaaiMcacaGGSaaaaa@67E2@  where A={ A i ,iN}. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWHbbGaaGjbVlabg2da9iaaysW7ca aI7bGaamyqamaaBaaaleaacaWGPbaabeaakiaaiYcacaaMe8UaamyA aiaaysW7cqGHiiIZcaaMe8+exLMBb50ujbqegWuDJLgzHbYqHXgBPD MCHbhA5baceiGae8Nta4KaaGPaVlaai2hacaGGUaaaaa@4F41@  Because the R i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGsbWaaSbaaSqaaiaadMgaaeqaaa aa@33B2@  here is controlled by a divine probability, the sample size n R MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGUbWaaSbaaSqaaiaadkfaaeqaaa aa@33B7@  is no longer a design variable to be conditioned upon in our replication scheme; it is generally no longer an ancillary statistic. Nevertheless, we should condition on n R >0, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGUbWaaSbaaSqaaiaadkfaaeqaaO GaaGjbVlabg6da+iaaysW7caaIWaGaaiilaaaa@394D@  a universal requirement for constructing data-driven estimates for G ¯ . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaaceWGhbGbaebacaGGUaaaaa@3357@  Fortunately this conditioning does not create mathematical complications to the simplicity granted by the independence among π i ,iN MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacqaHapaCdaWgaaWcbaGaamyAaaqaba GccaaISaGaaGjbVlaadMgacaaMe8UaeyicI4SaaGjbVpXvP5wqonvs aeHbmv3yPrwyGmuySXwANjxyWHwEaGabciab=5eaobaa@4705@  as functions of A i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGbbWaaSbaaSqaaiaadMgaaeqaaO GaaiOlaaaa@345D@  This is because π ˜ i (A)Pr( R i =1| A, n R >0)= π i / p ˜ (A) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacuaHapaCgaacamaaBaaaleaacaWGPb aabeaakiaaykW7caaIOaGaaCyqaiaaiMcacaaMe8UaeyyyIORaaGjb VlGaccfacaGGYbGaaGPaVlaaiIcacaWGsbWaaSbaaSqaaiaadMgaae qaaOGaaGjbVlabg2da9iaaysW7caaIXaGaaGjbVpaaeeqabaGaaGPa VlaahgeaaiaawEa7aiaaiYcacaaMe8UaamOBamaaBaaaleaacaWGsb aabeaakiaaysW7cqGH+aGpcaaMe8UaaGimaiaaiMcacaaMe8Uaeyyp a0JaaGjbVpaalyaabaGaeqiWda3aaSbaaSqaaiaadMgaaeqaaOGaaG PaVdqaaiaaykW7ceWGWbGbaGaacaaMc8UaaGikaiaahgeacaaIPaaa aiaacYcaaaa@6711@  but the normalizing constant p ˜ (A) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaaceWGWbGbaGaacaaMc8UaaGikaiaahg eacaaIPaaaaa@367F@  ‒ which depends on the entire A MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWHbbaaaa@328A@ ‒ is not relevant for the developments in this article, such as assigning weights that are proportional to π ˜ i 1 (A). MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacuaHapaCgaacamaaDaaaleaacaWGPb aabaGaeyOeI0IaaGymaaaakiaaiIcacaWHbbGaaGykaiaac6caaaa@393B@

Consequently, under this divine probability, which corresponds to (the true model for) the q MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGXbaaaa@32B7@  -model setting in Wu (2022), we have for any chosen W I , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGxbWaaSbaaSqaaiaadMeaaeqaaO Gaaiilaaaa@3451@  by (3.1)

E( c R ˜ ,z | A, n R >0) = Cov I ( W I E[ R I | A, n R >0], y I m( x I )) = p ˜ 1 (A) Cov I ( W I π I , y I m( x I )),(4.1) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qrpq0xc9fs0xc9q8qqaqFn0dXdir=xcv k9pIe9q8qqaq=dir=f0=yqaqVeLsFr0=vr0=vr0db8meaabaqaciGa caGaaeqabaGabiWadaaakeaafaqaaeGacaaabaGaaeyraiaaykW7ca aIOaGaam4yamaaBaaaleaaceWGsbGbaGaacaGGSaGaaGjbVlaadQha aeqaaOGaaGPaVpaaeeqabaGaaGPaVlaahgeaaiaawEa7aiaaiYcaca aMe8UaamOBamaaBaaaleaacaWGsbaabeaakiaaysW7cqGH+aGpcaaM e8UaaGimaiaaiMcaaeaacaaI9aGaaGjbVlaaboeacaqGVbGaaeODam aaBaaaleaacaWGjbaabeaakiaaykW7caaIOaGaam4vamaaBaaaleaa caWGjbaabeaakiaabweacaaMc8UaaG4waiaadkfadaWgaaWcbaGaam ysaaqabaGccaaMc8+aaqqabeaacaaMc8UaaCyqaaGaay5bSdGaaGil aiaaysW7caWGUbWaaSbaaSqaaiaadkfaaeqaaOGaaGjbVlabg6da+i aaysW7caaIWaGaaGyxaiaaiYcacaaMe8UaamyEamaaBaaaleaacaWG jbaabeaakiaaysW7cqGHsislcaaMe8UaamyBaiaaykW7caaIOaGaaC iEamaaBaaaleaacaWGjbaabeaakiaaiMcacaaIPaaabaaabaGaeyyp a0JaaGjbVlqadchagaacamaaCaaaleqabaGaeyOeI0IaaGymaaaaki aaiIcacaWHbbGaaGykaiaaysW7caqGdbGaae4BaiaabAhadaWgaaWc baGaamysaaqabaGccaaMc8UaaGikaiaadEfadaWgaaWcbaGaamysaa qabaGccqaHapaCdaWgaaWcbaGaamysaaqabaGccaaISaGaaGjbVlaa dMhadaWgaaWcbaGaamysaaqabaGccaaMe8UaeyOeI0IaaGjbVlaad2 gacaaMc8UaaGikaiaahIhadaWgaaWcbaGaamysaaqabaGccaaIPaGa aGykaiaaiYcacaaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7ca GGOaGaaGinaiaac6cacaaIXaGaaiykaaaaaaa@A95D@

where E MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaqGfbaaaa@3289@  is with respect to the (unknown) divine probability over R I MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGsbWaaSbaaSqaaiaadMeaaeqaaa aa@3392@  (for fixed I). MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGjbGaaiykaiaac6caaaa@33EE@  It follows then that, regardless of whether we want to ensure zero expectation in (3.2) or in (4.1), we will impose W I π I 1, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGxbWaaSbaaSqaaiaadMeaaeqaaO GaeqiWda3aaSbaaSqaaiaadMeaaeqaaOGaaGjbVlabg2Hi1kaaysW7 caaIXaGaaiilaaaa@3C67@  that is, W I π I 1 , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGxbWaaSbaaSqaaiaadMeaaeqaaO GaaGjbVlabg2Hi1kaaysW7cqaHapaCdaqhaaWcbaGaamysaaqaaiab gkHiTiaaigdaaaGccaGGSaaaaa@3D55@  the well-known inverse probability weighting. Therefore, if our postulated model q MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGXbaaaa@32B7@  permits us to reliably capture π i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacqaHapaCdaWgaaWcbaGaamyAaaqaba aaaa@3498@  in reality, then c R ˜ ,z = O p ( N 1/2 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGJbWaaSbaaSqaaiqadkfagaacai aacYcacaaMc8UaamOEaaqabaGccaaMe8Uaeyypa0JaaGjbVlaad+ea daWgaaWcbaGaamiCaaqabaGccaaMc8UaaGikaiaad6eadaahaaWcbe qaaiabgkHiTmaalyaabaGaaGymaaqaaiaaikdaaaaaaOGaaGykaaaa @4392@  because it has mean zero (with respect to the divine probability), and it is a weighted average of N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGobaaaa@3294@  essentially independent Bernoulli variables, as seen in (3.1).

This is a randomization oriented approach because it treats the entire finite population attribute values A MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWHbbaaaa@328B@  as fixed, and the hypothetical replications are generated only by repeated realizations of the recording indicator R I . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGsbWaaSbaaSqaaiaadMeaaeqaaO GaaiOlaaaa@344E@  Of course, in general, the values of { π i ,iN} MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaaI7bGaeqiWda3aaSbaaSqaaiaadM gaaeqaaOGaaGilaiaaysW7caWGPbGaaGjbVlabgIGiolaaysW7tCvA UfKttLearyat1nwAKfgidfgBSL2zYfgCOLhaiqGacqWFobGtcaaMc8 UaaGyFaaaa@4A9C@  are unknown, and worse they are inestimable from a non-probability sample without further assumptions. To proceed, we pose assumptions such as missing at random, i.e., Pr( R i =1| A i )=Pr( R i =1| x i ), MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaaciGGqbGaaiOCaiaaykW7caaIOaGaam OuamaaBaaaleaacaWGPbaabeaakiaaysW7cqGH9aqpcaaMe8UaaGym aiaaysW7daabbeqaaiaaykW7caWGbbWaaSbaaSqaaiaadMgaaeqaaa GccaGLhWoacaaIPaGaaGjbVlabg2da9iaaysW7ciGGqbGaaiOCaiaa ykW7caaIOaGaamOuamaaBaaaleaacaWGPbaabeaakiaaysW7cqGH9a qpcaaMe8UaaGymaiaaysW7daabbeqaaiaaykW7caWH4bWaaSbaaSqa aiaadMgaaeqaaaGccaGLhWoacaaIPaGaaiilaaaa@5B1E@  and the requirement of an auxiliary sample so that we have some values of x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaa aa@33DC@  with R i =0. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGsbWaaSbaaSqaaiaadMgaaeqaaO GaaGjbVlabg2da9iaaysW7caaIWaGaaiOlaaaa@3948@  We also have choices on how to estimate the inclusion propensity π i =Pr( R i =1| x i ), MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacqaHapaCdaWgaaWcbaGaamyAaaqaba GccaaMe8Uaeyypa0JaaGjbVlGaccfacaGGYbGaaGPaVlaaiIcacaWG sbWaaSbaaSqaaiaadMgaaeqaaOGaaGjbVlaai2dacaaMe8UaaGymai aaysW7daabbeqaaiaaykW7caWH4bWaaSbaaSqaaiaadMgaaeqaaaGc caGLhWoacaaIPaGaaiilaaaa@4B97@  parametrically or non-parametrically. These assumptions, requirements, and estimation methods are all essential for practical implementation, as carefully reviewed and discussed by Wu (2022); also see Tan (2010) for a detailed comparison of various estimation strategies. Nevertheless, the overarching idea of quasi-randomization methods is to choose W I MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGxbWaaSbaaSqaaiaadMeaaeqaaa aa@3397@  to free R ˜ I = W I R I MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaaceWGsbGbaGaadaWgaaWcbaGaamysaa qabaGccaaMe8Uaeyypa0JaaGjbVlaadEfadaWgaaWcbaGaamysaaqa baGccaWGsbWaaSbaaSqaaiaadMeaaeqaaaaa@3B7C@  from I MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGjbaaaa@328F@  in expectation over the posited hypothetical replications, to regain the freedom guaranteed by probability sampling.

Complementarily, the super-population approaches aim to miniaturize c R ˜ ,z MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGJbWaaSbaaSqaaiqadkfagaacai aacYcacaaMe8UaamOEaaqabaaaaa@36F7@  via making the other variable in c R ˜ ,z , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGJbWaaSbaaSqaaiqadkfagaacai aacYcacaaMe8UaamOEaaqabaGccaGGSaaaaa@37B1@  that is, z I MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWG6bWaaSbaaSqaaiaadMeaaeqaaa aa@33BA@  free of I MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGjbaaaa@328F@  in expectation, but over a different hypothetical replication scheme. Here the idea is to choose an m( x i ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGTbGaaGPaVlaaiIcacaWH4bWaaS baaSqaaiaadMgaaeqaaOGaaGykaaaa@37C8@  that is a good approximation to y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaa aa@33D9@  such that the residual z i = y i m( x i ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWG6bWaaSbaaSqaaiaadMgaaeqaaO GaaGjbVlabg2da9iaaysW7caWG5bWaaSbaaSqaaiaadMgaaeqaaOGa aGjbVlabgkHiTiaaysW7caWGTbGaaGPaVlaaiIcacaWH4bWaaSbaaS qaaiaadMgaaeqaaOGaaGykaaaa@4434@  will be zero in expectation conditioning on x. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWH4bGaaiOlaaaa@3374@  Typically, this is done by considering a joint model for { R i , y i } MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaaI7bGaamOuamaaBaaaleaacaWGPb aabeaakiaaiYcacaaMe8UaamyEamaaBaaaleaacaWGPbaabeaakiaa i2haaaa@3A2D@  given x i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaO Gaaiilaaaa@3496@  and with a specific regression model ξ(y| x), MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacqaH+oaEcaaMc8UaaGikaiaadMhaca aMc8+aaqqabeaacaaMc8UaaCiEaaGaay5bSdGaaGykaiaacYcaaaa@3DCE@  using the notation in Wu (2022). It is important to recognize that, although we only specify the regression model y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaa aa@33D9@  given x i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaO Gaaiilaaaa@3496@  we must include R i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGsbWaaSbaaSqaaiaadMgaaeqaaa aa@33B2@  in the replications in order to capture the possible dependence of R i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGsbWaaSbaaSqaaiaadMgaaeqaaa aa@33B2@  on the entire A i ={ y i , x i }, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGbbWaaSbaaSqaaiaadMgaaeqaaO GaaGjbVlabg2da9iaaysW7caaI7bGaamyEamaaBaaaleaacaWGPbaa beaakiaaiYcacaaMe8UaaCiEamaaBaaaleaacaWGPbaabeaakiaai2 hacaGGSaaaaa@4111@  which is the key concern for non-probability samples. Indeed, it is this joint specification that permits the adoption of the missing at random assumption to reduce P( y i | x i , R i )=P( y i | x i ), MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGqbGaaGPaVlaaiIcacaWG5bWaaS baaSqaaiaadMgaaeqaaOGaaGjbVpaaeeqabaGaaGPaVlaahIhadaWg aaWcbaGaamyAaaqabaaakiaawEa7aiaaiYcacaaMe8UaamOuamaaBa aaleaacaWGPbaabeaakiaaiMcacaaMe8Uaeyypa0JaaGjbVlaadcfa caaMc8UaaGikaiaadMhadaWgaaWcbaGaamyAaaqabaGccaaMe8+aaq qabeaacaaMc8UaaCiEamaaBaaaleaacaWGPbaabeaaaOGaay5bSdGa aGykaiaacYcaaaa@5441@  which in turn permits us to focus on specifying a single regression model ξ( y i | x i ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacqaH+oaEcaaMc8UaaGikaiaadMhada WgaaWcbaGaamyAaaqabaGccaaMe8+aaqqabeaacaaMc8UaaCiEamaa BaaaleaacaWGPbaabeaaaOGaay5bSdGaaGykaaaa@3F68@  for both observed and unobserved individuals. Therefore, when we write E ξ , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaqGfbWaaSbaaSqaaiabe67a4bqaba GccaGGSaaaaa@3532@  we mean the expectation with respect to 

P( R i , y i | x i )=P( R i | x i )P( y i | R i , x i )= π i R i (1 π i ) 1 R i ξ( y i | x i ),(4.2) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGqbGaaGPaVlaaiIcacaWGsbWaaS baaSqaaiaadMgaaeqaaOGaaGilaiaaysW7caWG5bWaaSbaaSqaaiaa dMgaaeqaaOGaaGjbVpaaeeqabaGaaGPaVlaahIhadaWgaaWcbaGaam yAaaqabaaakiaawEa7aiaaiMcacaaMe8UaaGjbVlabg2da9iaaysW7 caaMe8UaamiuaiaaykW7caaIOaGaamOuamaaBaaaleaacaWGPbaabe aakiaaykW7daabbeqaaiaaykW7caWH4bWaaSbaaSqaaiaadMgaaeqa aaGccaGLhWoacaaIPaGaaGjbVlaadcfacaaMc8UaaGikaiaadMhada WgaaWcbaGaamyAaaqabaGccaaMc8+aaqqabeaacaaMc8UaamOuamaa BaaaleaacaWGPbaabeaaaOGaay5bSdGaaGilaiaaysW7caWH4bWaaS baaSqaaiaadMgaaeqaaOGaaGykaiaaysW7caaMe8Uaeyypa0JaaGjb VlaaysW7cqaHapaCdaqhaaWcbaGaamyAaaqaaiaadkfadaWgaaadba GaamyAaaqabaaaaOGaaGjbVlaaiIcacaaIXaGaaGjbVlabgkHiTiaa ysW7cqaHapaCdaWgaaWcbaGaamyAaaqabaGccaaIPaWaaWbaaSqabe aacaaIXaGaaGjbVlabgkHiTiaaysW7caWGsbWaaSbaaWqaaiaadMga aeqaaaaakiabe67a4jaaykW7caaIOaGaamyEamaaBaaaleaacaWGPb aabeaakiaaysW7daabbeqaaiaaykW7caWH4bWaaSbaaSqaaiaadMga aeqaaaGccaGLhWoacaaIPaGaaGilaiaaywW7caaMf8UaaGzbVlaayw W7caGGOaGaaGinaiaac6cacaaIYaGaaiykaaaa@9EAB@

where π i =Pr( R i =1| x i ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacqaHapaCdaWgaaWcbaGaamyAaaqaba GccaaMe8Uaeyypa0JaaGjbVlGaccfacaGGYbGaaGPaVlaaiIcacaWG sbWaaSbaaSqaaiaadMgaaeqaaOGaaGjbVlabg2da9iaaysW7caaIXa GaaGjbVpaaeeqabaGaaGPaVlaahIhadaWgaaWcbaGaamyAaaqabaaa kiaawEa7aiaaiMcaaaa@4B26@  is left unspecified, unlike with the quasi-randomization approach.

It follows then that, conditioning on X={ x i ,iN} MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWHybGaaGjbVlabg2da9iaaysW7ca aI7bGaaCiEamaaBaaaleaacaWGPbaabeaakiaaiYcacaaMe8UaamyA aiaaysW7cqGHiiIZcaaMe8+exLMBb50ujbqegWuDJLgzHbYqHXgBPD MCHbhA5baceiGae8Nta4KaaGPaVlaai2haaaa@4EE1@  and n R >0, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGUbWaaSbaaSqaaiaadkfaaeqaaO GaaGjbVlabg6da+iaaysW7caaIWaGaaiilaaaa@394D@  which does not alter P(y| X) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGqbGaaGPaVlaaiIcacaWG5bGaaG jbVpaaeeqabaGaaGPaVlaahIfaaiaawEa7aiaaiMcaaaa@3C12@  because y MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWG5baaaa@32BF@  and R MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGsbaaaa@3298@  are independent given X, MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWHybGaaiilaaaa@3352@  we have

E( c R ˜ ,z | X, n R >0)= [ p ˜ (X)] 1 Cov I ( W I π I ,E[ y I | x I ]m( x I )).(4.3) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaqGfbGaaGPaVlaaiIcacaWGJbWaaS baaSqaaiqadkfagaacaiaacYcacaaMc8UaamOEaaqabaGccaaMc8+a aqqabeaacaaMc8UaaCiwaaGaay5bSdGaaGilaiaaysW7caWGUbWaaS baaSqaaiaadkfaaeqaaOGaaGjbVlabg6da+iaaysW7caaIWaGaaGyk aiaaysW7caaMe8Uaeyypa0JaaGjbVlaaysW7caaIBbGabmiCayaaia GaaGPaVlaaiIcacaWHybGaaGykaiaai2fadaahaaWcbeqaaiabgkHi TiaaigdaaaGccaaMc8Uaae4qaiaab+gacaqG2bWaaSbaaSqaaiaadM eaaeqaaOGaaGPaVlaaiIcacaWGxbWaaSbaaSqaaiaadMeaaeqaaOGa eqiWda3aaSbaaSqaaiaadMeaaeqaaOGaaGilaiaaysW7caqGfbGaaG PaVlaaiUfacaWG5bWaaSbaaSqaaiaadMeaaeqaaOGaaGPaVpaaeeqa baGaaGPaVlaahIhadaWgaaWcbaGaamysaaqabaaakiaawEa7aiaai2 facaaMe8UaeyOeI0IaaGjbVlaad2gacaaMc8UaaGikaiaahIhadaWg aaWcbaGaamysaaqabaGccaaIPaGaaGykaiaai6cacaaMf8UaaGzbVl aaywW7caaMf8UaaiikaiaaisdacaGGUaGaaG4maiaacMcaaaa@8964@

Clearly, (4.3) becomes zero when we choose m( x I )= E ξ [ y I | x I ] MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGTbGaaGPaVlaaiIcacaWH4bWaaS baaSqaaiaadMeaaeqaaOGaaGykaiaaysW7cqGH9aqpcaaMe8Uaaeyr amaaBaaaleaacqaH+oaEaeqaaOGaaGPaVlaaiUfacaWG5bWaaSbaaS qaaiaadMeaaeqaaOGaaGPaVpaaeeqabaGaaGPaVlaahIhadaWgaaWc baGaamysaaqabaaakiaawEa7aiaai2faaaa@4A92@  and that the ξ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacqaH+oaEaaa@3384@  model is (first-order) correctly specified, that is, E ξ [ y I | x I ]=E[ y I | x I ]. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaqGfbWaaSbaaSqaaiabe67a4bqaba GccaaMc8UaaG4waiaadMhadaWgaaWcbaGaamysaaqabaGccaaMc8+a aqqabeaacaaMc8UaaCiEamaaBaaaleaacaWGjbaabeaaaOGaay5bSd GaaGyxaiaaysW7cqGH9aqpcaaMe8UaaeyraiaaykW7caaIBbGaamyE amaaBaaaleaacaWGjbaabeaakiaaykW7daabbeqaaiaaykW7caWH4b WaaSbaaSqaaiaadMeaaeqaaaGccaGLhWoacaaIDbGaaiOlaaaa@522E@  This summarizes the super-population approach, and it renders c R ˜ ,z = O p ( N 1/2 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaebbnrfifHhDYfgasaacH8srps0l bbf9q8WrFfeuY=Hhbbf9v8qqaqFr0xc9pk0xbba9q8WqFfea0=yr0R Yxir=Jbba9q8aq0=yq=He9q8qqQ8frFve9Fve9Ff0dmeaabaqaciGa caGaaeqabaGabiWadaaakeaacaWGJbWaaSbaaSqaaiqadkfagaacai aacYcacaaMc8UaamOEaaqabaGccaaMe8Uaeyypa0JaaGjbVlaad+ea daWgaaWcbaGaamiCaaqabaGccaaMc8UaaGikaiaad6eadaahaaWcbe qaaiabgkHiTmaalyaabaGaaGymaaqaaiaaikdaaaaaaOGaaGykaaaa @4392@  for similar reasons as given for the quasi-randomization framework.


Date modified: