Estimation of response propensities and indicators of representative response using population-level information
Section 2. Population-based response propensities

2.1  General notation

We suppose that a sample survey is undertaken, where a sample s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGZbaaaa@3809@ is selected from a finite population U . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGvbGaai Olaaaa@389D@ The sizes of s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGZbaaaa@3809@ and U MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGvbaaaa@37EB@ are denoted by n MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGUbaaaa@3804@ and N , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGobGaai ilaaaa@3894@ respectively. The units in U MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGvbaaaa@37EB@ are labelled i = 1 , 2 , , N . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGPbGaey ypa0JaaGymaiaacYcacaaMe8UaaGOmaiaacYcacaaMe8UaeSOjGSKa aiilaiaaysW7caWGobGaaiOlaaaa@43DA@ The sample is assumed to be drawn by a probability sampling design p ( . ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGWbWaae WaaeaacaGGUaaacaGLOaGaayzkaaGaaiilaaaa@3AF1@ where the sample s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGZbaaaa@3809@ is selected with probability p ( s ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGWbWaae WaaeaacaWGZbaacaGLOaGaayzkaaGaaiOlaaaa@3B39@ The first order inclusion probability of unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGPbaaaa@37FF@ is denoted π i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacqaHapaCda WgaaWcbaGaamyAaaqabaaaaa@39E8@ and d i = π i 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGKbWaaS baaSqaaiaadMgaaeqaaOGaeyypa0JaeqiWda3aa0baaSqaaiaadMga aeaacqGHsislcaaIXaaaaaaa@3EA4@ is the design weight. The evaluation study is based on simple random sampling without replacement. Although large-scale national surveys may use more complex two-stage designs, many are generally planned so that all survey units have an equal inclusion probability. We also provide theoretical expressions under more general complex survey designs.

We suppose that the survey is subject to unit nonresponse. The set of responding units is denoted by r , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGYbGaai ilaaaa@38B8@ so r s U . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGYbGaey OGIWSaam4CaiabgkOimlaadwfacaGGUaaaaa@3E84@ We denote summation over the respondents, sample and population by Σ r , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacqqHJoWuda WgaaWcbaGaamOCaaqabaGccaaMb8Uaaiilaaaa@3BFC@ Σ s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacqqHJoWuda WgaaWcbaGaam4Caaqabaaaaa@39B9@ and Σ U , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacqqHJoWuda WgaaWcbaGaamyvaaqabaGccaaMb8Uaaiilaaaa@3BDF@ respectively. Let r i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGYbWaaS baaSqaaiaadMgaaeqaaaaa@3922@ be the response indicator variable so that r i = 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGYbWaaS baaSqaaiaadMgaaeqaaOGaeyypa0JaaGymaaaa@3AED@ if unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGPbaaaa@37FF@ responds and r i = 0 , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGYbWaaS baaSqaaiaadMgaaeqaaOGaeyypa0JaaGimaiaacYcaaaa@3B9C@ otherwise. Hence, r = { i s ; r i = 1 } . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGYbGaey ypa0ZaaiWaaeaacaWGPbGaeyicI4Saam4CaiaacUdacaaMe8UaaGPa VlaadkhadaWgaaWcbaGaamyAaaqabaGccqGH9aqpcaaIXaaacaGL7b GaayzFaaGaaiOlaaaa@470E@ We shall suppose that the typical target of inference is a population mean Y ¯ = N 1 U y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaaceWGzbGbae bacqGH9aqpcaWGobWaaWbaaSqabeaacqGHsislcaaIXaaaaOWaaabe aeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaaqaaiaadwfaaeqaniabgg HiLdaaaa@4089@ of a survey variable, taking value y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWG5bWaaS baaSqaaiaadMgaaeqaaaaa@3929@ for unit i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGPbGaai Olaaaa@38B1@

We suppose that the data available for estimation purposes consists first of the values { y i ; i r } MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaGadaqaai aadMhadaWgaaWcbaGaamyAaaqabaGccaGG7aGaaGjbVlaaykW7caWG PbGaeyicI4SaamOCaaGaay5Eaiaaw2haaaaa@42A4@ of the survey variable, observed only for respondents. Secondly, we suppose that information is available on the values x i = ( x 1 , i , x 2 , i , , x K , i ) T MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWH4bWaaS baaSqaaiaadMgaaeqaaOGaeyypa0ZaaeWaaeaacaWG4bWaaSbaaSqa aiaaigdacaGGSaGaaGPaVlaadMgaaeqaaOGaaiilaiaaysW7caWG4b WaaSbaaSqaaiaaikdacaGGSaGaaGPaVlaadMgaaeqaaOGaaiilaiaa ysW7cqWIMaYscaGGSaGaaGjbVlaadIhadaWgaaWcbaGaam4saiaayg W7caGGSaGaaGPaVlaadMgaaeqaaaGccaGLOaGaayzkaaWaaWbaaSqa beaacaWGubaaaaaa@5589@ of a vector of auxiliary variables X . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWHybGaai Olaaaa@38A4@ We shall usually suppose each x k , i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWG4bWaaS baaSqaaiaadUgacaaMb8UaaiilaiaaykW7caWGPbaabeaaaaa@3DDD@ is a binary indicator variable, where x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWH4bWaaS baaSqaaiaadMgaaeqaaaaa@392C@ represents one or more categorical variables, since this will be the case in the applications we consider, but our presentation allows for general x k , i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWG4bWaaS baaSqaaiaadUgacaaMb8UaaiilaiaaykW7caWGPbaabeaaaaa@3DDD@ values. We assume that values of x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWH4bWaaS baaSqaaiaadMgaaeqaaaaa@392C@ are observed for all respondents so that { y i , x i ; i r } MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaGadaqaai aadMhadaWgaaWcbaGaamyAaaqabaGccaGGSaGaaGjbVlaadIhadaWg aaWcbaGaamyAaaqabaGccaGG7aGaaGjbVlaaykW7caWGPbGaeyicI4 SaamOCaaGaay5Eaiaaw2haaaaa@4702@ is observed.

We distinguish two settings: one in which x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWH4bWaaS baaSqaaiaadMgaaeqaaaaa@392C@ is known for all sample units, i.e., for both respondents and non-respondents, and one in which x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWH4bWaaS baaSqaaiaadMgaaeqaaaaa@392C@ is known only at the aggregate level: the population total U x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aahIhadaWgaaWcbaGaamyAaaqabaaabaGaamyvaaqab0GaeyyeIuoa aaa@3BDE@ and/or the population cross-products U x i x i T . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aahIhadaWgaaWcbaGaamyAaaqabaGccaaMi8UaaCiEamaaDaaaleaa caWGPbaabaGaamivaaaaaeaacaWGvbaabeqdcqGHris5aOGaaiOlaa aa@412A@ We refer to the two types of information as sample-based auxiliary information and aggregate population-based auxiliary information. The first setting is relevant if the variables making up X MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWHybaaaa@37F2@ are available on a register. However, as outlined in the introduction, in many countries and surveys the availability of auxiliary information on non-respondents may be limited and the second setting using population-based auxiliary information may be more useful.

2.2  Definition of response propensities

The theory of propensity scores was introduced by Rosenbaum and Rubin (1983) and discussed in the context of survey nonresponse by Little (1986; 1988). Response propensities are defined as the conditional expectation of the response indicator variable r i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGYbWaaS baaSqaaiaadMgaaeqaaaaa@3922@ given the values of specified variables and survey conditions: ρ X ( x i ) = E m ( r i | x i ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacqaHbpGCda WgaaWcbaGaamiwaaqabaGcdaqadaqaaiaahIhadaWgaaWcbaGaamyA aaqabaaakiaawIcacaGLPaaacqGH9aqpcaWGfbWaaSbaaSqaaiaad2 gaaeqaaOWaaeWaaeaadaabcaqaaiaadkhadaWgaaWcbaGaamyAaaqa baGccaaMc8oacaGLiWoacaaMc8UaaCiEamaaBaaaleaacaWGPbaabe aaaOGaayjkaiaawMcaaiaacYcaaaa@4BAF@ where the vector of auxiliary variables is defined as in Section 2.1. For simplicity, we shall write ρ i = ρ X ( x i ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacqaHbpGCda WgaaWcbaGaamyAaaqabaGccqGH9aqpcqaHbpGCdaWgaaWcbaGaamiw aaqabaGcdaqadaqaaiaahIhadaWgaaWcbaGaamyAaaqabaaakiaawI cacaGLPaaaaaa@417C@ and hence denote the response propensity just by ρ i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacqaHbpGCda WgaaWcbaGaamyAaaqabaGccaGGUaaaaa@3AA7@ E m ( . ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGfbWaaS baaSqaaiaad2gaaeqaaOWaaeWaaeaacaGGUaaacaGLOaGaayzkaaaa aa@3B3E@ denotes expectation with respect to the model underlying the response mechanism. A detailed discussion of response propensities and their properties is presented in Shlomo et al. (2012). They argue that it is desirable to select auxiliary variables constituting x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWH4bWaaS baaSqaaiaadMgaaeqaaaaa@392C@ in such a way that the missing at random assumption, denoted MAR (Little and Rubin, 2002), holds as closely as possible.

2.3  Estimation of response propensities using population-level information

In the case of sample-based auxiliary information, it is possible to estimate response propensities for all sampled units by means of regression models g ( ρ i ) = x i T β , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGNbWaae WaaeaacqaHbpGCdaWgaaWcbaGaamyAaaqabaaakiaawIcacaGLPaaa cqGH9aqpcaWH4bWaa0baaSqaaiaadMgaaeaacaWGubaaaOGaaGjcVl aahk7acaGGSaaaaa@43EE@ where g ( . ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGNbWaae WaaeaacaGGUaaacaGLOaGaayzkaaaaaa@3A38@ is a link function, r i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGYbWaaS baaSqaaiaadMgaaeqaaaaa@3922@ is the dependent variable, and x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWH4bWaaS baaSqaaiaadMgaaeqaaaaa@392C@ is a vector of explanatory variables. Generally, the response propensities are modelled by generalized linear models. Shlomo et al. (2012) use a logistic link function.

In the population-based setting, it is convenient to consider the identity link function. The identity link function is a good approximation to the more widely used logistic link function when response rates are mid-range, between 30% and 70%, which is the typical response rate obtained in national and other surveys. We demonstrate this fact in the evaluation study presented in Section 4 where three ranges of response rates are investigated: low, medium and high. The identity link function also forms the basis for other representativeness indicators in the literature, such as the imbalance and distance indicators proposed by Särndal (2011) some of which are similar to the g-weights calculated in the Generalized Regression Estimators (GREG).

Under the identity link function we assume that the true response propensities satisfy the “linear probability model”

ρ i = x i T β , i U . ( 2.1 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacqaHbpGCda WgaaWcbaGaamyAaaqabaGccqGH9aqpcaWH4bWaa0baaSqaaiaadMga aeaacaWGubaaaOGaaGjcVlaahk7acaGGSaGaaGjbVlaaysW7caWGPb GaeyicI4Saamyvaiaac6cacaaMf8UaaGzbVlaaywW7caaMf8UaaGzb VlaacIcacaaIYaGaaiOlaiaaigdacaGGPaaaaa@53D8@

The linear probability model in (2.1) can be estimated by weighted least squares, where d i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGKbWaaS baaSqaaiaadMgaaeqaaaaa@3914@ is the design weight. The implied estimator of ρ i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacqaHbpGCda WgaaWcbaGaamyAaaqabaaaaa@39EB@ is given by

ρ ^ i OLS = x i T ( s d i x i x i T ) 1 s d i x i r i , i s . ( 2.2 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacuaHbpGCga qcamaaDaaaleaacaWGPbaabaGaae4taiaabYeacaqGtbaaaOGaeyyp a0JaaCiEamaaDaaaleaacaWGPbaabaGaamivaaaakmaabmaabaWaaa beaeaacaWGKbWaaSbaaSqaaiaadMgaaeqaaOGaaGjcVlaahIhadaWg aaWcbaGaamyAaaqabaGccaaMi8UaaCiEamaaDaaaleaacaWGPbaaba GaamivaaaaaeaacaWGZbaabeqdcqGHris5aaGccaGLOaGaayzkaaWa aWbaaSqabeaacqGHsislcaaIXaaaaOWaaabeaeaacaWGKbWaaSbaaS qaaiaadMgaaeqaaOGaaGjcVlaahIhadaWgaaWcbaGaamyAaaqabaGc caaMi8UaamOCamaaBaaaleaacaWGPbaabeaaaeaacaWGZbaabeqdcq GHris5aOGaaiilaiaaysW7caaMe8UaamyAaiabgIGiolaadohacaGG UaGaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7caGGOaGaaGOmaiaac6 cacaaIYaGaaiykaaaa@707A@

In the case of population-based auxiliary information, we first note that s d i x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aadsgadaWgaaWcbaGaamyAaaqabaGccaaMi8UaaCiEamaaBaaaleaa caWGPbaabeaaaeaacaWGZbaabeqdcqGHris5aaaa@3F9A@ and s d i x i x i T MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aadsgadaWgaaWcbaGaamyAaaqabaGccaaMi8UaaCiEamaaBaaaleaa caWGPbaabeaakiaayIW7caWH4bWaa0baaSqaaiaadMgaaeaacaWGub aaaaqaaiaadohaaeqaniabggHiLdaaaa@442A@ are unbiased for U x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aahIhadaWgaaWcbaGaamyAaaqabaaabaGaamyvaaqab0GaeyyeIuoa aaa@3BDE@ and U x i x i T , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aahIhadaWgaaWcbaGaamyAaaqabaGccaaMi8UaaCiEamaaDaaaleaa caWGPbaabaGaamivaaaaaeaacaWGvbaabeqdcqGHris5aOGaaiilaa aa@4128@ respectively and that in large samples we may expect that s d i x i U x i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aadsgadaWgaaWcbaGaamyAaaqabaGccaaMi8UaaCiEamaaBaaaleaa caWGPbaabeaakiabgIKi7oaaqababaGaaCiEamaaBaaaleaacaWGPb aabeaaaeaacaWGvbaabeqdcqGHris5aaWcbaGaam4Caaqab0Gaeyye Iuoaaaa@462D@ and s d i x i x i T U x i x i T . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aadsgadaWgaaWcbaGaamyAaaqabaGccaaMi8UaaCiEamaaBaaaleaa caWGPbaabeaakiaayIW7caWH4bWaa0baaSqaaiaadMgaaeaacaWGub aaaOGaeyisIS7aaabeaeaacaWH4bWaaSbaaSqaaiaadMgaaeqaaOGa aGjcVlaahIhadaqhaaWcbaGaamyAaaqaaiaadsfaaaaabaGaamyvaa qab0GaeyyeIuoaaSqaaiaadohaaeqaniabggHiLdGccaGGUaaaaa@5009@ It follows from (2.2) that, in the population-based setting, we may approximate ρ ^ i OLS MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacuaHbpGCga qcamaaDaaaleaacaWGPbaabaGaae4taiaabYeacaqGtbaaaaaa@3C73@ by

ρ ˜ i , T 1 = x i T T 1 1 r d k x k , i r ( 2.3 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacuaHbpGCga acamaaBaaaleaacaWGPbGaaiilaiaaykW7caWGubGaaGymaaqabaGc cqGH9aqpcaWH4bWaa0baaSqaaiaadMgaaeaacaWGubaaaOGaaGjcVl aahsfadaqhaaWcbaGaaGymaaqaaiabgkHiTiaaigdaaaGcdaaeqaqa aiaadsgadaWgaaWcbaGaam4AaaqabaGccaaMi8UaaCiEamaaBaaale aacaWGRbaabeaaaeaacaWGYbaabeqdcqGHris5aOGaaiilaiaaysW7 caaMe8UaamyAaiabgIGiolaadkhacaaMf8UaaGzbVlaaywW7caaMf8 UaaGzbVlaacIcacaaIYaGaaiOlaiaaiodacaGGPaaaaa@61F2@

where T 1 = U x j x j T . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWHubWaaS baaSqaaiaaigdaaeqaaOGaeyypa0ZaaabeaeaacaWH4bWaaSbaaSqa aiaadQgaaeqaaOGaaGjcVlaahIhadaqhaaWcbaGaamOAaaqaaiaads faaaaabaGaamyvaaqab0GaeyyeIuoakiaac6caaaa@4400@ We note that ρ ˜ i , T 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacuaHbpGCga acamaaBaaaleaacaWGPbGaaiilaiaaykW7caWGubGaaGymaaqabaaa aa@3DC9@ is computed only on the set of responding units.

The estimator in (2.3) requires knowledge of the population sums of squares and cross-products U x i x i T MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aahIhadaWgaaWcbaGaamyAaaqabaGccaaMi8UaaCiEamaaDaaaleaa caWGPbaabaGaamivaaaaaeaacaWGvbaabeqdcqGHris5aaaa@406E@ of the elements of x i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWH4bWaaS baaSqaaiaadMgaaeqaaOGaaiOlaaaa@39E8@ However the cross-products might be unknown. In that case, we can estimate s d i x i x i T MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aadsgadaWgaaWcbaGaamyAaaqabaGccaaMi8UaaCiEamaaBaaaleaa caWGPbaabeaakiaayIW7caWH4bWaa0baaSqaaiaadMgaaeaacaWGub aaaaqaaiaadohaaeqaniabggHiLdaaaa@442A@ in (2.2) by rewriting

s d i x i x i T = s d i ( x i x ¯ s ) ( x i x ¯ s ) T + N x ¯ s x ¯ s T , ( 2.4 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aadsgadaWgaaWcbaGaamyAaaqabaGccaaMc8UaaCiEamaaBaaaleaa caWGPbaabeaakiaaykW7caWH4bWaa0baaSqaaiaadMgaaeaacaWGub aaaaqaaiaadohaaeqaniabggHiLdGccqGH9aqpdaaeqaqaaiaadsga daWgaaWcbaGaamyAaaqabaGcdaqadaqaaiaahIhadaWgaaWcbaGaam yAaaqabaGccqGHsislceWH4bGbaebadaWgaaWcbaGaam4Caaqabaaa kiaawIcacaGLPaaaaSqaaiaadohaaeqaniabggHiLdGcdaqadaqaai aahIhadaWgaaWcbaGaamyAaaqabaGccqGHsislceWH4bGbaebadaWg aaWcbaGaam4CaaqabaaakiaawIcacaGLPaaadaahaaWcbeqaaiaads faaaGccqGHRaWkcaWGobGaaGjcVlqahIhagaqeamaaBaaaleaacaWG ZbaabeaakiaayIW7ceWH4bGbaebadaqhaaWcbaGaam4Caaqaaiaads faaaGccaGGSaGaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7caGGOaGa aGOmaiaac6cacaaI0aGaaiykaaaa@6F2D@

where x ¯ s = s d i x i / N . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaaceWH4bGbae badaWgaaWcbaGaam4CaaqabaGccqGH9aqpdaaeqaqaamaalyaabaGa amizamaaBaaaleaacaWGPbaabeaakiaahIhadaWgaaWcbaGaamyAaa qabaaakeaacaWGobaaaaWcbaGaam4Caaqab0GaeyyeIuoakiaac6ca aaa@4310@ x ¯ s MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaaceWH4bGbae badaWgaaWcbaGaam4Caaqabaaaaa@394E@ may be replaced by x ¯ U MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaaceWH4bGbae badaWgaaWcbaGaamyvaaqabaaaaa@3930@ and the covariance matrix

S x x = N 1 s d i ( x i x ¯ s ) ( x i x ¯ s ) T ( 2.5 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWHtbWaaS baaSqaaiaadIhacaWG4baabeaakiabg2da9iaad6eadaahaaWcbeqa aiabgkHiTiaaigdaaaGcdaaeqaqaaiaadsgadaWgaaWcbaGaamyAaa qabaGcdaqadaqaaiaahIhadaWgaaWcbaGaamyAaaqabaGccqGHsisl ceWH4bGbaebadaWgaaWcbaGaam4CaaqabaaakiaawIcacaGLPaaaaS qaaiaadohaaeqaniabggHiLdGcdaqadaqaaiaahIhadaWgaaWcbaGa amyAaaqabaGccqGHsislceWH4bGbaebadaWgaaWcbaGaam4Caaqaba aakiaawIcacaGLPaaadaahaaWcbeqaaiaadsfaaaGccaaMf8UaaGzb VlaaywW7caaMf8UaaGzbVlaacIcacaaIYaGaaiOlaiaaiwdacaGGPa aaaa@5CE6@

may be replaced by its estimate using the response set

S ^ x x = ( s d j r j ) 1 s d i r i ( x i x ¯ U ) ( x i x ¯ U ) T . ( 2.6 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaaceWHtbGbaK aadaWgaaWcbaGaamiEaiaadIhaaeqaaOGaeyypa0ZaaeWaaeaadaae qaqaaiaadsgadaWgaaWcbaGaamOAaaqabaGccaaMi8UaamOCamaaBa aaleaacaWGQbaabeaaaeaacaWGZbaabeqdcqGHris5aaGccaGLOaGa ayzkaaWaaWbaaSqabeaacqGHsislcaaIXaaaaOWaaabeaeaacaWGKb WaaSbaaSqaaiaadMgaaeqaaOGaaGjcVlaadkhadaWgaaWcbaGaamyA aaqabaGcdaqadaqaaiaahIhadaWgaaWcbaGaamyAaaqabaGccqGHsi slceWH4bGbaebadaWgaaWcbaGaamyvaaqabaaakiaawIcacaGLPaaa aSqaaiaadohaaeqaniabggHiLdGcdaqadaqaaiaahIhadaWgaaWcba GaamyAaaqabaGccqGHsislceWH4bGbaebadaWgaaWcbaGaamyvaaqa baaakiaawIcacaGLPaaadaahaaWcbeqaaiaadsfaaaGccaGGUaGaaG zbVlaaywW7caaMf8UaaGzbVlaaywW7caGGOaGaaGOmaiaac6cacaaI 2aGaaiykaaaa@6A5A@

We can also estimate (2.6) using propensity weighting by ρ ˜ i 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacuaHbpGCga acamaaDaaaleaacaWGPbaabaGaeyOeI0IaaGymaaaaaaa@3BA3@ to adjust for nonresponse bias in the variance of the response propensities relative to a set of X MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacaWGybaaaa@37EE@ variables.

Combining (2.3), (2.4) and (2.6), we obtain the following estimator:

ρ ˜ i , T 2 = x i T T ^ 2 1 r d k x k , i r , ( 2.7 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaacuaHbpGCga acamaaBaaaleaacaWGPbGaaiilaiaaykW7caWGubGaaGOmaaqabaGc cqGH9aqpcaWH4bWaa0baaSqaaiaadMgaaeaacaWGubaaaOGaaGjcVl qahsfagaqcamaaDaaaleaacaaIYaaabaGaeyOeI0IaaGymaaaakmaa qababaGaamizamaaBaaaleaacaWGRbaabeaakiaayIW7caWH4bWaaS baaSqaaiaadUgaaeqaaaqaaiaadkhaaeqaniabggHiLdGccaGGSaGa aGjbVlaaysW7caWGPbGaeyicI4SaamOCaiaacYcacaaMf8UaaGzbVl aaywW7caaMf8UaaGzbVlaacIcacaaIYaGaaiOlaiaaiEdacaGGPaaa aa@62B8@

where T ^ 2 = N S ^ x x + N x ¯ U x ¯ U T . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaaceWHubGbaK aadaWgaaWcbaGaaGOmaaqabaGccqGH9aqpcaWGobGaaGjcVlqahofa gaqcamaaBaaaleaacaWG4bGaamiEaaqabaGccqGHRaWkcaWGobGaaG jcVlqahIhagaqeamaaBaaaleaacaWGvbaabeaakiaayIW7ceWH4bGb aebadaqhaaWcbaGaamyvaaqaaiaadsfaaaGccaGGUaaaaa@4A2B@

We therefore distinguish between two types of aggregated population-based auxiliary information as denoted by the indices T 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpfpC0xc9LqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vq=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWexLMBb50ujb qegGuANHgiu92DLjhiuvgE0bacfaGae8hhGmPaamivamaaBaaaleaa caaIXaaabeaakiab=1biucaa@426F@ in (2.3) and T 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrpfpC0xc9LqFf0xc9 qqpeuf0xe9q8qiYRWFGCk9vq=dbbf9v8Gq0db9qqpm0dXdHqpq0=vr 0=vr0=edbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWexLMBb50ujb qegGuANHgiu92DLjhiuvgE0bacfaGae8hhGmPaamivamaaBaaaleaa caaIYaaabeaakiab=1biucaa@4270@ in (2.7):

TYPE 1
Full aggregate population-based auxiliary information: the population cross products are available, i.e., U x i x i T MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aahIhadaWgaaWcbaGaamyAaaqabaGccaaMi8UaaCiEamaaDaaaleaa caWGPbaabaGaamivaaaaaeaacaWGvbaabeqdcqGHris5aaaa@406E@ or U ( x i x ¯ U ) ( x i x ¯ U ) T , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaam aabmaabaGaaCiEamaaBaaaleaacaWGPbaabeaakiabgkHiTiqahIha gaqeamaaBaaaleaacaWGvbaabeaaaOGaayjkaiaawMcaamaabmaaba GaaCiEamaaBaaaleaacaWGPbaabeaakiabgkHiTiqahIhagaqeamaa BaaaleaacaWGvbaabeaaaOGaayjkaiaawMcaamaaCaaaleqabaGaam ivaaaakiaaygW7aSqaaiaadwfaaeqaniabggHiLdGccaGGSaaaaa@4AAA@ where x ¯ U = U x i / N ; MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaaceWH4bGbae badaWgaaWcbaGaamyvaaqabaGccqGH9aqpdaaeqaqaamaalyaabaGa aCiEamaaBaaaleaacaWGPbaabeaaaOqaaiaad6eaaaaaleaacaWGvb aabeqdcqGHris5aOGaai4oaaaa@40D4@
TYPE 2
Marginal aggregate population-based auxiliary information: only the population marginal counts are available, i.e., U x i . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbba9vcVhbbf9y8WrFj0xc9vqFj 0db9qqvqFr0dXdHiVc=bYP0xb9peeu0xXdcrpe0db9Wqpepec9ar=x fr=xfr=tmeaabaqaciGacaGaaeqabaWaaeaaeaaakeaadaaeqaqaai aahIhadaWgaaWcbaGaamyAaaqabaaabaGaamyvaaqab0GaeyyeIuoa kiaac6caaaa@3C9A@

The first type implies that we have available all two-by-two tables, e.g., age times gender, age times marital status and gender times marital status. This information might be available to a national statistical institute which has access to population registers or detailed population demographics and wishes to use population-based information to monitor data collection due to a lack of sample-based information on the sample frames. The second type is more restrictive as we have only frequency counts, e.g., age, gender, marital status, without any knowledge about the interactions. This information would be routinely available through websites of national statistical institutes and therefore can be used by marketing and other data collection agencies to monitor their data collection.


Date modified: