Multiple-frame surveys for a multiple-data-source world
Section 3. Estimation in classical multiple-frame surveys

The main problem for inference in a classical multiple-frame survey  MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaacbaqcLbwaqa aaaaaaaaWdbiaa=nbiaaa@37A3@  one that is designed so as to satisfy Assumptions (A1) to (A6) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaacbaqcLbwaqa aaaaaaaaWdbiaa=nbiaaa@37A3@  is how to account for potential overlap among the samples. In the NSAF, telephone households were screened out of the area sample, but in many applications screening is infeasible or it is more cost-effective to obtain data from the full sample selected from each frame. When separate surveys or data sources are not designed with data combination in mind, the overlap depends on the coverage of the individual data sources.

With an overlap design, units that are contained in more than one frame have multiple chances for being selected in the sample. An estimator constructed by summing the weighted observations from each of the Q MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyuaaaa@38EC@ samples,

Y ^ concat = q = 1 Q i S q w i ( q ) y i , MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja WaaSbaaSqaaiaabogacaqGVbGaaeOBaiaabogacaqGHbGaaeiDaaqa baGccaaMe8UaaGypaiaaysW7daaeWbqaamaaqafabaGaaGPaVlaadE hadaqhaaWcbaGaamyAaaqaaiaaiIcacaWGXbGaaGykaaaakiaadMha daWgaaWcbaGaamyAaaqabaaabaGaamyAaiaaykW7cqGHiiIZcaaMc8 Uaam4uamaaBaaameaacaWGXbaabeaaaSqab0GaeyyeIuoaaSqaaiaa dghacaaI9aGaaGymaaqaaiaadgfaa0GaeyyeIuoakiaaiYcaaaa@5AB9@

will be a biased estimator of Y = i = 1 N y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamywaiaays W7caaI9aGaaGjbVpaaqadabaGaaGPaVlaadMhadaWgaaWcbaGaamyA aaqabaaabaGaamyAaiaai2dacaaIXaaabaGaamOtaaqdcqGHris5aa aa@45B2@ because the individual sample weights do not reflect the multiple chances of selection for units in overlap domains. Methods for estimating population totals thus typically multiply the survey weights w i ( q ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4DamaaDa aaleaacaWGPbaabaGaaGikaiaadghacaaIPaaaaaaa@3C88@ by a multiplicity adjustment m i ( q ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyBamaaDa aaleaacaWGPbaabaGaaGikaiaadghacaaIPaaaaaaa@3C7E@ that satisfies q = 1 Q δ i ( q ) m i ( q ) 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaabmaeaaca aMc8UaeqiTdq2aa0baaSqaaiaadMgaaeaacaaIOaGaamyCaiaaiMca aaGccaWGTbWaa0baaSqaaiaadMgaaeaacaaIOaGaamyCaiaaiMcaaa GccaaMe8UaeyisISRaaGjbVlaaigdaaSqaaiaadghacaaI9aGaaGym aaqaaiaadgfaa0GaeyyeIuoaaaa@4E0E@ for each unit i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyAaiaacY caaaa@39B4@ resulting in the estimator

Y ^ = q = 1 Q i S q w i ( q ) m i ( q ) y i = q = 1 Q i S q w ˜ i ( q ) y i , ( 3.1 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0dc9fr=xfr=x frpeWZqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaiqadMfagaqcai aaysW7caaI9aGaaGjbVpaaqahabaWaaabuaeaacaaMc8Uaam4Damaa DaaaleaacaWGPbaabaGaaGikaiaadghacaaIPaaaaOGaamyBamaaDa aaleaacaWGPbaabaGaaGikaiaadghacaaIPaaaaOGaamyEamaaBaaa leaacaWGPbaabeaaaeaacaWGPbGaaGPaVlabgIGiolaadofadaWgaa adbaGaamyCaaqabaaaleqaniabggHiLdaaleaacaWGXbGaaGypaiaa igdaaeaacaWGrbaaniabggHiLdGccqGH9aqpcaaMe8+aaabCaeaada aeqbqaaiaaykW7ceWG3bGbaGaadaqhaaWcbaGaamyAaaqaaiaaiIca caWGXbGaaGykaaaakiaadMhadaWgaaWcbaGaamyAaaqabaaabaGaam yAaiaaykW7cqGHiiIZcaWGtbWaaSbaaWqaaiaadghaaeqaaaWcbeqd cqGHris5aaWcbaGaamyCaiaai2dacaaIXaaabaGaamyuaaqdcqGHri s5aOGaaGilaiaaywW7caaMf8UaaGzbVlaaywW7caaMf8Uaaiikaiaa iodacaGGUaGaaGymaiaacMcaaaa@7B08@

where w ˜ i ( q ) = w i ( q ) m i ( q ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabm4Dayaaia Waa0baaSqaaiaadMgaaeaacaaIOaGaamyCaiaaiMcaaaGccaaMe8Ua aGypaiaaysW7caWG3bWaa0baaSqaaiaadMgaaeaacaaIOaGaamyCai aaiMcaaaGccaWGTbWaa0baaSqaaiaadMgaaeaacaaIOaGaamyCaiaa iMcaaaaaaa@4966@ is the multiplicity-adjusted weight.

3.1   Hartley’s composite estimator

Hartley (1962) was the first author to present a rigorous theory of estimation in dual-frame surveys where units in the overlap domain {1, 2} might be sampled from both frames. This four-page paper made several important contributions. First, Hartley defined the problem in statistical terms. Second, he proposed an optimal estimator for combining the estimates from the two surveys. And third, he studied the design problem of allocating the resources to the different samples, with a joint consideration of the allocation and the estimator that minimize the variance of the estimated population total subject to a fixed cost.

Hartley (1962) estimated the population total Y = i = 1 N y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamywaiaays W7caaI9aGaaGjbVpaaqadabaGaaGPaVlaadMhadaWgaaWcbaGaamyA aaqabaaabaGaamyAaiaai2dacaaIXaaabaGaamOtaaqdcqGHris5aa aa@45B2@ by

Y ^ ( θ ) = Y ^ { 1 } ( 1 ) + Y ^ { 2 } ( 2 ) + θ Y ^ { 1, 2 } ( 1 ) + ( 1 θ ) Y ^ { 1,2 } ( 2 ) . ( 3.2 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja GaaGikaiabeI7aXjaaiMcacaaMe8UaaGypaiaaysW7ceWGzbGbaKaa daqhaaWcbaGaaG4EaiaaigdacaaI9baabaGaaGikaiaaigdacaaIPa aaaOGaaGjbVlabgUcaRiaaysW7ceWGzbGbaKaadaqhaaWcbaGaaG4E aiaaikdacaaI9baabaGaaGikaiaaikdacaaIPaaaaOGaaGjbVlabgU caRiaaysW7cqaH4oqCceWGzbGbaKaadaqhaaWcbaGaaG4Eaiaaigda caaISaGaaGjbVlaaikdacaaI9baabaGaaGikaiaaigdacaaIPaaaaO GaaGjbVlabgUcaRiaaysW7caaIOaGaaGymaiaaysW7cqGHsislcaaM e8UaeqiUdeNaaGykaiaaysW7ceWGzbGbaKaadaqhaaWcbaGaaG4Eai aaigdacaaISaGaaGOmaiaai2haaeaacaaIOaGaaGOmaiaaiMcaaaGc caaIUaGaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7caGGOaGaaG4mai aac6cacaaIYaGaaiykaaaa@7FB9@

He proposed choosing θ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUdehaaa@39CC@ to minimize V [ Y ^ ( θ ) ] . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOvaiaayk W7daWadeqaaiqadMfagaqcaiaaiIcacqaH4oqCcaaIPaaacaGLBbGa ayzxaaGaaiOlaaaa@412A@ This resulted in the value

θ H = V ( Y ^ { 1, 2 } ( 2 ) ) + Cov ( Y ^ { 2 } ( 2 ) , Y ^ { 1, 2 } ( 2 ) ) Cov ( Y ^ { 1 } ( 1 ) , Y ^ { 1, 2 } ( 1 ) ) V ( Y ^ { 1, 2 } ( 1 ) ) + V ( Y ^ { 1, 2 } ( 2 ) ) . ( 3.3 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUde3aaS baaSqaaiaadIeaaeqaaOGaaGjbVlaai2dacaaMe8+aaSaaaeaacaWG wbGaaGPaVpaabmaabaGabmywayaajaWaa0baaSqaaiaaiUhacaaIXa GaaGilaiaaysW7caaIYaGaaGyFaaqaaiaaiIcacaaIYaGaaGykaaaa aOGaayjkaiaawMcaaiaaysW7cqGHRaWkcaaMe8Uaae4qaiaab+gaca qG2bGaaGPaVpaabmaabaGabmywayaajaWaa0baaSqaaiaaiUhacaaI YaGaaGyFaaqaaiaaiIcacaaIYaGaaGykaaaakiaaiYcaceWGzbGbaK aadaqhaaWcbaGaaG4EaiaaigdacaaISaGaaGjbVlaaikdacaaI9baa baGaaGikaiaaikdacaaIPaaaaaGccaGLOaGaayzkaaGaaGjbVlabgk HiTiaaysW7caqGdbGaae4BaiaabAhacaaMc8+aaeWaaeaaceWGzbGb aKaadaqhaaWcbaGaaG4EaiaaigdacaaI9baabaGaaGikaiaaigdaca aIPaaaaOGaaGilaiaaysW7ceWGzbGbaKaadaqhaaWcbaGaaG4Eaiaa igdacaaISaGaaGjbVlaaikdacaaI9baabaGaaGikaiaaigdacaaIPa aaaaGccaGLOaGaayzkaaaabaGaamOvamaabmaabaGabmywayaajaWa a0baaSqaaiaaiUhacaaIXaGaaGilaiaaysW7caaIYaGaaGyFaaqaai aaiIcacaaIXaGaaGykaaaaaOGaayjkaiaawMcaaiaaysW7cqGHRaWk caaMe8UaamOvamaabmaabaGabmywayaajaWaa0baaSqaaiaaiUhaca aIXaGaaGilaiaaysW7caaIYaGaaGyFaaqaaiaaiIcacaaIYaGaaGyk aaaaaOGaayjkaiaawMcaaaaacaaIUaGaaGzbVlaaywW7caaMf8UaaG zbVlaaywW7caGGOaGaaG4maiaac6cacaaIZaGaaiykaaaa@A74F@

The estimator in (3.2) is of the form in (3.1) with multiplicity weight adjustments

m i ( 1 ) = δ i ( { 1 } ) + δ i ( { 1,2 } ) θ , m i ( 2 ) = δ i ( { 2 } ) + δ i ( { 1, 2 } ) ( 1 θ ) . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyBamaaDa aaleaacaWGPbaabaGaaGikaiaaigdacaaIPaaaaOGaaGjbVlaai2da caaMe8UaeqiTdq2aaSbaaSqaaiaadMgaaeqaaOGaaGPaVpaabmqaba GaaG4EaiaaigdacaaI9baacaGLOaGaayzkaaGaaGjbVlabgUcaRiaa ysW7cqaH0oazdaWgaaWcbaGaamyAaaqabaGccaaMc8+aaeWabeaaca aI7bGaaGymaiaaiYcacaaIYaGaaGyFaaGaayjkaiaawMcaaiaaysW7 cqaH4oqCcaaISaGaaGjbVlaaysW7caaMc8UaaGjcVlaad2gadaqhaa WcbaGaamyAaaqaaiaaiIcacaaIYaGaaGykaaaakiaaysW7caaI9aGa aGjbVlabes7aKnaaBaaaleaacaWGPbaabeaakiaaykW7daqadeqaai aaiUhacaaIYaGaaGyFaaGaayjkaiaawMcaaiaaysW7cqGHRaWkcaaM e8UaeqiTdq2aaSbaaSqaaiaadMgaaeqaaOGaaGPaVpaabmqabaGaaG 4EaiaaigdacaaISaGaaGjbVlaaikdacaaI9baacaGLOaGaayzkaaGa aGjbVpaabmqabaGaaGymaiaaysW7cqGHsislcaaMe8UaeqiUdehaca GLOaGaayzkaaGaaGOlaaaa@8BBA@

If it is desired to use the optimal compositing factor θ H , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUde3aaS baaSqaaiaadIeaaeqaaOGaaiilaaaa@3B7F@ estimators may be substituted for the unknown covariances in (3.3). Because θ H MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUde3aaS baaSqaaiaadIeaaeqaaaaa@3AC5@ depends on covariances involving y , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyEaiaacY caaaa@39C4@ however, the optimal multiplicity adjustment may differ for different variables, giving a different set of weights for each. In addition, θ H MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUde3aaS baaSqaaiaadIeaaeqaaaaa@3AC5@ can be less than 0 or greater than 1, possibly resulting in negative weights for some observations. These features carry over to the Q MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyuaaaa@38EC@ -frame generalization of Hartley’s optimal estimator studied by Lohr and Rao (2006).

The estimator in (3.2), with fixed value of θ , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUdeNaai ilaaaa@3A7C@ is approximately unbiased for Y MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamywaaaa@38F4@ under Assumption (A5). If the estimated domain totals and the estimates of the covariances in (3.3) are consistent, then the estimator with θ ^ H MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGafqiUdeNbaK aadaWgaaWcbaGaamisaaqabaaaaa@3AD5@ is consistent for Y . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamywaiaac6 caaaa@39A6@ Saegusa (2019) studied Hartley’s estimator from the perspective of empirical process theory, establishing a law of large numbers and a central limit theorem when S 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIXaaabeaaaaa@39D5@ and S 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaaaaa@39D6@ are both simple random samples.

Hartley’s application was in agriculture, and many of the early applications of dual-frame surveys were for agriculture or business surveys (Kott and Vogel, 1995), where list frames existed that contained the larger business or agricultural operations. A dual-frame survey with a disproportionately larger sample from the list frame reduced costs because (1) obtaining data from an operation in the list frame was often less expensive than obtaining data from an operation in the area frame and (2) oversampling the list frame was analogous to oversampling high-variance strata in stratified sampling and thus resulted in greater efficiency.

Later, as cellular telephones became more prevalent, concern about bias from using landline telephone samples alone led to use of dual-frame telephone surveys, with one sample from a landline frame and a second sample from a cellular telephone frame. Here, both frames are incomplete but together cover the population of persons with telephones. For these surveys, an important consideration is how to deal with persons having both kinds of telephones. The next section reviews choices for the compositing.

3.2   Multiplicity weighting adjustments

Hartley’s optimal estimator, with θ H , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUde3aaS baaSqaaiaadIeaaeqaaOGaaiilaaaa@3B7F@ uses a different set of weights for each response variable, which can lead to internal inconsistencies among estimators. Various authors have proposed estimators that use a single set of weights for all analyses. Here, I briefly list some of the multiplicity adjustment factors m i ( q ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyBamaaDa aaleaacaWGPbaabaGaaGikaiaadghacaaIPaaaaaaa@3C7E@ that result in one set of weights for the general estimator of the population total in (3.1). The resulting estimators are approximately unbiased for the population total Y MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamywaaaa@38F4@ under Assumptions (A1), (A4), and (A5). These and additional estimators are reviewed in detail by Lohr (2011), Lu, Peng and Sahr (2013), Ferraz and Vogel (2015), Arcos, Rueda, Trujillo and Molina (2015), and Baffour, Haynes, Western, Pennay, Misson and Martinez (2016).

m i ( q ) = n ˜ ( q ) f = 1 Q δ i ( f ) n ˜ ( f ) . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyBamaaDa aaleaacaWGPbaabaGaaGikaiaadghacaaIPaaaaOGaaGjbVlaai2da caaMe8+aaSaaaeaaceWGUbGbaGaadaahaaWcbeqaaiaaiIcacaWGXb GaaGykaaaaaOqaamaaqadabaGaaGPaVlabes7aKnaaDaaaleaacaWG PbaabaGaaGikaiaadAgacaaIPaaaaOGabmOBayaaiaWaaWbaaSqabe aacaaIOaGaamOzaiaaiMcaaaaabaGaamOzaiaai2dacaaIXaaabaGa amyuaaqdcqGHris5aaaakiaai6caaaa@542C@

Approximately unbiased estimates of the variances for all estimators considered in this section can be derived under Assumptions (A1) to (A6) and additional regularity conditions that ensure consistency of estimated totals and variance estimators from the Q MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyuaaaa@38EC@ samples. Skinner and Rao (1996) studied linearization variance estimators; Chauvet (2016) derived linearization variance estimators for the French housing survey that accounted for the variance reduction due to high sampling fractions from some of the frames. Lohr and Rao (2000) developed theory for using the jackknife with multiple frames, and Lohr (2007) and Aidara (2019) considered bootstrap variance estimators. These methods rely on Assumption (A3) of independent samples; Chauvet and de Marsac (2014) considered the situation in which the samples share primary sampling units but independent samples are taken at the second stage of the design.

Calculating linearization variance estimates requires special software that implements the partial derivative calculations for the multiple frames. Replication variance estimation methods such as jackknife and bootstrap, however, can be calculated in standard survey software by creating a single data set that contains all the concatenated observations and weights w ˜ i ( q ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabm4Dayaaia Waa0baaSqaaiaadMgaaeaacaaIOaGaamyCaiaaiMcaaaaaaa@3C97@ from the Q MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyuaaaa@38EC@ samples and creating replicate weights using standard methods for stratified multistage samples (Metcalf and Scott, 2009). The concatenated data set has q = 1 Q H q MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaabmaeaaca aMc8UaamisamaaBaaaleaacaWGXbaabeaaaeaacaWGXbGaaGypaiaa igdaaeaacaWGrbaaniabggHiLdaaaa@40D5@ strata, where H q MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamisamaaBa aaleaacaWGXbaabeaaaaa@3A05@ is the number of strata for S q ; MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaWGXbaabeaakiaacUdaaaa@3AD9@ observations from different samples are in different strata. The replicate weight methods also can include effects of calibration (see Section 3.3) on the variance.

Of course, many applications call for estimates of quantities other than population totals, and the multiple-frame theory applies to parameters that are smooth functions of domain totals. A different compositing factor may be desired when quantities other than population totals are of primary interest, however, and there may be special considerations for other types of analyses. Other types of statistical analyses that have been studied in the multiple-frame setting include linear (Lu, 2014b) and nonparametric (Lu, Fu and Zhang, 2021) regression, logistic regression with ordinal data (Rueda, Arcos, Molina and Ranalli, 2018), empirical distribution functions (Arcos, Martínez, Rueda and Martínez, 2017), gross flow estimation with missing data (Lu and Lohr, 2010), and chi-squared tests (Lu, 2014a).

Lu (2014b) noted that linear regression parameters estimated using the multiplicity-adjusted weights are the finite population regression coefficients B MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCOqaaaa@38E1@ that minimize the sum of squares i = 1 N ( y i x i T B ) 2 . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaabmaeaaca aMc8UaaGikaiaadMhadaWgaaWcbaGaamyAaaqabaGccaaMe8UaeyOe I0IaaGjbVlaahIhadaqhaaWcbaGaamyAaaqaaiaadsfaaaGccaWHcb GaaGykamaaCaaaleqabaGaaGOmaaaaaeaacaWGPbGaaGypaiaaigda aeaacaWGobaaniabggHiLdGccaGGUaaaaa@4BD8@ However, one of the reasons for taking a multiple-frame survey, rather than using an incomplete frame, is a concern that population characteristics may differ across domains. Lu (2014b) suggested examining the residuals separately by domain and also fitting separate regression models by domain to assess the appropriateness of the regression model.

3.3   Calibration

The PML estimator is calibrated to population counts that are known for the frames and domains. In a dual-frame survey where N ( 1 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOtamaaCa aaleqabaGaaGikaiaaigdacaaIPaaaaaaa@3B36@ and N ( 2 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOtamaaCa aaleqabaGaaGikaiaaikdacaaIPaaaaaaa@3B37@ are known, q = 1 2 i S q w i ( q ) m i , PML ( q ) δ i ( f ) = N ( f ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaabmaeaada aeqaqaaiaaykW7caWG3bWaa0baaSqaaiaadMgaaeaacaaIOaGaamyC aiaaiMcaaaGccaWGTbWaa0baaSqaaiaadMgacaaISaGaaGjbVlaabc facaqGnbGaaeitaaqaaiaaiIcacaWGXbGaaGykaaaakiabes7aKnaa DaaaleaacaWGPbaabaGaaGikaiaadAgacaaIPaaaaOGaaGjbVlaai2 dacaaMe8UaamOtamaaCaaaleqabaGaaGikaiaadAgacaaIPaaaaaqa aiaadMgacaaMe8UaeyicI4SaaGjbVlaadofadaWgaaadbaGaamyCaa qabaaaleqaniabggHiLdaaleaacaWGXbGaaGypaiaaigdaaeaacaaI YaaaniabggHiLdaaaa@622F@ for f = 1, 2. MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOzaiaays W7caaI9aGaaGjbVlaaigdacaaISaGaaGjbVlaaikdacaGGUaaaaa@414E@ If the overlap domain size N { 1, 2 } MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOtamaaBa aaleaacaaI7bGaaGymaiaaiYcacaaMe8UaaGOmaiaai2haaeqaaaaa @3EDB@ is also known, the PML estimator is calibrated to all three domain sizes. Skinner (1991) used calibration with the single-frame estimator, raking the estimator to the population frame counts.

Ranalli, Arcos, Rueda and Teodoro (2016) studied general calibration theory for dual-frame surveys. They assumed that a vector of auxiliary information x MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCiEaaaa@3917@ is available with known population totals X = i = 1 N x i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCiwaiaays W7caaI9aGaaGjbVpaaqadabaGaaGPaVlaahIhadaWgaaWcbaGaamyA aaqabaaabaGaamyAaiaai2dacaaIXaaabaGaamOtaaqdcqGHris5aO Gaaiilaaaa@4672@ and calculated multiple-frame generalized regression weights as

c i ( q ) = w ˜ i ( q ) [ 1 + ( X X ^ ) T ( f = 1 Q k S f α k w ˜ k ( f ) x k x k T ) 1 α i x i ] , ( 3.4 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0dc9fr=xfr=x frpeWZqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaiaadogadaqhaa WcbaGaamyAaaqaaiaaiIcacaWGXbGaaGykaaaakiaaysW7caaI9aGa aGjbVlqadEhagaacamaaDaaaleaacaWGPbaabaGaaGikaiaadghaca aIPaaaaOWaamWaaeaacaaMc8UaaGymaiaaysW7cqGHRaWkcaaMe8Ua aiikaiaahIfacaaMe8UaeyOeI0IaaGjbVlqahIfagaqcaiaacMcada ahaaWcbeqaaiaadsfaaaGcdaqadaqaamaaqahabaWaaabuaeaacaaM c8UaeqySde2aaSbaaSqaaiaadUgaaeqaaOGabm4DayaaiaWaa0baaS qaaiaadUgaaeaacaaIOaGaamOzaiaaiMcaaaGccaWH4bWaaSbaaSqa aiaadUgaaeqaaOGaaCiEamaaDaaaleaacaWGRbaabaGaamivaaaaae aacaWGRbGaaGjbVlabgIGiolaaysW7caWGtbWaaSbaaWqaaiaadAga aeqaaaWcbeqdcqGHris5aaWcbaGaamOzaiaai2dacaaIXaaabaGaam yuaaqdcqGHris5aaGccaGLOaGaayzkaaWaaWbaaSqabeaacqGHsisl caaIXaaaaOGaeqySde2aaSbaaSqaaiaadMgaaeqaaOGaaCiEamaaBa aaleaacaWGPbaabeaaaOGaay5waiaaw2faaiaaiYcacaaMf8UaaGzb VlaaywW7caaMf8UaaGzbVlaacIcacaaIZaGaaiOlaiaaisdacaGGPa aaaa@863A@

where α k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqySde2aaS baaSqaaiaadUgaaeqaaaaa@3AD1@ is an arbitrary constant and X ^ = f = 1 Q k S f w ˜ k ( f ) x k MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabCiwayaaja GaaGjbVlaai2dacaaMe8+aaabmaeaadaaeqaqaaiaaykW7ceWG3bGb aGaadaqhaaWcbaGaam4AaaqaaiaaiIcacaWGMbGaaGykaaaakiaahI hadaWgaaWcbaGaam4AaaqabaaabaGaam4AaiaaysW7cqGHiiIZcaaM e8Uaam4uamaaBaaameaacaWGMbaabeaaaSqab0GaeyyeIuoaaSqaai aadAgacaaI9aGaaGymaaqaaiaadgfaa0GaeyyeIuoaaaa@53B8@ estimates X MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCiwaaaa@38F7@ using the multiplicity-adjusted weights. Under regularity conditions, they showed that for the dual-frame estimator in (3.2) with fixed θ , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUdeNaai ilaaaa@3A7C@ the variance of the generalized regression estimator Y ^ GR = q = 1 2 i S q   c i ( q ) y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja WaaSbaaSqaaiaabEeacaqGsbaabeaakiaaysW7caaI9aGaaGjbVpaa qadabaWaaabeaeaacaaMc8Uaam4yamaaDaaaleaacaWGPbaabaGaaG ikaiaadghacaaIPaaaaOGaamyEamaaBaaaleaacaWGPbaabeaaaeaa caWGPbGaaGjbVlabgIGiolaaysW7caWGtbWaaSbaaWqaaiaadghaae qaaaWcbeqdcqGHris5aaWcbaGaamyCaiaai2dacaaIXaaabaGaaGOm aaqdcqGHris5aaaa@5565@ is approximated by

V ( Y ^ GR ) V [ q = 1 2 i S q w ˜ i ( q ) ( y i x i T B ) ] , ( 3.5 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOvaiaayk W7caaIOaGabmywayaajaWaaSbaaSqaaiaabEeacaqGsbaabeaakiaa iMcacaaMe8UaeyisISRaaGjbVlaadAfacaaMc8+aamWaaeaadaaeWb qaamaaqafabaGaaGPaVlqadEhagaacamaaDaaaleaacaWGPbaabaGa aGikaiaadghacaaIPaaaaOGaaGPaVlaaiIcacaWG5bWaaSbaaSqaai aadMgaaeqaaOGaaGjbVlabgkHiTiaaysW7caWH4bWaa0baaSqaaiaa dMgaaeaacaWGubaaaOGaaCOqaiaaiMcaaSqaaiaadMgacaaMe8Uaey icI4SaaGjbVlaadofadaWgaaadbaGaamyCaaqabaaaleqaniabggHi LdaaleaacaWGXbGaaGypaiaaigdaaeaacaaIYaaaniabggHiLdaaki aawUfacaGLDbaacaaISaGaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7 caGGOaGaaG4maiaac6cacaaI1aGaaiykaaaa@75F7@

where B = ( i = 1 N α i x i x i T ) 1 i = 1 N α i x i y i . MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCOqaiaays W7caaI9aGaaGjbVpaabmqabaWaaabmaeaacaaMc8UaeqySde2aaSba aSqaaiaadMgaaeqaaOGaaCiEamaaBaaaleaacaWGPbaabeaakiaahI hadaqhaaWcbaGaamyAaaqaaiaadsfaaaaabaGaamyAaiaai2dacaaI XaaabaGaamOtaaqdcqGHris5aaGccaGLOaGaayzkaaWaaWbaaSqabe aacqGHsislcaaIXaaaaOWaaabmaeaacaaMc8UaeqySde2aaSbaaSqa aiaadMgaaeqaaOGaaCiEamaaBaaaleaacaWGPbaabeaakiaadMhada WgaaWcbaGaamyAaaqabaaabaGaamyAaiaai2dacaaIXaaabaGaamOt aaqdcqGHris5aOGaaiOlaaaa@5D57@ The variance of the estimator depends on the residuals from the regression model just as in the single-frame case.

Särndal and Lundström (2005) distinguished among types of auxiliary information that can be used in calibration. InfoU is information available at the population level. A vector x * MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCiEamaaCa aaleqabaGaaiOkaaaaaaa@39F2@ can be considered as InfoU if the population total X * = i = 1 N x i * MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCiwamaaCa aaleqabaGaaiOkaaaakiaaysW7caaI9aGaaGjbVpaaqadabaGaaGPa VlaahIhadaqhaaWcbaGaamyAaaqaaiaacQcaaaaabaGaamyAaiaai2 dacaaIXaaabaGaamOtaaqdcqGHris5aaaa@474C@ is known and x * MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCiEamaaCa aaleqabaGaaiOkaaaaaaa@39F2@ is observed for every respondent in the sample. InfoS is information available at the level of the sample, but not at the population level. Vector x o MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCiEamaaCa aaleqabaGaam4Baaaaaaa@3A38@ is InfoS if it is known for every member of the sample, both respondents and nonrespondents, but i = 1 N x o MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaabmaeaaca aMc8UaaCiEamaaCaaaleqabaGaam4BaaaaaeaacaWGPbGaaGypaiaa igdaaeaacaWGobaaniabggHiLdaaaa@40FD@ is unknown.

In a multiple-frame survey, the variables available for InfoU and InfoS may differ across frames. For the NSAF, little auxiliary information was known for nonrespondents in the RDD sample but address-related information (for example, characteristics of the block group) was known for all members of the area-frame sample. The reverse may be true for a dual-frame survey in which Frame 1 is an area frame and Frame 2 is a list frame. The list frame may have rich information that can be used for weighting class adjustments or calibration, while the auxiliary information for the area frame may be restricted to information measured in the survey for which population totals are known from an external source such as a census or population register.

Ranalli et al. (2016) allowed for differing InfoU information across the frames; some of the auxiliary variables may be known for units from all samples and for the full population, while other variables may be of the form x i * = x i δ i ( q ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiEamaaDa aaleaacaWGPbaabaGaaiOkaaaakiaaysW7caaI9aGaaGjbVlaadIha daWgaaWcbaGaamyAaaqabaGccqaH0oazdaqhaaWcbaGaamyAaaqaai aaiIcacaWGXbGaaGykaaaaaaa@4603@ with total X * = i = 1 N x i δ i ( q ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiwamaaCa aaleqabaGaaiOkaaaakiaaysW7caaI9aGaaGjbVpaaqadabaGaaGPa VlaadIhadaWgaaWcbaGaamyAaaqabaGccqaH0oazdaqhaaWcbaGaam yAaaqaaiaaiIcacaWGXbGaaGykaaaaaeaacaWGPbGaaGypaiaaigda aeaacaWGobaaniabggHiLdGccaGGSaaaaa@4C74@ the total of variable x MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiEaaaa@3913@ in Frame q . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyCaiaac6 caaaa@39BE@ Calibration to frame counts N ( q ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOtamaaCa aaleqabaGaaGikaiaadghacaaIPaaaaaaa@3B70@ is thus a special case of the general calibration theory.

But the differing amounts of information for the frames may also have a bearing on the multiplicity adjustments. Suppose that Frame 2 has rich auxiliary information for calibration while Frame 1 has little information. Calibrating the weights w i ( 2 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4DamaaDa aaleaacaWGPbaabaGaaGikaiaaikdacaaIPaaaaaaa@3C4E@ before compositing may increase the relative effective sample size from S 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaaaaa@39D6@ and thus increase the value of n ˜ ( 2 ) / ( n ˜ ( 1 ) + n ˜ ( 2 ) ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0dc9fr=xfr=x frpeWZqaaeaabiGaciaacaqabeaadaqaaqaaaOqaamaalyaabaGabm OBayaaiaWaaWbaaSqabeaacaaIOaGaaGOmaiaaiMcaaaaakeaacaaM c8+aaeWabeaaceWGUbGbaGaadaahaaWcbeqaaiaaiIcacaaIXaGaaG ykaaaakiaaysW7cqGHRaWkcaaMe8UabmOBayaaiaWaaWbaaSqabeaa caaIOaGaaGOmaiaaiMcaaaaakiaawIcacaGLPaaaaaaaaa@48B3@ that would be used for the ESS estimator.

Haziza and Lesage (2016) argued that a two-step weighting procedure offers several advantages for single-frame surveys with nonresponse. The first step divides the design weight for unit i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyAaaaa@3904@ by its estimated response propensity (often calculated from InfoS information) and the second step calibrates the nonresponse-adjusted weights to population control totals (available from InfoU information). When there is substantial nonresponse, weighting adjustment factors from step 1 are often much higher than those from step 2; if the response propensity model is correct, the weighting adjustments in step 2 converge to 1 as n . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOBaiaays W7cqGHsgIRcaaMe8UaeyOhIuQaaiOlaaaa@4033@ The two-step procedure is thus more robust toward misspecification of the calibration model.

The same considerations apply for multiple-frame surveys. A two-step procedure, where step 1 adjusts the samples separately for nonresponse and step 2 calibrates the combined samples, provides robustness to the calibration model. Suppose that S 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIXaaabeaaaaa@39D5@ has full response; S 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaaaaa@39D6@ has nonresponse but the response propensities can be predicted perfectly from variable x . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiEaiaac6 caaaa@39C5@ Then, performing a separate nonresponse adjustment for each sample in step 1 removes the bias for S 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaaaaa@39D6@ so that Assumption (A5) is satisfied. If the data are combined first and then calibrated using (3.4), however, the calibration may change the weights for units in S 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIXaaabeaaaaa@39D5@ in order to meet the calibration constraints  MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaacbaqcLbwaqa aaaaaaaaWdbiaa=nbiaaa@37A3@  introducing bias for the estimates from S 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIXaaabeaaaaa@39D5@ while not removing it for estimates from S 2 . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaakiaac6caaaa@3A92@ More research is needed on the ordering of steps for weight adjustments. It may be better to perform two steps of nonresponse adjustments and calibration on each sample separately, then adjust the weights for multiplicity, and then calibrate to population totals (including re-calibrating on the individual frame variables).

One consequence of using an overlap estimator for a multiple-frame survey is that the multiplicity adjustments may introduce more weight variation, with observations belonging to one frame having much larger weights than observations belonging to more than one frame. If, for example, a list frame (Frame 2 in Figure 2.2(a, b)) is disproportionately oversampled, then the sampling weights for observations in domain {1} MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaccaGae83EaS Nae8xmaeJae8xFa0haaa@39E1@ which are sampled only from Frame 1, may be large relative to the weights for the other domains. Wolter, Ganesh, Copeland, Singleton and Khare (2019) suggested using a shrinkage estimator, estimating Y { 1 } MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamywamaaBa aaleaacaaI7bGaaGymaiaai2haaeqaaaaa@3BE7@ by κ Y ^ { 1 } ( 1 ) + ( 1 κ ) N { 1 } ( Y ^ { 2 } ( 2 ) + Y ^ { 1, 2 } ) / N ( 2 ) , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaSGbaeaacq aH6oWAceWGzbGbaKaadaqhaaWcbaGaaG4EaiaaigdacaaI9baabaGa aGikaiaaigdacaaIPaaaaOGaaGjbVlabgUcaRiaaysW7caaIOaGaaG ymaiaaysW7cqGHsislcaaMe8UaeqOUdSMaaGykaiaaysW7caWGobWa aSbaaSqaaiaaiUhacaaIXaGaaGyFaaqabaGccaaMc8UaaGikaiqadM fagaqcamaaDaaaleaacaaI7bGaaGOmaiaai2haaeaacaaIOaGaaGOm aiaaiMcaaaGccaaMe8Uaey4kaSIabmywayaajaWaaSbaaSqaaiaaiU hacaaIXaGaaGilaiaaysW7caaIYaGaaGyFaaqabaGccaaIPaaabaGa aGPaVlaad6eadaahaaWcbeqaaiaaiIcacaaIYaGaaGykaaaaaaGcca GGSaaaaa@68D9@ but the shrinkage may introduce bias  MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaacbaqcLbwaqa aaaaaaaaWdbiaa=nbiaaa@37A3@  after all, the reason for using a more complicated multiple-frame design instead of just sampling from Frame 2 is to avoid potential bias from omitting domain {1} MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaccaGae83EaS Nae8xmaeJae8xFa0haaa@39E1@ . A better solution, if feasible, is to address the weight variation when designing the survey, as discussed in Section 5.

3.4   Probability sample combined with census of a population subset

Lohr (2014) and Kim and Tam (2021) noted that the situation in Figure 2.2(a) includes the special case in which a probability sample S 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIXaaabeaaaaa@39D5@ is taken from Frame 1 having full coverage, and the sample S 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaaaaa@39D6@ from Frame 2 is a census of domain {1, 2}. The overlap domain is thus defined to be the units in S 2 , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaakiaacYcaaaa@3A90@ which may be from administrative records or a convenience sample. Although S 2 , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaakiaacYcaaaa@3A90@ considered by itself, may have undercoverage bias, in the multiple-frame setting the bias is eliminated by the presence of a sample from Frame 1. The units in S 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaaaaa@39D6@ have w i ( 2 ) = 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4DamaaDa aaleaacaWGPbaabaGaaGikaiaaikdacaaIPaaaaOGaaGjbVlaai2da caaMe8UaaGymaaaa@40F4@ and represent themselves alone; they do not represent any units in other parts of the population. When N ( 2 ) / N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaSGbaeaaca WGobWaaWbaaSqabeaacaaIOaGaaGOmaiaaiMcaaaGccaaMc8oabaGa aGPaVlaad6eaaaaaaa@3F40@ is small, say from a small convenience sample, S 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaaaaa@39D6@ will have little effect on dual-frame estimators  MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaacbaqcLbwaqa aaaaaaaaWdbiaa=nbiaaa@37A3@  almost all of the population is in domain {1} MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaccaGae83EaS Nae8xmaeJae8xFa0haaa@39E1@ . But when N ( 2 ) / N MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaSGbaeaaca WGobWaaWbaaSqabeaacaaIOaGaaGOmaiaaiMcaaaGccaaMc8oabaGa aGPaVlaad6eaaaaaaa@3F40@ is large, as may occur when Frame 2 consists of administrative records, the availability of those records may improve the precision of Y ^ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja aaaa@3904@ if Assumptions (A1) to (A6) are met.

When S 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaaaaa@39D6@ is a census with no measurement error, Y ^ { 1, 2 } ( 2 ) = Y { 1, 2 }   . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja Waa0baaSqaaiaaiUhacaaIXaGaaGilaiaaysW7caaIYaGaaGyFaaqa aiaaiIcacaaIYaGaaGykaaaakiaaysW7caaI9aGaaGjbVlaadMfada WgaaWcbaGaaG4EaiaaigdacaaISaGaaGjbVlaaikdacaaI9baabeaa kiaac6caaaa@4C8F@ The estimator in (3.2) is

Y ^ ( θ ) = Y ^ { 1 } ( 1 ) + θ Y ^ { 1, 2 } ( 1 ) + ( 1 θ ) Y { 1, 2 } ; ( 3.6 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja GaaGPaVlaaiIcacqaH4oqCcaaIPaGaaGjbVlaai2dacaaMe8Uabmyw ayaajaWaa0baaSqaaiaaiUhacaaIXaGaaGyFaaqaaiaaiIcacaaIXa GaaGykaaaakiaaysW7cqGHRaWkcaaMe8UaeqiUdeNabmywayaajaWa a0baaSqaaiaaiUhacaaIXaGaaGilaiaaysW7caaIYaGaaGyFaaqaai aaiIcacaaIXaGaaGykaaaakiaaysW7cqGHRaWkcaaMe8UaaGikaiaa igdacaaMe8UaeyOeI0IaaGjbVlabeI7aXjaaiMcacaaMe8Uaamywam aaBaaaleaacaaI7bGaaGymaiaaiYcacaaMe8UaaGOmaiaai2haaeqa aOGaaG4oaiaaywW7caaMf8UaaGzbVlaaywW7caaMf8Uaaiikaiaaio dacaGGUaGaaGOnaiaacMcaaaa@76A6@

taking θ = 0 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUdeNaaG jbVlaai2dacaaMe8UaaGimaaaa@3E67@ uses the known population total from Frame 2 and relies on Frame 1 only for estimation of the part of the population not in Frame 2.

Kim and Tam (2021) noted that since Y { 1, 2 } MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamywamaaBa aaleaacaaI7bGaaGymaiaaiYcacaaMe8UaaGOmaiaai2haaeqaaaaa @3EE6@ is known, it can be used as an InfoU calibration total. They proposed two calibration estimators: a ratio estimator Y ^ ratio = Y ^ ( 1 ) Y { 1, 2 }   / Y ^ { 1, 2 } ( 1 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja WaaSbaaSqaaiaabkhacaqGHbGaaeiDaiaabMgacaqGVbaabeaakiaa ysW7caaI9aGaaGjbVpaalyaabaGabmywayaajaWaaWbaaSqabeaaca aIOaGaaGymaiaaiMcaaaGccaWGzbWaaSbaaSqaaiaaiUhacaaIXaGa aGilaiaaysW7caaIYaGaaGyFaaqabaaakeaacaaMc8Uabmywayaaja Waa0baaSqaaiaaiUhacaaIXaGaaGilaiaaysW7caaIYaGaaGyFaaqa aiaaiIcacaaIXaGaaGykaaaaaaaaaa@568A@ and a generalized regression calibration estimator. For many designs, however, the ratio estimator will be less efficient than Y ^ ( 0 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja GaaGPaVlaaiIcacaaIWaGaaGykaaaa@3CAE@ from (3.6) because

V ( Y ^ ratio ) V ( Y ^ { 1 } ( 1 ) ) + ( Y { 1 } Y { 1, 2 } ) 2 V [ Y ^ { 1, 2 } ( 1 ) ] 2 Y { 1 } Y { 1, 2 } Cov ( Y ^ { 1 } ( 1 ) , Y ^ { 1, 2 } ( 1 ) ) ; MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOvaiaayk W7daqadeqaaiqadMfagaqcamaaBaaaleaacaqGYbGaaeyyaiaabsha caqGPbGaae4BaaqabaaakiaawIcacaGLPaaacaaMe8UaaGjbVlabgI Ki7kaaysW7caaMe8UaamOvaiaaykW7daqadeqaaiqadMfagaqcamaa DaaaleaacaaI7bGaaGymaiaai2haaeaacaaIOaGaaGymaiaaiMcaaa aakiaawIcacaGLPaaacaaMe8Uaey4kaSIaaGjbVpaabmaabaWaaSaa aeaacaWGzbWaaSbaaSqaaiaaiUhacaaIXaGaaGyFaaqabaaakeaaca WGzbWaaSbaaSqaaiaaiUhacaaIXaGaaGilaiaaysW7caaIYaGaaGyF aaqabaaaaaGccaGLOaGaayzkaaWaaWbaaSqabeaacaaIYaaaaOGaam OvaiaaykW7daWadeqaaiqadMfagaqcamaaDaaaleaacaaI7bGaaGym aiaaiYcacaaMe8UaaGOmaiaai2haaeaacaaIOaGaaGymaiaaiMcaaa aakiaawUfacaGLDbaacaaMe8UaeyOeI0IaaGjbVlaaikdacaaMe8+a aSaaaeaacaWGzbWaaSbaaSqaaiaaiUhacaaIXaGaaGyFaaqabaaake aacaWGzbWaaSbaaSqaaiaaiUhacaaIXaGaaGilaiaaysW7caaIYaGa aGyFaaqabaaaaOGaaGjbVlaayIW7caqGdbGaae4BaiaabAhacaaMe8 +aaeWabeaaceWGzbGbaKaadaqhaaWcbaGaaG4EaiaaigdacaaI9baa baGaaGikaiaaigdacaaIPaaaaOGaaGilaiaaysW7ceWGzbGbaKaada qhaaWcbaGaaG4EaiaaigdacaaISaGaaGjbVlaaikdacaaI9baabaGa aGikaiaaigdacaaIPaaaaaGccaGLOaGaayzkaaGaaG4oaaaa@9E73@

the ratio adjustment can introduce extra variability from Y ^ { 1, 2 } ( 1 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja Waa0baaSqaaiaaiUhacaaIXaGaaGilaiaaysW7caaIYaGaaGyFaaqa aiaaiIcacaaIXaGaaGykaaaaaaa@4117@ that is excluded from Y ^ ( 0 ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja GaaGPaVlaaiIcacaaIWaGaaGykaiaac6caaaa@3D60@

Calibrating Y ^ ( θ ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja GaaGPaVlaaiIcacqaH4oqCcaaIPaaaaa@3DAA@ to Y { 1, 2 } = i = 1 N x i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamywamaaBa aaleaacaaI7bGaaGymaiaaiYcacaaMe8UaaGOmaiaai2haaeqaaOGa aGjbVlaai2dacaaMe8+aaabmaeaacaaMc8UaamiEamaaBaaaleaaca WGPbaabeaaaeaacaWGPbGaaGypaiaaigdaaeaacaWGobaaniabggHi LdGccaGGSaaaaa@4C67@ for x i = δ i ( 2 ) y i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiEamaaBa aaleaacaWGPbaabeaakiaaysW7caaI9aGaaGjbVlabes7aKnaaDaaa leaacaWGPbaabaGaaGikaiaaikdacaaIPaaaaOGaamyEamaaBaaale aacaWGPbaabeaakiaacYcaaaa@45D5@ the generalized regression weights in (3.4) become

c i ( q ) = w ˜ i ( q ) [ 1 + ( Y { 1, 2 } Y ^ { 1, 2 } ( θ ) ) ( f = 1 Q k S f w ˜ k ( f ) δ k ( 2 ) y k 2 ) 1 δ i ( 2 ) y i ] , ( 3.7 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0dc9fr=xfr=x frpeWZqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaiaadogadaqhaa WcbaGaamyAaaqaaiaaiIcacaWGXbGaaGykaaaakiaaysW7caaMe8Ua aGypaiaaysW7caaMe8Uabm4DayaaiaWaa0baaSqaaiaadMgaaeaaca aIOaGaamyCaiaaiMcaaaGcdaWadaqaaiaaigdacaaMe8Uaey4kaSIa aGjbVpaabmaabaGaamywamaaBaaaleaacaaI7bGaaGymaiaaiYcaca aMe8UaaGOmaiaai2haaeqaaOGaaGjbVlabgkHiTiaaysW7ceWGzbGb aKaadaWgaaWcbaGaaG4EaiaaigdacaaISaGaaGjbVlaaikdacaaI9b aabeaakiaaiIcacqaH4oqCcaaIPaaacaGLOaGaayzkaaGaaGjbVpaa bmaabaWaaabCaeaadaaeqbqaaiaaykW7ceWG3bGbaGaadaqhaaWcba Gaam4AaaqaaiaaiIcacaWGMbGaaGykaaaakiabes7aKnaaDaaaleaa caWGRbaabaGaaGikaiaaikdacaaIPaaaaOGaamyEamaaDaaaleaaca WGRbaabaGaaGOmaaaaaeaacaWGRbGaaGjbVlabgIGiolaaysW7caWG tbWaaSbaaWqaaiaadAgaaeqaaaWcbeqdcqGHris5aaWcbaGaamOzai aai2dacaaIXaaabaGaamyuaaqdcqGHris5aaGccaGLOaGaayzkaaWa aWbaaSqabeaacqGHsislcaaIXaaaaOGaeqiTdq2aa0baaSqaaiaadM gaaeaacaaIOaGaaGOmaiaaiMcaaaGccaWG5bWaaSbaaSqaaiaadMga aeqaaaGccaGLBbGaayzxaaGaaGilaiaaywW7caaMf8UaaGzbVlaayw W7caaMf8UaaiikaiaaiodacaGGUaGaaG4naiaacMcaaaa@998C@

resulting in Y ^ GR = Y ^ ( 0 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja WaaSbaaSqaaiaabEeacaqGsbaabeaakiaaysW7caaI9aGaaGjbVlqa dMfagaqcaiaaykW7caaIOaGaaGimaiaaiMcaaaa@4352@ from (3.6). Similarly, calibrating on the vector x i = ( 1, δ i ( 2 ) , δ i ( 2 ) y i ) T MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCiEamaaBa aaleaacaWGPbaabeaakiaaysW7caaI9aGaaGjbVpaabmqabaGaaGym aiaaiYcacaaMe8UaeqiTdq2aa0baaSqaaiaadMgaaeaacaaIOaGaaG OmaiaaiMcaaaGccaaISaGaaGjbVlabes7aKnaaDaaaleaacaWGPbaa baGaaGikaiaaikdacaaIPaaaaOGaamyEamaaBaaaleaacaWGPbaabe aaaOGaayjkaiaawMcaamaaCaaaleqabaGaamivaaaaaaa@51E5@ results in Y ^ GR = Y ^ { 1 } ( 1 ) N { 1 } / N ^ { 1 } ( 1 ) + Y { 1, 2 } . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja WaaSbaaSqaaiaabEeacaqGsbaabeaakiaaysW7caaI9aGaaGjbVpaa lyaabaGabmywayaajaWaa0baaSqaaiaaiUhacaaIXaGaaGyFaaqaai aaiIcacaaIXaGaaGykaaaakiaad6eadaWgaaWcbaGaaG4Eaiaaigda caaI9baabeaakiaaykW7aeaacaaMc8UabmOtayaajaWaa0baaSqaai aaiUhacaaIXaGaaGyFaaqaaiaaiIcacaaIXaGaaGykaaaakiaaysW7 cqGHRaWkcaaMe8UaamywamaaBaaaleaacaaI7bGaaGymaiaaiYcaca aMe8UaaGOmaiaai2haaeqaaaaakiaac6caaaa@5D4B@

For some designs, the variance can be reduced even further. Montanari (1987, 1998) proposed using the regression coefficient β = [ V ( X ^ ) ] 1 Cov ( Y ^ , X ^ ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCOSdiaays W7caaI9aGaaGjbVpaadmqabaGaamOvaiaaiIcaceWHybGbaKaacaaI PaaacaGLBbGaayzxaaWaaWbaaSqabeaacqGHsislcaaIXaaaaOGaaG jcVlaaboeacaqGVbGaaeODaiaaysW7caaIOaGabmywayaajaGaaGil aiaaysW7ceWHybGbaKaacaaIPaaaaa@4F8E@ for calibration, resulting in the estimator

Y ^ opt = Y ^ + ( X X ^ ) T β . ( 3.8 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja WaaSbaaSqaaiaab+gacaqGWbGaaeiDaaqabaGccaaMe8UaaGypaiaa ysW7ceWGzbGbaKaacaaMe8Uaey4kaSIaaGjbVlaaiIcacaWHybGaaG jbVlabgkHiTiaaysW7ceWHybGbaKaacaaIPaWaaWbaaSqabeaacaWG ubaaaOGaaCOSdiaai6cacaaMf8UaaGzbVlaaywW7caaMf8UaaGzbVl aacIcacaaIZaGaaiOlaiaaiIdacaGGPaaaaa@5A74@

Rao (1994) called (3.8) the optimal regression estimator and showed that V ( Y ^ opt ) V ( Y ^ GR ) . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamOvaiaayk W7caaIOaGabmywayaajaWaaSbaaSqaaiaab+gacaqGWbGaaeiDaaqa baGccaaIPaGaaGjbVlabgsMiJkaaysW7caWGwbGaaGPaVlaaiIcace WGzbGbaKaadaWgaaWcbaGaae4raiaabkfaaeqaaOGaaGykaiaac6ca aaa@4BF0@ For the dual-frame situation considered in this section, with x i = δ i ( 2 ) y i , MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamiEamaaBa aaleaacaWGPbaabeaakiaaysW7caaI9aGaaGjbVlabes7aKnaaDaaa leaacaWGPbaabaGaaGikaiaaikdacaaIPaaaaOGaamyEamaaBaaale aacaWGPbaabeaakiaacYcaaaa@45D5@

β = Cov ( Y ^ ( 1 ) , Y ^ { 1, 2 } ( 1 ) ) V ( Y ^ { 1, 2 } ( 1 ) ) = 1 + Cov ( Y ^ { 1 } ( 1 ) , Y ^ { 1, 2 } ( 1 ) ) V ( Y ^ { 1, 2 } ( 1 ) ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaCOSdiaays W7caaMe8UaaGypaiaaysW7caaMe8+aaSaaaeaacaqGdbGaae4Baiaa bAhacaaMe8+aaeWabeaaceWGzbGbaKaadaahaaWcbeqaaiaaiIcaca aIXaGaaGykaaaakiaaiYcacaaMe8UabmywayaajaWaa0baaSqaaiaa iUhacaaIXaGaaGilaiaaysW7caaIYaGaaGyFaaqaaiaaiIcacaaIXa GaaGykaaaaaOGaayjkaiaawMcaaaqaaiaadAfacaaMc8+aaeWabeaa ceWGzbGbaKaadaqhaaWcbaGaaG4EaiaaigdacaaISaGaaGjbVlaaik dacaaI9baabaGaaGikaiaaigdacaaIPaaaaaGccaGLOaGaayzkaaaa aiaaysW7caaMe8UaaGypaiaaysW7caaMe8UaaGymaiaaysW7cqGHRa WkcaaMe8+aaSaaaeaacaaMi8Uaae4qaiaab+gacaqG2bGaaGjbVpaa bmqabaGabmywayaajaWaa0baaSqaaiaaiUhacaaIXaGaaGyFaaqaai aaiIcacaaIXaGaaGykaaaakiaaiYcacaaMe8UabmywayaajaWaa0ba aSqaaiaaiUhacaaIXaGaaGilaiaaysW7caaIYaGaaGyFaaqaaiaaiI cacaaIXaGaaGykaaaaaOGaayjkaiaawMcaaaqaaiaadAfacaaMc8+a aeWabeaaceWGzbGbaKaadaqhaaWcbaGaaG4EaiaaigdacaaISaGaaG jbVlaaikdacaaI9baabaGaaGikaiaaigdacaaIPaaaaaGccaGLOaGa ayzkaaaaaaaa@9324@

and

Y ^ opt = Y ^ ( 1 ) + ( Y { 1, 2 } Y ^ { 1, 2 } ( 1 ) ) [ 1 + Cov ( Y ^ { 1 } ( 1 ) , Y ^ { 1, 2 } ( 1 ) ) V ( Y ^ { 1, 2 } ( 1 ) ) ] = Y ^ { 1 } ( 1 ) + θ H Y ^ { 1, 2 } ( 1 ) + ( 1 θ H ) Y { 1, 2 } , ( 3.9 ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaqbaeaabiGaaa qaaiqadMfagaqcamaaBaaaleaacaqGVbGaaeiCaiaabshaaeqaaaGc baGaaGypaiaaysW7caaMe8UabmywayaajaWaaWbaaSqabeaacaaIOa GaaGymaiaaiMcaaaGccaaMe8Uaey4kaSIaaGjbVpaabmaabaGaamyw amaaBaaaleaacaaI7bGaaGymaiaaiYcacaaMe8UaaGOmaiaai2haae qaaOGaaGjbVlabgkHiTiaaysW7ceWGzbGbaKaadaqhaaWcbaGaaG4E aiaaigdacaaISaGaaGjbVlaaikdacaaI9baabaGaaGikaiaaigdaca aIPaaaaaGccaGLOaGaayzkaaGaaGjbVlaaysW7daWadaqaaiaaigda caaMe8Uaey4kaSIaaGjbVpaalaaabaGaae4qaiaab+gacaqG2bGaaG jbVpaabmqabaGabmywayaajaWaa0baaSqaaiaaiUhacaaIXaGaaGyF aaqaaiaaiIcacaaIXaGaaGykaaaakiaaiYcacaaMe8Uabmywayaaja Waa0baaSqaaiaaiUhacaaIXaGaaGilaiaaysW7caaIYaGaaGyFaaqa aiaaiIcacaaIXaGaaGykaaaaaOGaayjkaiaawMcaaaqaaiaadAfaca aMc8+aaeWabeaaceWGzbGbaKaadaqhaaWcbaGaaG4EaiaaigdacaaI SaGaaGjbVlaaikdacaaI9baabaGaaGikaiaaigdacaaIPaaaaaGcca GLOaGaayzkaaaaaaGaay5waiaaw2faaaqaaaqaaiaai2dacaaMe8Ua aGjbVlqadMfagaqcamaaDaaaleaacaaI7bGaaGymaiaai2haaeaaca aIOaGaaGymaiaaiMcaaaGccaaMe8Uaey4kaSIaaGjbVlabeI7aXnaa BaaaleaacaWGibaabeaakiqadMfagaqcamaaDaaaleaacaaI7bGaaG ymaiaaiYcacaaMe8UaaGOmaiaai2haaeaacaaIOaGaaGymaiaaiMca aaGccaaMe8Uaey4kaSIaaGjbVlaaiIcacaaIXaGaaGjbVlabgkHiTi aaysW7cqaH4oqCdaWgaaWcbaGaamisaaqabaGccaaIPaGaaGjbVlaa dMfadaWgaaWcbaGaaG4EaiaaigdacaaISaGaaGjbVlaaikdacaaI9b aabeaakiaaiYcacaaMf8UaaGzbVlaaywW7caaMf8UaaGzbVlaaywW7 caaMf8UaaGzbVlaaywW7caaMf8UaaiikaiaaiodacaGGUaGaaGyoai aacMcaaaaaaa@CD68@

where θ H = Cov ( Y ^ { 1 } ( 1 ) , Y ^ { 1, 2 } ( 1 ) ) / V ( Y ^ { 1, 2 } ( 1 ) ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUde3aaS baaSqaaiaadIeaaeqaaOGaaGjbVlaai2dacaaMe8UaeyOeI0Iaae4q aiaab+gacaqG2bGaaGjbVpaalyaabaWaaeWabeaaceWGzbGbaKaada qhaaWcbaGaaG4EaiaaigdacaaI9baabaGaaGikaiaaigdacaaIPaaa aOGaaGilaiaaysW7ceWGzbGbaKaadaqhaaWcbaGaaG4Eaiaaigdaca aISaGaaGjbVlaaikdacaaI9baabaGaaGikaiaaigdacaaIPaaaaaGc caGLOaGaayzkaaaabaGaaGPaVlaadAfadaqadeqaaiqadMfagaqcam aaDaaaleaacaaI7bGaaGymaiaaiYcacaaMe8UaaGOmaiaai2haaeaa caaIOaGaaGymaiaaiMcaaaaakiaawIcacaGLPaaaaaaaaa@63D0@ is Hartley’s optimal value for θ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUdehaaa@39CC@ from (3.3).

Although we usually think of the compositing factor θ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUdehaaa@39CC@ as being between 0 and 1, θ H MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaeqiUde3aaS baaSqaaiaadIeaaeqaaaaa@3AC5@ can be outside of this range. For a conceptual example, suppose that Frame 2 is a list of children receiving food assistance at school and the sample from Frame 1 is a cluster sample of households. Then households in which one or more children are receiving food assistance have some household members in domain { 1, 2 } MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaG4Eaiaaig dacaaISaGaaGjbVlaaikdacaaI9baaaa@3DDC@ and other members in domain {1} MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaccaGae83EaS Nae8xmaeJae8xFa0haaa@39E1@ . If y MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyEaaaa@3914@ exhibits high intra-household correlation, then we would expect Y ^ { 1 } ( 1 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja Waa0baaSqaaiaaiUhacaaIXaGaaGyFaaqaaiaaiIcacaaIXaGaaGyk aaaaaaa@3E18@ and Y ^ { 1, 2 } ( 1 ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja Waa0baaSqaaiaaiUhacaaIXaGaaGilaiaaysW7caaIYaGaaGyFaaqa aiaaiIcacaaIXaGaaGykaaaaaaa@4117@ to be positively correlated. In this case, Hartley’s optimal estimator results in negative weights for units in domain { 1, 2 } MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaaG4Eaiaaig dacaaISaGaaGjbVlaaikdacaaI9baaaa@3DDC@ from the probability sample.

Even though Y ^ opt MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGabmywayaaja WaaSbaaSqaaiaab+gacaqGWbGaaeiDaaqabaaaaa@3C0C@ is more efficient for special situations such as the cluster sample described above, it depends in practice on an estimate of the covariance, is optimal only for this particular y MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaamyEaaaa@3914@ variable, and may have negative weights. Negative weights can also occur if one does optimal calibration with auxiliary variable ( 1, δ i ( 2 ) , δ i ( 2 ) y i ); MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaWaaeWabeaaca aIXaGaaGilaiaaysW7cqaH0oazdaqhaaWcbaGaamyAaaqaaiaaiIca caaIYaGaaGykaaaakiaaiYcacaaMe8UaeqiTdq2aa0baaSqaaiaadM gaaeaacaaIOaGaaGOmaiaaiMcaaaGccaWG5bWaaSbaaSqaaiaadMga aeqaaaGccaGLOaGaayzkaaGaai4oaaaa@4B98@ in fact, that calibration results in the estimator proposed by Fuller and Burmeister (1972). These optimal regression estimators are sensitive to the model assumptions, and in general I do not recommend their use.

When the Frame-2 sample is a census and Assumptions (A1) to (A6) are met, the precision of population estimates depends entirely on the design of S 1 . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIXaaabeaakiaac6caaaa@3A91@ When the samples are not designed to be part of a multiple-frame survey (and sometimes even when they are), it is likely that one or more of the assumptions is violated. Assumptions (A4) and (A6) are particularly suspect when it is desired to combine data from surveys that were not designed with combination in mind. Even if both surveys measure unemployment, they may use different questions so that the unemployment statistics from S 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaaaaa@39D6@ measure a different concept than the statistics from S 1 . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIXaaabeaakiaac6caaaa@3A91@ Domain misclassification may also occur. A unit in the census S 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaaaaa@39D6@ is known to also be in complete Frame 1, but it may be difficult to tell whether a unit in S 1 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIXaaabeaaaaa@39D5@ is also in the administrative records or convenience sample that serves as S 2 . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiFu0Je9sqqrpepC0xbbL8F4rqqrFfpu0de9LqFHe9Lq pepeea0xd9q8as0=LqLs=Jirpepeea0=as0Fb9pgea0lrP0xe9Fve9 Fve9qapdbaqaaeGaciGaaiaabeqaamaabaabaaGcbaGaam4uamaaBa aaleaacaaIYaaabeaakiaac6caaaa@3A92@ These problems are discussed in the next section.


Date modified: