Decomposition of gender wage inequalities through calibration: Application to the Swiss structure of earnings survey
Section 4. The weighted DFL method

4.1  The method

The method proposed by DiNardo et al. (1996) uses a reweighting function that renders women’s distribution of characteristics similar to men’s. The reweighted distribution is the women’s counterfactual distribution of characteristics. Here the DFL method is presented with survey weights in order to take the sampling design into account.

The reweighting function is equal to

$$
\psi(\mathbf{x}_k) = \frac{\Pr(D_{Mk} = 1 \mid \mathbf{x}_k) \,/\, \Pr(D_{Mk} = 1)}{\Pr(D_{Mk} = 0 \mid \mathbf{x}_k) \,/\, \Pr(D_{Mk} = 0)},
$$

where $D_{Mk} = 1$ if individual $k$ is a man and $D_{Mk} = 0$ otherwise, and $\mathbf{x}_k$ is the vector of observed characteristics for individual $k$. The probabilities $\Pr(D_{Mk} = 1 \mid \mathbf{x}_k)$ and $\Pr(D_{Mk} = 0 \mid \mathbf{x}_k)$ must be estimated. For this type of estimation, DiNardo et al. (1996) suggested the use of a logit or a probit model. Using the information from the sample,

$$
\hat{\psi}(\mathbf{x}_k) = \frac{\widehat{\Pr}(D_{Mk} = 1 \mid \mathbf{x}_k) \,/\, \widehat{\Pr}(D_{Mk} = 1)}{\widehat{\Pr}(D_{Mk} = 0 \mid \mathbf{x}_k) \,/\, \widehat{\Pr}(D_{Mk} = 0)}. \tag{4.1}
$$
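As an illustration of equation (4.1), the sketch below fits a survey-weighted logit by Newton-Raphson and turns the fitted probabilities into the reweighting factor. It assumes the logit option (one of the two models mentioned above); the function names, the solver, and the simulated data are hypothetical and are not taken from the paper.

```python
import numpy as np

def fit_weighted_logit(X, is_man, d, max_iter=100, tol=1e-10):
    """Survey-weighted logit of is_man on X, solved by Newton-Raphson.

    X must already contain an intercept column; d holds the survey weights.
    """
    gamma = np.zeros(X.shape[1])
    for _ in range(max_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ gamma)))
        score = X.T @ (d * (is_man - p))                 # weighted score vector
        info = X.T @ (X * (d * p * (1.0 - p))[:, None])  # weighted information matrix
        step = np.linalg.solve(info, score)
        gamma = gamma + step
        if np.max(np.abs(step)) < tol:
            break
    return gamma

def dfl_factor(X, is_man, d):
    """Estimated reweighting factor of equation (4.1) for every sampled unit."""
    gamma = fit_weighted_logit(X, is_man, d)
    p_cond = 1.0 / (1.0 + np.exp(-(X @ gamma)))  # estimate of Pr(D_Mk = 1 | x_k)
    p_man = np.sum(d * is_man) / np.sum(d)       # estimate of Pr(D_Mk = 1)
    return (p_cond / p_man) / ((1.0 - p_cond) / (1.0 - p_man))

# Hypothetical toy data: an intercept and one characteristic.
rng = np.random.default_rng(0)
n = 400
X = np.column_stack([np.ones(n), rng.normal(size=n)])
prob_man = 1.0 / (1.0 + np.exp(-(0.2 + 0.8 * X[:, 1])))
is_man = (rng.random(n) < prob_man).astype(float)
d = rng.uniform(1.0, 3.0, size=n)

psi_hat = dfl_factor(X, is_man, d)
```

Only the values of `psi_hat` for the women (`is_man == 0`) enter the estimators that follow.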

Using the reweighting factor, women’s counterfactual wage mean is estimated by

$$
\widehat{\overline{Y}}_{F \mid M}^{\mathrm{DFL}} = \frac{\sum_{k \in S_F} d_k \hat{\psi}(\mathbf{x}_k)\, y_k}{\sum_{k \in S_F} d_k \hat{\psi}(\mathbf{x}_k)}, \tag{4.2}
$$

and women’s counterfactual means of characteristics by

$$
\widehat{\overline{\mathbf{X}}}_{F \mid M}^{\mathrm{DFL}} = \frac{\sum_{k \in S_F} d_k \hat{\psi}(\mathbf{x}_k)\, \mathbf{x}_k}{\sum_{k \in S_F} d_k \hat{\psi}(\mathbf{x}_k)}. \tag{4.3}
$$
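Equations (4.2) and (4.3) are weighted means over the women’s sample with weights $d_k \hat{\psi}(\mathbf{x}_k)$. A minimal sketch, with a hypothetical function name and made-up numbers for three women:

```python
import numpy as np

def counterfactual_means(y, X, d, psi):
    """Equations (4.2) and (4.3), computed over the women's sample only."""
    w = d * psi                               # survey weight times reweighting factor
    y_bar = np.average(y, weights=w)          # counterfactual wage mean
    x_bar = np.average(X, axis=0, weights=w)  # counterfactual means of characteristics
    return y_bar, x_bar

# Hypothetical women's sample of three units (values are illustrative only).
y = np.array([30.0, 40.0, 50.0])
X = np.array([[1.0, 10.0], [1.0, 12.0], [1.0, 16.0]])
d = np.array([2.0, 1.0, 1.0])
psi = np.array([0.5, 1.0, 2.0])

y_bar, x_bar = counterfactual_means(y, X, d, psi)
```

Because both estimators are ratios, multiplying every value of `psi` by the same constant leaves the results unchanged.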

Under a logit model, the estimated reweighting factor defined in equation (4.1) is equal to

$$
\hat{\psi}(\mathbf{x}_k) = \hat{a}\, \exp\!\left(\mathbf{x}_k^{\top} \hat{\boldsymbol{\gamma}}\right),
$$

where $\hat{\boldsymbol{\gamma}}$ is the estimate of $\boldsymbol{\gamma}$ obtained from the sample using empirical likelihood and $\hat{a}$ is the ratio of the estimated proportions of women and men. It is given by:

$$
\hat{a} = \frac{\widehat{\Pr}(D_{Mk} = 0)}{\widehat{\Pr}(D_{Mk} = 1)} = \frac{\sum_{k \in S_F} d_k}{\sum_{k \in S_M} d_k}.
$$
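The exponential form follows directly when the conditional probabilities are fitted with a logit model. This is an assumption made here for illustration: the text also allows a probit model, for which this closed form would not hold. A short derivation under the logit assumption:

```latex
\begin{align*}
\widehat{\Pr}(D_{Mk} = 1 \mid \mathbf{x}_k)
  &= \frac{\exp(\mathbf{x}_k^{\top}\hat{\boldsymbol{\gamma}})}
          {1 + \exp(\mathbf{x}_k^{\top}\hat{\boldsymbol{\gamma}})},
\qquad
\widehat{\Pr}(D_{Mk} = 0 \mid \mathbf{x}_k)
   = \frac{1}{1 + \exp(\mathbf{x}_k^{\top}\hat{\boldsymbol{\gamma}})}, \\[4pt]
\frac{\widehat{\Pr}(D_{Mk} = 1 \mid \mathbf{x}_k)}
     {\widehat{\Pr}(D_{Mk} = 0 \mid \mathbf{x}_k)}
  &= \exp(\mathbf{x}_k^{\top}\hat{\boldsymbol{\gamma}}), \\[4pt]
\hat{\psi}(\mathbf{x}_k)
  &= \frac{\widehat{\Pr}(D_{Mk} = 0)}{\widehat{\Pr}(D_{Mk} = 1)}
     \cdot
     \frac{\widehat{\Pr}(D_{Mk} = 1 \mid \mathbf{x}_k)}
          {\widehat{\Pr}(D_{Mk} = 0 \mid \mathbf{x}_k)}
   = \hat{a}\, \exp(\mathbf{x}_k^{\top}\hat{\boldsymbol{\gamma}}).
\end{align*}
```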

Since the DFL method is presented taking the survey weights into account, the reweighting factor $\psi_k$ is multiplied by $d_k$, $k \in S_F$. The resulting factor is termed the “weighted DFL factor”. Because $\hat{a}$ does not depend on $k$, it cancels between numerator and denominator, and women’s estimated counterfactual wage mean can be re-expressed as

$$
\widehat{\overline{Y}}_{F \mid M}^{\mathrm{DFL}} = \frac{\sum_{k \in S_F} d_k \hat{\psi}(\mathbf{x}_k)\, y_k}{\sum_{k \in S_F} d_k \hat{\psi}(\mathbf{x}_k)} = \frac{\sum_{k \in S_F} d_k \exp\!\left(\mathbf{x}_k^{\top} \hat{\boldsymbol{\gamma}}\right) y_k}{\sum_{k \in S_F} d_k \exp\!\left(\mathbf{x}_k^{\top} \hat{\boldsymbol{\gamma}}\right)}. \tag{4.4}
$$

Women’s counterfactual means of characteristics are estimated as

$$
\widehat{\overline{\mathbf{X}}}_{F \mid M}^{\mathrm{DFL}} = \frac{\sum_{k \in S_F} d_k \hat{\psi}(\mathbf{x}_k)\, \mathbf{x}_k}{\sum_{k \in S_F} d_k \hat{\psi}(\mathbf{x}_k)} = \frac{\sum_{k \in S_F} d_k \exp\!\left(\mathbf{x}_k^{\top} \hat{\boldsymbol{\gamma}}\right) \mathbf{x}_k}{\sum_{k \in S_F} d_k \exp\!\left(\mathbf{x}_k^{\top} \hat{\boldsymbol{\gamma}}\right)}.
$$

Through the use of the reweighting factor, the counterfactual coefficients in the women’s sample are given by

$$
\boldsymbol{\beta}_F^{\mathrm{DFL}} = \left( \sum_{k \in U_F} \psi(\mathbf{x}_k)\, \mathbf{x}_k \mathbf{x}_k^{\top} \right)^{-1} \sum_{k \in U_F} \psi(\mathbf{x}_k)\, \mathbf{x}_k y_k,
$$

and estimated by

$$
\hat{\boldsymbol{\beta}}_F^{\mathrm{DFL}} = \left( \sum_{k \in S_F} d_k \hat{\psi}(\mathbf{x}_k)\, \mathbf{x}_k \mathbf{x}_k^{\top} \right)^{-1} \sum_{k \in S_F} d_k \hat{\psi}(\mathbf{x}_k)\, \mathbf{x}_k y_k. \tag{4.5}
$$

The coefficients above have to be computed, because under the same condition as in Result 1, women’s counterfactual wage mean defined in (4.2) is given by

$$
\widehat{\overline{Y}}_{F \mid M}^{\mathrm{DFL}} = \widehat{\overline{\mathbf{X}}}_{F \mid M}^{\mathrm{DFL}\,\top} \hat{\boldsymbol{\beta}}_F^{\mathrm{DFL}}.
$$
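Equation (4.5) is a weighted least squares fit with weights $d_k \hat{\psi}(\mathbf{x}_k)$. The sketch below computes it and checks the identity above numerically. It assumes that the condition of Result 1 (not reproduced in this section) is satisfied by including an intercept among the regressors. The function name, the simulated data, and the placeholder values of the reweighting factor are hypothetical.

```python
import numpy as np

def dfl_coefficients(y, X, d, psi):
    """Weighted least squares of equation (4.5) on the women's sample."""
    w = d * psi
    xtwx = X.T @ (X * w[:, None])  # sum of d_k * psi_k * x_k x_k^T
    xtwy = X.T @ (w * y)           # sum of d_k * psi_k * x_k y_k
    return np.linalg.solve(xtwx, xtwy)

# Hypothetical women's sample: an intercept and one characteristic.
rng = np.random.default_rng(1)
n = 200
X = np.column_stack([np.ones(n), rng.normal(size=n)])
y = 3.0 + 0.5 * X[:, 1] + rng.normal(scale=0.2, size=n)
d = rng.uniform(1.0, 3.0, size=n)
psi = np.exp(0.3 * X[:, 1])        # stands in for an estimated factor

beta_dfl = dfl_coefficients(y, X, d, psi)

# Counterfactual means (4.2) and (4.3) with the same weights.
w = d * psi
y_bar = np.average(y, weights=w)
x_bar = np.average(X, axis=0, weights=w)

# With an intercept in X, the weighted residuals sum to zero,
# so the counterfactual wage mean equals x_bar @ beta_dfl.
identity_holds = np.isclose(x_bar @ beta_dfl, y_bar)
```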

The BO decomposition formula can now be expressed as

$$
\begin{aligned}
\widehat{\overline{Y}}_M - \widehat{\overline{Y}}_F
&= \left( \widehat{\overline{Y}}_{F \mid M}^{\mathrm{DFL}} - \widehat{\overline{Y}}_F \right) + \left( \widehat{\overline{Y}}_M - \widehat{\overline{Y}}_{F \mid M}^{\mathrm{DFL}} \right) \\
&= \left( \widehat{\overline{\mathbf{X}}}_{F \mid M}^{\mathrm{DFL}\,\top} \hat{\boldsymbol{\beta}}_F^{\mathrm{DFL}} - \widehat{\overline{\mathbf{X}}}_F^{\top} \hat{\boldsymbol{\beta}}_F \right) + \left( \widehat{\overline{\mathbf{X}}}_M^{\top} \hat{\boldsymbol{\beta}}_M - \widehat{\overline{\mathbf{X}}}_{F \mid M}^{\mathrm{DFL}\,\top} \hat{\boldsymbol{\beta}}_F^{\mathrm{DFL}} \right),
\end{aligned} \tag{4.6}
$$

where $\hat{\boldsymbol{\beta}}_M$ and $\hat{\boldsymbol{\beta}}_F$ are defined in (3.1). The first term of equation (4.6) is the composition effect and the second one the structure effect.
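The two terms of equation (4.6) can be computed directly from the estimated means and coefficients. The sketch below uses a hypothetical function name and made-up input vectors (an intercept and one characteristic) and is not based on the survey data.

```python
import numpy as np

def dfl_decomposition(xbar_m, beta_m, xbar_f, beta_f, xbar_fm, beta_dfl):
    """Composition and structure effects of equation (4.6)."""
    y_m = xbar_m @ beta_m        # men's estimated wage mean
    y_f = xbar_f @ beta_f        # women's estimated wage mean
    y_fm = xbar_fm @ beta_dfl    # women's counterfactual wage mean
    composition = y_fm - y_f     # first term of (4.6)
    structure = y_m - y_fm       # second term of (4.6)
    return composition, structure

# Hypothetical estimates (illustrative values only).
xbar_m, beta_m = np.array([1.0, 14.0]), np.array([2.00, 0.100])
xbar_f, beta_f = np.array([1.0, 12.0]), np.array([1.90, 0.090])
xbar_fm, beta_dfl = np.array([1.0, 13.8]), np.array([1.95, 0.092])

composition, structure = dfl_decomposition(
    xbar_m, beta_m, xbar_f, beta_f, xbar_fm, beta_dfl
)
raw_gap = xbar_m @ beta_m - xbar_f @ beta_f
```

By construction the two effects add up to `raw_gap`, since the counterfactual mean is added and subtracted once.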

4.2  Further decomposition of the structure effect

As Fortin et al. (2011) note, the purpose of the DFL reweighting factor is to render the distribution of women’s characteristics identical to that of men. This implies that the means of the auxiliary variables in the two groups should be equal. However, with the DFL method, this is not the case. Indeed,

$$
\widehat{\overline{\mathbf{X}}}_{F \mid M}^{\mathrm{DFL}} \neq \widehat{\overline{\mathbf{X}}}_M \tag{4.7}
$$

(see, for instance, Fortin et al. 2011; Donzé, 2013). The reweighting factor thus fails to match the two distributions perfectly.

The structure effect in equation (4.6) can be further divided into the following elements

$$
\left( \widehat{\overline{\mathbf{X}}}_M^{\top} \hat{\boldsymbol{\beta}}_M - \widehat{\overline{\mathbf{X}}}_{F \mid M}^{\mathrm{DFL}\,\top} \hat{\boldsymbol{\beta}}_F^{\mathrm{DFL}} \right) = \widehat{\overline{\mathbf{X}}}_M^{\top} \left( \hat{\boldsymbol{\beta}}_M - \hat{\boldsymbol{\beta}}_F^{\mathrm{DFL}} \right) + \left( \widehat{\overline{\mathbf{X}}}_M - \widehat{\overline{\mathbf{X}}}_{F \mid M}^{\mathrm{DFL}} \right)^{\top} \hat{\boldsymbol{\beta}}_F^{\mathrm{DFL}}, \tag{4.8}
$$

where $\widehat{\overline{\mathbf{X}}}_{F \mid M}^{\mathrm{DFL}}$ and $\hat{\boldsymbol{\beta}}_F^{\mathrm{DFL}}$ are defined in equations (4.3) and (4.5), respectively (Fortin et al. 2011). The first element of the right-hand side of equation (4.8) is the pure effect and the second is the residual effect, or total reweighting error (Fortin et al. 2011). The pure effect is the actual unexplained part of the wage difference. The residual effect captures the misfit of the model, in other words, what the reweighting factor fails to match between men’s and women’s distributions of characteristics.

This method allows for the construction of a counterfactual wage distribution, which can in turn be compared with the observed wage distributions of women and men. Its drawback is that at least one characteristic may be a good predictor of gender (for instance, the economic sector). In that case $\Pr(D_{Mk} = 1 \mid \mathbf{x}_k)$ may get close to 1, so that the reweighting factor takes on a large value (Fortin et al. 2011). This leads to a large variance of the factor, as will be shown in Section 8.
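Two points of this subsection can be checked numerically: the split of the structure effect in equation (4.8), and the growth of the reweighting factor as the conditional probability of being a man approaches 1. The sketch below uses hypothetical function names and made-up inputs; the marginal proportion of men is set to 0.5 purely for illustration.

```python
import numpy as np

def split_structure_effect(xbar_m, beta_m, xbar_fm, beta_dfl):
    """Pure and residual effects of equation (4.8)."""
    pure = xbar_m @ (beta_m - beta_dfl)         # unexplained part
    residual = (xbar_m - xbar_fm) @ beta_dfl    # total reweighting error
    return pure, residual

def dfl_factor_from_propensity(p_cond, p_man):
    """Reweighting factor as a function of Pr(D=1 | x) and Pr(D=1)."""
    return (p_cond / p_man) / ((1.0 - p_cond) / (1.0 - p_man))

# Hypothetical estimates (illustrative values only).
xbar_m, beta_m = np.array([1.0, 14.0]), np.array([2.00, 0.100])
xbar_fm, beta_dfl = np.array([1.0, 13.8]), np.array([1.95, 0.092])

pure, residual = split_structure_effect(xbar_m, beta_m, xbar_fm, beta_dfl)
structure = xbar_m @ beta_m - xbar_fm @ beta_dfl  # left-hand side of (4.8)

# The factor grows without bound as the conditional probability nears 1.
p_cond = np.array([0.5, 0.9, 0.99, 0.999])
psi = dfl_factor_from_propensity(p_cond, p_man=0.5)
```

With these inputs the factor rises from 1 at a conditional probability of 0.5 to roughly 999 at 0.999, so a handful of women with characteristics that are typical of men would dominate the weighted sums.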

