Sample empirical likelihood approach under complex survey design with scrambled responses
Section 5. Real application

In this section, we applied the proposed approach to 2015-2016 National Health and Nutrition Examination Survey (NHANES) to evaluate its practical performance. NHANES provides timely health- and nutrition-related information for the noninstitutionalized civilian resident population of the United States. It uses a complex, multistage probability design based on in-person survey to collect information. (see https://wwwn.cdc.gov/nchs/nhanes/continuousnhanes/overview.aspx?BeginYear=2015 for more information). The sample size for the 2015-2016 NHANES is about 9,000. We treated the original NHANES sample as a finite population and selected one sample by using a simple random sampling design with sample sizes ( n ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBamXvP5wqonvsaeHbmv3yPrwyGmuy SXwANjxyWHwEaebbnrfifHhDYfgasaacH8rrps0lbbf9q8qqaqpepe c8Eeeu0xXdf9arpi0xb9Lqpe0dbvb9frpepeI8k8hiNsFfY=qqqrFf pie9qqpe0dd9q8qi0de9Fve9Fve9pXqaaeaabiGaciaacaqabeaadi qaaqaaaOqaamaabmqabaGaaGzaVlaad6gacaaMb8oacaGLOaGaayzk aaaaaa@4030@ as 30, 40, 50, 100, and 200, respectively. Suppose our parameters of interest include population means of systolic blood pressure, diastolic blood pressure, HDL cholesterol, and total cholesterol. We created scramble responses for these parameters by using p = MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBamXvP5wqonvsaeHbmv3yPrwyGmuy SXwANjxyWHwEaebbnrfifHhDYfgasaacH8rrps0lbbf9q8qqaqpepe c8Eeeu0xXdf9arpi0xb9Lqpe0dbvb9frpepeI8k8hiNsFfY=qqqrFf pie9qqpe0dd9q8qi0de9Fve9Fve9pXqaaeaabiGaciaacaqabeaadi qaaqaaaOqaaiaadchacaaMe8UaaGypaiaaykW7aaa@3F73@ 0.6, a = MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBamXvP5wqonvsaeHbmv3yPrwyGmuy SXwANjxyWHwEaebbnrfifHhDYfgasaacH8rrps0lbbf9q8qqaqpepe c8Eeeu0xXdf9arpi0xb9Lqpe0dbvb9frpepeI8k8hiNsFfY=qqqrFf pie9qqpe0dd9q8qi0de9Fve9Fve9pXqaaeaabiGaciaacaqabeaadi qaaqaaaOqaaiaadggacaaMe8UaaGypaiaaykW7aaa@3F64@ 1.5, and b 2 = 0 .2 / 1 .5 . MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBamXvP5wqonvsaeHbmv3yPrwyGmuy SXwANjxyWHwEaebbnrfifHhDYfgasaacH8rrps0lbbf9q8qqaqpepe c8Eeeu0xXdf9arpi0xb9Lqpe0dbvb9frpepeI8k8hiNsFfY=qqqrFf pie9qqpe0dd9q8qi0de9Fve9Fve9pXqaaeaabiGaciaacaqabeaadi qaaqaaaOqaaiaadkgadaahaaWcbeqaaiaaikdaaaGccaaMe8UaaGyp aiaaysW7daWcgaqaaiaabcdacaqGUaGaaeOmaaqaaiaabgdacaqGUa GaaeynaaaacaGGUaaaaa@4558@ In addition, body mass index (BMI) was selected as a covariate in the estimation process, since BMI is correlated with those study variables.

We compared the performances of two approaches, HJ and SEL, in terms of point estimates and interval estimates (Table 5.1). Point estimates obtained by using both methods were similar, and they were close to finite population parameters (120.47, 66.17, 54.43, and 180.25 for systolic blood pressure, diastolic blood pressure, HDL cholesterol, and total cholesterol), especially for larger sample sizes (Table 5.1). For systolic blood pressure, diastolic blood pressure, and total cholesterol, intervals produced by SEL shifted slightly to the right compared with the results produced by HJ for small sample sizes. However, when sample sizes increased, the results from the two approaches were similar. For HDL cholesterol, the results are comparable. The results from this application verified the validity of the proposed SEL approach.


Table 5.1
Point estimates and 95% CI for estimating means of different outcomes using scrambled response outcome and BMI from the NHANES data
Table summary
This table displays the results of Point estimates and 95% CI for estimating means of different outcomes using scrambled response outcome and BMI from the NHANES data Systolic Blood Pressure
in mm Hg, Diastolic Blood Pressure
in mm Hg, HDL Cholesterol
in mg/dL and Total Cholesterol
in mg/dL (appearing as column headers).
Systolic Blood Pressure
in mm Hg
Diastolic Blood Pressure
in mm Hg
HDL Cholesterol
in mg/dL
Total Cholesterol
in mg/dL
n MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBamXvP5wqonvsaeHbmv3yPrwyGmuy SXwANjxyWHwEaebbnrfifHhDYfgasaacH8rrps0lbbf9q8qqaqVGY= Hhbbf9v8qrpq0dc9vqFj0db9qqvqFr0dXdHiVc=bYP0xH8peeu0xXd crpe0db9Wqpepec9ar=xfr=xfr=tmeaabaqaciGacaGaaeqabaWace WaeaaakeaajyaGcaGIUbaaaa@3C8D@ HJ SEL HJ SEL HJ SEL HJ SEL
30 124.5 124.5 67.7 69.4 57.9 57.6 187.0 188.3
(112.3, 136.8) (113.5, 139.6) (61.5, 73.8) (63.9, 75.2) (50.3, 65.5) (50.8, 65.9) (160.0, 214.0) (166.6, 225.5)
40 125.6 125.5 70.2 70.2 52.0 51.2 178.7 178.1
(115.4, 135.8) (116.5, 136.1) (64.6, 75.8) (64.9, 76.1) (48.0, 56.0) (47.3, 55.8) (160.6, 196.8) (162.1, 199.0)
50 118.3 116.9 67.1 67.1 57.1 56.8 173.7 173.3
(110.2, 126.4) (109.0, 126.1) (60.9, 73.3) (61.4, 73.8) (50.8, 63.4) (51.3, 63.2) (160.2, 187.1) (161.2, 187.8)
100 120.8 120.5 70.0 69.7 52.3 52.4 173.1 172.8
(115.1, 126.5) (115.1, 126.3) (65.9, 74.0) (65.9, 73.6) (48.9, 55.7) (49.2, 55.9) (163.5, 182.7) (164.0, 183.2)
200 124.1 123.9 67.6 67.5 54.0 53.8 181.4 181.5
(119.4, 128.9) (119.4, 128.8) (64.9, 70.3) (64.8, 70.3) (51.1, 56.8) (51.3, 56.5) (172.7, 190.1) (173.3, 190.9)

Date modified: