Keyword search

Filter results by

Search Help
Currently selected filters that can be removed

Keyword(s)

Year of publication

1 facets displayed. 1 facets selected.

Content

1 facets displayed. 0 facets selected.
Sort Help
entries

Results

All (817)

All (817) (810 to 820 of 817 results)

  • Articles and reports: 12-001-X200700210493
    Description:

    In this paper, we study the problem of variance estimation for a ratio of two totals when marginal random hot deck imputation has been used to fill in missing data. We consider two approaches to inference. In the first approach, the validity of an imputation model is required. In the second approach, the validity of an imputation model is not required but response probabilities need to be estimated, in which case the validity of a nonresponse model is required. We derive variance estimators under two distinct frameworks: the customary two-phase framework and the reverse framework.

    Release date: 2008-01-03

  • Articles and reports: 12-001-X200700210494
    Description:

    The Australian Bureau of Statistics has recently developed a generalized estimation system for processing its large scale annual and sub-annual business surveys. Designs for these surveys have a large number of strata, use Simple Random Sampling within Strata, have non-negligible sampling fractions, are overlapping in consecutive periods, and are subject to frame changes. A significant challenge was to choose a variance estimation method that would best meet the following requirements: valid for a wide range of estimators (e.g., ratio and generalized regression), requires limited computation time, can be easily adapted to different designs and estimators, and has good theoretical properties measured in terms of bias and variance. This paper describes the Without Replacement Scaled Bootstrap (WOSB) that was implemented at the ABS and shows that it is appreciably more efficient than the Rao and Wu (1988)'s With Replacement Scaled Bootstrap (WSB). The main advantages of the Bootstrap over alternative replicate variance estimators are its efficiency (i.e., accuracy per unit of storage space) and the relative simplicity with which it can be specified in a system. This paper describes the WOSB variance estimator for point-in-time and movement estimates that can be expressed as a function of finite population means. Simulation results obtained as part of the evaluation process show that the WOSB was more efficient than the WSB, especially when the stratum sample sizes are sometimes as small as 5.

    Release date: 2008-01-03

  • Articles and reports: 12-001-X200700210495
    Description:

    The purpose of this work is to obtain reliable estimates in study domains when there are potentially very small sample sizes and the sampling design stratum differs from the study domain. The population sizes are unknown as well for both the study domain and the sampling design stratum. In calculating parameter estimates in the study domains, a random sample size is often necessary. We propose a new family of generalized linear mixed models with correlated random effects when there is more than one unknown parameter. The proposed model will estimate both the population size and the parameter of interest. General formulae for full conditional distributions required for Markov chain Monte Carlo (MCMC) simulations are given for this framework. Equations for Bayesian estimation and prediction at the study domains are also given. We apply the 1998 Missouri Turkey Hunting Survey, which stratified samples based on the hunter's place of residence and we require estimates at the domain level, defined as the county in which the turkey hunter actually hunted.

    Release date: 2008-01-03

  • Articles and reports: 12-001-X200700210496
    Description:

    The European Community Household Panel (ECHP) is a panel survey covering a wide range of topics regarding economic, social and living conditions. In particular, it makes it possible to calculate disposable equivalized household income, which is a key variable in the study of economic inequity and poverty. To obtain reliable estimates of the average of this variable for regions within countries it is necessary to have recourse to small area estimation methods. In this paper, we focus on empirical best linear predictors of the average equivalized income based on "unit level models" borrowing strength across both areas and times. Using a simulation study based on ECHP data, we compare the suggested estimators with cross-sectional model-based and design-based estimators. In the case of these empirical predictors, we also compare three different MSE estimators. Results show that those estimators connected to models that take units' autocorrelation into account lead to a significant gain in efficiency, even when there are no covariates available whose population mean is known.

    Release date: 2008-01-03

  • Articles and reports: 12-001-X200700210497
    Description:

    Coverage deficiencies are estimated and analysed for the 2000 population census in Switzerland. For the undercoverage component, the estimation is based on a sample independent of the census and a match with the census. For the overcoverage component, the estimation is based on a sample drawn from the census list and a match with the rest of the census. The over- and undercoverage components are then combined to obtain an estimate of the resulting net coverage. This estimate is based on a capture-recapture model, named the dual system, combined with a synthetic model. The estimators are calculated for the full population and different subgroups, with a variance estimated by a stratified jackknife. The coverage analyses are supplemented by a study of matches between the independent sample and the census in order to determine potential errors of measurement and location in the census data.

    Release date: 2008-01-03

  • Articles and reports: 12-001-X200700210498
    Description:

    In this paper we describe a methodology for combining a convenience sample with a probability sample in order to produce an estimator with a smaller mean squared error (MSE) than estimators based on only the probability sample. We then explore the properties of the resulting composite estimator, a linear combination of the convenience and probability sample estimators with weights that are a function of bias. We discuss the estimator's properties in the context of web-based convenience sampling. Our analysis demonstrates that the use of a convenience sample to supplement a probability sample for improvements in the MSE of estimation may be practical only under limited circumstances. First, the remaining bias of the estimator based on the convenience sample must be quite small, equivalent to no more than 0.1 of the outcome's population standard deviation. For a dichotomous outcome, this implies a bias of no more than five percentage points at 50 percent prevalence and no more than three percentage points at 10 percent prevalence. Second, the probability sample should contain at least 1,000-10,000 observations for adequate estimation of the bias of the convenience sample estimator. Third, it must be inexpensive and feasible to collect at least thousands (and probably tens of thousands) of web-based convenience observations. The conclusions about the limited usefulness of convenience samples with estimator bias of more than 0.1 standard deviations also apply to direct use of estimators based on that sample.

    Release date: 2008-01-03

  • Articles and reports: 12-001-X200700210499
    Description:

    In this Issue is a column where the Editor biefly presents each paper of the current issue of Survey Methodology. As well, it sometimes contain informations on structure or management changes in the journal.

    Release date: 2008-01-03
Data (370)

Data (370) (0 to 10 of 370 results)

  • Table: 13-001-X
    Description:

    This publication presents quarterly information on Canada's National Income and Expenditure Accounts (NIEA), 1947-2008. It contains data on gross domestic product (GDP) by income and by expenditure, saving and investment, borrowing and lending of each of four broad sectors of the economy: (i) persons and unincorporated businesses, (ii) corporate and government business enterprises, (iii) governments and (iv) non-residents. Information is also provided for selected subsectors. The publication begins with an analysis of the economic developments in the most recent quarter. Some issues also contain more technical articles explaining national accounts methodology or analysing a particular aspect of the economy. The publication also includes a glossary, and is no longer being released.

    Release date: 2008-12-23

  • Table: 97-559-X
    Description:

    The tables in the topic 'Labour' present data on the paid work of the Canadian workforce, including detailed industry and occupation data, class of worker, and work activity during the reference year. The census is the only source of data covering the entire labour market, including Indian reserves, overseas households, and all provinces and territories.

    This topic also presents data on the unpaid work of the Canadian workforce, including unpaid household work, unpaid child care, and unpaid senior care. These data, together with information on paid work, provide a more complete picture of the work activities of all Canadians.

    Release date: 2008-12-19

  • Table: 97-563-X
    Description:

    The tables in the topic "Income and earnings" present data on the income of Canadian individuals, families, and households in the year 2005, including the composition of income, and data that serve to measure low income, known as the low income cut-off (LICO). The data also include the household incomes of Canadians by family type, age, and geography, as well as the household incomes of certain population groups (e.g., immigrants).

    The composition of income includes earnings, income from government sources, and investments.

    Release date: 2008-12-19

  • Table: 97-559-X2006029
    Description:

    Data for Canada, provinces, territories, census divisions, census subdivisions and dissemination areas are shown in this table.

    This table is part of the topic 'Labour', which presents data on the paid work of the Canadian workforce, including detailed industry and occupation data, class of worker, and work activity during the reference year. The census is the only source of data covering the entire labour market, including Indian reserves, overseas households, and all provinces and territories.

    This topic also presents data on the unpaid work of the Canadian workforce, including unpaid household work, unpaid child care, and unpaid senior care. These data, together with information on paid work, provide a more complete picture of the work activities of all Canadians.

    It is possible to subscribe to all the day-of-release topic bundles. Refer to Catalogue no. 97-569-XCB for more information.

    Release date: 2008-12-19

  • Table: 97-563-X2006072
    Description:

    Data for Canada, provinces, territories, census divisions, census subdivisions and dissemination areas are shown in this table.

    This table is part of the topic 'Income and earnings,' which presents data on the income of Canadian individuals, families, and households in the year 2005, including the composition of income, and data that serve to measure low income, known as the low income cut-off (LICO). The data also include the household incomes of Canadians by family type, age, and geography, as well as the household incomes of certain population groups (e.g., immigrants).

    The composition of income includes earnings, income from government sources, and investments.

    It is possible to subscribe to all the day-of-release bundles. Refer to Catalogue no. 97-569-XCB for more information.

    Release date: 2008-12-19

  • Table: 89-637-X2008002
    Description:

    A series of supporting data tables accompanies the Inuit analytical article from the 2006 Aboriginal Peoples Survey (APS). These tables provide data at the national level, for each of the four Inuit regions (Nunatsiavut, Nunavik, Nunavut and the Inuvialuit region), along with data for Inuit outside these regions for major themes covered in the analytical article. Data for the Inuit identity population aged 15 and over are provided for: Participation in harvesting activities; diagnosed with arthritis/rheumatism, high blood pressure, asthma, stomach problems or intestinal ulcers, heart problems, tuberculosis and diabetes; smoking status; self-rated health status and; reasons for not completing elementary or secondary school. For Inuit children aged 6 to 14, tables include: contact with a pediatrician, general practitioner or family physician in past 12 months; contact with another medical specialist and; food insecurity.

    Release date: 2008-12-19

  • Table: 26-202-X
    Description:

    This publication presents early estimates of mineral production by class and by province, quantities and values.

    Release date: 2008-12-19

  • Profile of a community or region: 16-002-X200800410751
    Description:

    This article profiles manure production in Canada and maps manure production by sub-sub-drainage area for 2006.

    Release date: 2008-12-09

  • Table: 97-564-X
    Description:

    This new product will present data for specific census topics and population groups according to selected demographic, cultural, and socio-economic characteristics. These detailed 'profile-type' tables expand the analytical depth of basic census information.

    Release date: 2008-12-09

  • Table: 97-563-X2006008
    Description:

    Data for Canada, provinces and territories are shown in this table.

    This table is part of the topic 'Income and earnings,' which presents data on the income of Canadian individuals, families, and households in the year 2005, including the composition of income, and data that serve to measure low income, known as the low income cut-off (LICO). The data also include the household incomes of Canadians by family type, age, and geography, as well as the household incomes of certain population groups (e.g., immigrants).

    The composition of income includes earnings, income from government sources, and investments.

    It is possible to subscribe to all the day-of-release bundles. Refer to Catalogue no. 97-569-XCB for more information.

    This table is available free on the Internet, Catalogue no. 97-563-XWE2006008.

    Release date: 2008-12-09
Analysis (394)

Analysis (394) (0 to 10 of 394 results)

  • Articles and reports: 12-001-X200800210754
    Description:

    The context of the discussion is the increasing incidence of international surveys, of which one is the International Tobacco Control (ITC) Policy Evaluation Project, which began in 2002. The ITC country surveys are longitudinal, and their aim is to evaluate the effects of policy measures being introduced in various countries under the WHO Framework Convention on Tobacco Control. The challenges of organization, data collection and analysis in international surveys are reviewed and illustrated. Analysis is an increasingly important part of the motivation for large scale cross-cultural surveys. The fundamental challenge for analysis is to discern the real response (or lack of response) to policy change, separating it from the effects of data collection mode, differential non-response, external events, time-in-sample, culture, and language. Two problems relevant to statistical analysis are discussed. The first problem is the question of when and how to analyze pooled data from several countries, in order to strengthen conclusions which might be generally valid. While in some cases this seems to be straightforward, there are differing opinions on the extent to which pooling is possible and reasonable. It is suggested that for formal comparisons, random effects models are of conceptual use. The second problem is to find models of measurement across cultures and data collection modes which will enable calibration of continuous, binary and ordinal responses, and produce comparisons from which extraneous effects have been removed. It is noted that hierarchical models provide a natural way of relaxing requirements of model invariance across groups.

    Release date: 2008-12-23

  • Articles and reports: 12-001-X200800210755
    Description:

    Dependent interviewing (DI) is used in many longitudinal surveys to "feed forward" data from one wave to the next. Though it is a promising technique which has been demonstrated to enhance data quality in certain respects, relatively little is known about how it is actually administered in the field. This research seeks to address this issue through behavior coding. Various styles of DI were employed in the English Longitudinal Study of Ageing (ELSA) in January, 2006, and recordings were made of pilot field interviews. These recordings were analysed to determine whether the questions (particularly the DI aspects) were administered appropriately and to explore the respondent's reaction to the fed-forward data. Of particular interest was whether respondents confirmed or challenged the previously-reported information, whether the prior wave data came into play when respondents were providing their current-wave answers, and how any discrepancies were negotiated by the interviewer and respondent. Also of interest was to examine the effectiveness of various styles of DI. For example, in some cases the prior wave data was brought forward and respondents were asked to explicitly confirm it; in other cases the previous data was read and respondents were asked if the situation was still the same. Results indicate varying levels of compliance in terms of initial question-reading, and suggest that some styles of DI may be more effective than others.

    Release date: 2008-12-23

  • Articles and reports: 12-001-X200800210756
    Description:

    In longitudinal surveys nonresponse often occurs in a pattern that is not monotone. We consider estimation of time-dependent means under the assumption that the nonresponse mechanism is last-value-dependent. Since the last value itself may be missing when nonresponse is nonmonotone, the nonresponse mechanism under consideration is nonignorable. We propose an imputation method by first deriving some regression imputation models according to the nonresponse mechanism and then applying nonparametric regression imputation. We assume that the longitudinal data follow a Markov chain with finite second-order moments. No other assumption is imposed on the joint distribution of longitudinal data and their nonresponse indicators. A bootstrap method is applied for variance estimation. Some simulation results and an example concerning the Current Employment Survey are presented.

    Release date: 2008-12-23

  • Articles and reports: 12-001-X200800210757
    Description:

    Sample weights can be calibrated to reflect the known population totals of a set of auxiliary variables. Predictors of finite population totals calculated using these weights have low bias if these variables are related to the variable of interest, but can have high variance if too many auxiliary variables are used. This article develops an "adaptive calibration" approach, where the auxiliary variables to be used in weighting are selected using sample data. Adaptively calibrated estimators are shown to have lower mean squared error and better coverage properties than non-adaptive estimators in many cases.

    Release date: 2008-12-23

  • Articles and reports: 12-001-X200800210758
    Description:

    We propose a method for estimating the variance of estimators of changes over time, a method that takes account of all the components of these estimators: the sampling design, treatment of non-response, treatment of large companies, correlation of non-response from one wave to another, the effect of using a panel, robustification, and calibration using a ratio estimator. This method, which serves to determine the confidence intervals of changes over time, is then applied to the Swiss survey of value added.

    Release date: 2008-12-23

  • Articles and reports: 12-001-X200800210759
    Description:

    The analysis of stratified multistage sample data requires the use of design information such as stratum and primary sampling unit (PSU) identifiers, or associated replicate weights, in variance estimation. In some public release data files, such design information is masked as an effort to avoid their disclosure risk and yet to allow the user to obtain valid variance estimation. For example, in area surveys with a limited number of PSUs, the original PSUs are split or/and recombined to construct pseudo-PSUs with swapped second or subsequent stage sampling units. Such PSU masking methods, however, obviously distort the clustering structure of the sample design, yielding biased variance estimates possibly with certain systematic patterns between two variance estimates from the unmasked and masked PSU identifiers. Some of the previous work observed patterns in the ratio of the masked and unmasked variance estimates when plotted against the unmasked design effect. This paper investigates the effect of PSU masking on variance estimates under cluster sampling regarding various aspects including the clustering structure and the degree of masking. Also, we seek a PSU masking strategy through swapping of subsequent stage sampling units that helps reduce the resulting biases of the variance estimates. For illustration, we used data from the National Health Interview Survey (NHIS) with some artificial modification. The proposed strategy performs very well in reducing the biases of variance estimates. Both theory and empirical results indicate that the effect of PSU masking on variance estimates is modest with minimal swapping of subsequent stage sampling units. The proposed masking strategy has been applied to the 2003-2004 National Health and Nutrition Examination Survey (NHANES) data release.

    Release date: 2008-12-23

  • Articles and reports: 12-001-X200800210760
    Description:

    The design of a stratified simple random sample without replacement from a finite population deals with two main issues: the definition of a rule to partition the population into strata, and the allocation of sampling units in the selected strata. This article examines a tree-based strategy which plans to approach jointly these issues when the survey is multipurpose and multivariate information, quantitative or qualitative, is available. Strata are formed through a hierarchical divisive algorithm that selects finer and finer partitions by minimizing, at each step, the sample allocation required to achieve the precision levels set for each surveyed variable. In this way, large numbers of constraints can be satisfied without drastically increasing the sample size, and also without discarding variables selected for stratification or diminishing the number of their class intervals. Furthermore, the algorithm tends not to define empty or almost empty strata, thus avoiding the need for strata collapsing aggregations. The procedure was applied to redesign the Italian Farm Structure Survey. The results indicate that the gain in efficiency held using our strategy is nontrivial. For a given sample size, this procedure achieves the required precision by exploiting a number of strata which is usually a very small fraction of the number of strata available when combining all possible classes from any of the covariates.

    Release date: 2008-12-23

  • Articles and reports: 12-001-X200800210761
    Description:

    Optimum stratification is the method of choosing the best boundaries that make strata internally homogeneous, given some sample allocation. In order to make the strata internally homogenous, the strata should be constructed in such a way that the strata variances for the characteristic under study be as small as possible. This could be achieved effectively by having the distribution of the main study variable known and create strata by cutting the range of the distribution at suitable points. If the frequency distribution of the study variable is unknown, it may be approximated from the past experience or some prior knowledge obtained at a recent study. In this paper the problem of finding Optimum Strata Boundaries (OSB) is considered as the problem of determining Optimum Strata Widths (OSW). The problem is formulated as a Mathematical Programming Problem (MPP), which minimizes the variance of the estimated population parameter under Neyman allocation subject to the restriction that sum of the widths of all the strata is equal to the total range of the distribution. The distributions of the study variable are considered as continuous with Triangular and Standard Normal density functions. The formulated MPPs, which turn out to be multistage decision problems, can then be solved using dynamic programming technique proposed by Bühler and Deutler (1975). Numerical examples are presented to illustrate the computational details. The results obtained are also compared with the method of Dalenius and Hodges (1959) with an example of normal distribution.

    Release date: 2008-12-23

  • Articles and reports: 12-001-X200800210762
    Description:

    This paper considers the optimum allocation in multivariate stratified sampling as a nonlinear matrix optimisation of integers. As a particular case, a nonlinear problem of the multi-objective optimisation of integers is studied. A full detailed example including some of proposed techniques is provided at the end of the work.

    Release date: 2008-12-23

  • Articles and reports: 12-001-X200800210763
    Description:

    The present work illustrates a sampling strategy useful for obtaining planned sample size for domains belonging to different partitions of the population and in order to guarantee the sampling errors of domain estimates be lower than given thresholds. The sampling strategy that covers the multivariate multi-domain case is useful when the overall sample size is bounded and consequently the standard solution of using a stratified sample with the strata given by cross-classification of variables defining the different partitions is not feasible since the number of strata is larger than the overall sample size. The proposed sampling strategy is based on the use of balanced sampling selection technique and on a GREG-type estimation. The main advantages of the solution is the computational feasibility which allows one to easily implement an overall small area strategy considering jointly the sampling design and the estimator and improving the efficiency of the direct domain estimators. An empirical simulation on real population data and different domain estimators shows the empirical properties of the examined sample strategy.

    Release date: 2008-12-23
Reference (54)

Reference (54) (50 to 60 of 54 results)

  • Surveys and statistical programs – Documentation: 97-555-P
    Description:

    These guides provide information that enables users to effectively use, apply and interpret data from the 2006 Census. Each guide contains definitions and explanations on census concepts. Additional information will be included for specific variables to help general users better understand the concepts and questions used in the census.

    Release date: 2008-01-09

  • Surveys and statistical programs – Documentation: 97-557-P
    Description:

    These guides provide information that enables users to effectively use, apply and interpret data from the 2006 Census. Each guide contains definitions and explanations on census concepts, data quality and historical comparability. Additional information will be included for specific variables to help general users better understand the concepts and questions used in the census.

    Release date: 2008-01-09

  • Surveys and statistical programs – Documentation: 97-555-P2006003
    Description:

    This guide focuses on the following demographic variables: First official language spoken, Home language, Knowledge of non-official languages, Knowledge of official languages, Language of work, and Mother tongue.

    Provides information that enables users to effectively use, apply and interpret data from the 2006 Census. Each guide contains definitions and explanations on census concepts. Additional information will be included for specific variables to help general users better understand the concepts and questions used in the census.

    Release date: 2008-01-09

  • Surveys and statistical programs – Documentation: 97-557-P2006003
    Geography: Canada
    Description:

    This guide focuses on the following demographic variables: Place of birth, Generation status, Citizenship and Immigration.

    Provides information that enables users to effectively use, apply and interpret data from the 2006 Census. Each guide contains definitions and explanations on census concepts. Additional information will be included for specific variables to help general users better understand the concepts and questions used in the census.

    Release date: 2008-01-09
Date modified: