Analysis

Skip to filters. View results.

Statistics Canada's Trust Centre

Results

All (69)

All (69) (0 to 10 of 69 results)

1. Unequal probability inverse sampling Archived
Articles and reports: 12-001-X201600214660
Description:
In an economic survey of a sample of enterprises, occupations are randomly selected from a list until a number r of occupations in a local unit has been identified. This is an inverse sampling problem for which we are proposing a few solutions. Simple designs with and without replacement are processed using negative binomial distributions and negative hypergeometric distributions. We also propose estimators for when the units are selected with unequal probabilities, with or without replacement.
Release date: 2016-12-20
2. A few remarks on a small example by Jean-Claude Deville regarding non-ignorable non-response Archived
Articles and reports: 12-001-X201600214661
Description:
An example presented by Jean-Claude Deville in 2005 is subjected to three estimation methods: the method of moments, the maximum likelihood method, and generalized calibration. The three methods yield exactly the same results for the two non-response models. A discussion follows on how to choose the most appropriate model.
Release date: 2016-12-20
3. A note on the concept of invariance in two-phase sampling designs Archived
Articles and reports: 12-001-X201600214662
Description:
Two-phase sampling designs are often used in surveys when the sampling frame contains little or no auxiliary information. In this note, we shed some light on the concept of invariance, which is often mentioned in the context of two-phase sampling designs. We define two types of invariant two-phase designs: strongly invariant and weakly invariant two-phase designs. Some examples are given. Finally, we describe the implications of strong and weak invariance from an inference point of view.
Release date: 2016-12-20
4. Reducing the response imbalance: Is the accuracy of the survey estimates improved? Archived
Articles and reports: 12-001-X201600214663
Description:
We present theoretical evidence that efforts during data collection to balance the survey response with respect to selected auxiliary variables will improve the chances for low nonresponse bias in the estimates that are ultimately produced by calibrated weighting. One of our results shows that the variance of the bias – measured here as the deviation of the calibration estimator from the (unrealized) full-sample unbiased estimator – decreases linearly as a function of the response imbalance that we assume measured and controlled continuously over the data collection period. An attractive prospect is thus a lower risk of bias if one can manage the data collection to get low imbalance. The theoretical results are validated in a simulation study with real data from an Estonian household survey.
Release date: 2016-12-20
5. Statistical inference based on judgment post-stratified samples in finite population Archived
Articles and reports: 12-001-X201600214664
Description:
This paper draws statistical inference for finite population mean based on judgment post stratified (JPS) samples. The JPS sample first selects a simple random sample and then stratifies the selected units into H judgment classes based on their relative positions (ranks) in a small set of size H. This leads to a sample with random sample sizes in judgment classes. Ranking process can be performed either using auxiliary variables or visual inspection to identify the ranks of the measured observations. The paper develops unbiased estimator and constructs confidence interval for population mean. Since judgment ranks are random variables, by conditioning on the measured observations we construct Rao-Blackwellized estimators for the population mean. The paper shows that Rao-Blackwellized estimators perform better than usual JPS estimators. The proposed estimators are applied to 2012 United States Department of Agriculture Census Data.
Release date: 2016-12-20
6. A cautionary note on Clark Winsorization Archived
Articles and reports: 12-001-X201600214676
Description:
Winsorization procedures replace extreme values with less extreme values, effectively moving the original extreme values toward the center of the distribution. Winsorization therefore both detects and treats influential values. Mulry, Oliver and Kaputa (2014) compare the performance of the one-sided Winsorization method developed by Clark (1995) and described by Chambers, Kokic, Smith and Cruddas (2000) to the performance of M-estimation (Beaumont and Alavi 2004) in highly skewed business population data. One aspect of particular interest for methods that detect and treat influential values is the range of values designated as influential, called the detection region. The Clark Winsorization algorithm is easy to implement and can be extremely effective. However, the resultant detection region is highly dependent on the number of influential values in the sample, especially when the survey totals are expected to vary greatly by collection period. In this note, we examine the effect of the number and magnitude of influential values on the detection regions from Clark Winsorization using data simulated to realistically reflect the properties of the population for the Monthly Retail Trade Survey (MRTS) conducted by the U.S. Census Bureau. Estimates from the MRTS and other economic surveys are used in economic indicators, such as the Gross Domestic Product (GDP).
Release date: 2016-12-20
7. Tests for evaluating nonresponse bias in surveys Archived
Articles and reports: 12-001-X201600214677
Description:
How do we tell whether weighting adjustments reduce nonresponse bias? If a variable is measured for everyone in the selected sample, then the design weights can be used to calculate an approximately unbiased estimate of the population mean or total for that variable. A second estimate of the population mean or total can be calculated using the survey respondents only, with weights that have been adjusted for nonresponse. If the two estimates disagree, then there is evidence that the weight adjustments may not have removed the nonresponse bias for that variable. In this paper we develop the theoretical properties of linearization and jackknife variance estimators for evaluating the bias of an estimated population mean or total by comparing estimates calculated from overlapping subsets of the same data with different sets of weights, when poststratification or inverse propensity weighting is used for the nonresponse adjustments to the weights. We provide sufficient conditions on the population, sample, and response mechanism for the variance estimators to be consistent, and demonstrate their small-sample properties through a simulation study.
Release date: 2016-12-20
8. Adaptive rectangular sampling: An easy, incomplete, neighbourhood-free adaptive cluster sampling design Archived
Articles and reports: 12-001-X201600214684
Description:
This paper introduces an incomplete adaptive cluster sampling design that is easy to implement, controls the sample size well, and does not need to follow the neighbourhood. In this design, an initial sample is first selected, using one of the conventional designs. If a cell satisfies a prespecified condition, a specified radius around the cell is sampled completely. The population mean is estimated using the \pi-estimator. If all the inclusion probabilities are known, then an unbiased \pi estimator is available; if, depending on the situation, the inclusion probabilities are not known for some of the final sample units, then they are estimated. To estimate the inclusion probabilities, a biased estimator is constructed. However, the simulations show that if the sample size is large enough, the error of the inclusion probabilities is negligible, and the relative \pi-estimator is almost unbiased. This design rivals adaptive cluster sampling because it controls the final sample size and is easy to manage. It rivals adaptive two-stage sequential sampling because it considers the cluster form of the population and reduces the cost of moving across the area. Using real data on a bird population and simulations, the paper compares the design with adaptive two-stage sequential sampling. The simulations show that the design has significant efficiency in comparison with its rival.
Release date: 2016-12-20
9. The Measurement of Firm Entry in the Longitudinal Employment Analysis Program Archived
Articles and reports: 11-633-X2016004
Description:
Understanding the importance of the dynamic entry process in the Canadian economy involves measuring the amount and size of firm entry. The paper presents estimates of the importance of firm entry in Canada. It uses the database underlying the Longitudinal Employment Analysis Program (LEAP), which has produced measures of firm entry and exit since 1988. This paper discusses the methodology used to estimate entry and exit, the issues that had to be resolved and the reasons for choosing the particular solutions that were adopted. It then presents measures that are derived from LEAP. Finally, it analyzes the sensitivity of the estimates associated with LEAP to alternative methods of estimating entry and exit.
Release date: 2016-11-10
10. An Overview of Selected International Business Record Linkage Programs Archived
Articles and reports: 18-001-X2016001
Description:
Although the record linkage of business data is not a completely new topic, the fact remains that the public and many data users are unaware of the programs and practices commonly used by statistical agencies across the world.
This report is a brief overview of the main practices, programs and challenges of record linkage of statistical agencies across the world who answered a short survey on this subject supplemented by publically available documentation produced by these agencies. The document shows that the linkage practices are similar between these statistical agencies; however the main differences are in the procedures in place to access to data along with regulatory policies that govern the record linkage permissions and the dissemination of data.
Release date: 2016-10-27

Stats in brief (1)

Stats in brief (1) ((1 result))

1. Enterprise Portfolio Management Archived
Stats in brief: 11-629-X2016003
Description: Discover how the Enterprise Portfolio Management team (EPM) supports some of Canada’s largest enterprises.
Release date: 2016-06-02

Articles and reports (67)

Articles and reports (67) (0 to 10 of 67 results)

1. Unequal probability inverse sampling Archived
Articles and reports: 12-001-X201600214660
Description:
In an economic survey of a sample of enterprises, occupations are randomly selected from a list until a number r of occupations in a local unit has been identified. This is an inverse sampling problem for which we are proposing a few solutions. Simple designs with and without replacement are processed using negative binomial distributions and negative hypergeometric distributions. We also propose estimators for when the units are selected with unequal probabilities, with or without replacement.
Release date: 2016-12-20
2. A few remarks on a small example by Jean-Claude Deville regarding non-ignorable non-response Archived
Articles and reports: 12-001-X201600214661
Description:
An example presented by Jean-Claude Deville in 2005 is subjected to three estimation methods: the method of moments, the maximum likelihood method, and generalized calibration. The three methods yield exactly the same results for the two non-response models. A discussion follows on how to choose the most appropriate model.
Release date: 2016-12-20
3. A note on the concept of invariance in two-phase sampling designs Archived
Articles and reports: 12-001-X201600214662
Description:
Two-phase sampling designs are often used in surveys when the sampling frame contains little or no auxiliary information. In this note, we shed some light on the concept of invariance, which is often mentioned in the context of two-phase sampling designs. We define two types of invariant two-phase designs: strongly invariant and weakly invariant two-phase designs. Some examples are given. Finally, we describe the implications of strong and weak invariance from an inference point of view.
Release date: 2016-12-20
4. Reducing the response imbalance: Is the accuracy of the survey estimates improved? Archived
Articles and reports: 12-001-X201600214663
Description:
We present theoretical evidence that efforts during data collection to balance the survey response with respect to selected auxiliary variables will improve the chances for low nonresponse bias in the estimates that are ultimately produced by calibrated weighting. One of our results shows that the variance of the bias – measured here as the deviation of the calibration estimator from the (unrealized) full-sample unbiased estimator – decreases linearly as a function of the response imbalance that we assume measured and controlled continuously over the data collection period. An attractive prospect is thus a lower risk of bias if one can manage the data collection to get low imbalance. The theoretical results are validated in a simulation study with real data from an Estonian household survey.
Release date: 2016-12-20
5. Statistical inference based on judgment post-stratified samples in finite population Archived
Articles and reports: 12-001-X201600214664
Description:
This paper draws statistical inference for finite population mean based on judgment post stratified (JPS) samples. The JPS sample first selects a simple random sample and then stratifies the selected units into H judgment classes based on their relative positions (ranks) in a small set of size H. This leads to a sample with random sample sizes in judgment classes. Ranking process can be performed either using auxiliary variables or visual inspection to identify the ranks of the measured observations. The paper develops unbiased estimator and constructs confidence interval for population mean. Since judgment ranks are random variables, by conditioning on the measured observations we construct Rao-Blackwellized estimators for the population mean. The paper shows that Rao-Blackwellized estimators perform better than usual JPS estimators. The proposed estimators are applied to 2012 United States Department of Agriculture Census Data.
Release date: 2016-12-20
6. A cautionary note on Clark Winsorization Archived
Articles and reports: 12-001-X201600214676
Description:
Winsorization procedures replace extreme values with less extreme values, effectively moving the original extreme values toward the center of the distribution. Winsorization therefore both detects and treats influential values. Mulry, Oliver and Kaputa (2014) compare the performance of the one-sided Winsorization method developed by Clark (1995) and described by Chambers, Kokic, Smith and Cruddas (2000) to the performance of M-estimation (Beaumont and Alavi 2004) in highly skewed business population data. One aspect of particular interest for methods that detect and treat influential values is the range of values designated as influential, called the detection region. The Clark Winsorization algorithm is easy to implement and can be extremely effective. However, the resultant detection region is highly dependent on the number of influential values in the sample, especially when the survey totals are expected to vary greatly by collection period. In this note, we examine the effect of the number and magnitude of influential values on the detection regions from Clark Winsorization using data simulated to realistically reflect the properties of the population for the Monthly Retail Trade Survey (MRTS) conducted by the U.S. Census Bureau. Estimates from the MRTS and other economic surveys are used in economic indicators, such as the Gross Domestic Product (GDP).
Release date: 2016-12-20
7. Tests for evaluating nonresponse bias in surveys Archived
Articles and reports: 12-001-X201600214677
Description:
How do we tell whether weighting adjustments reduce nonresponse bias? If a variable is measured for everyone in the selected sample, then the design weights can be used to calculate an approximately unbiased estimate of the population mean or total for that variable. A second estimate of the population mean or total can be calculated using the survey respondents only, with weights that have been adjusted for nonresponse. If the two estimates disagree, then there is evidence that the weight adjustments may not have removed the nonresponse bias for that variable. In this paper we develop the theoretical properties of linearization and jackknife variance estimators for evaluating the bias of an estimated population mean or total by comparing estimates calculated from overlapping subsets of the same data with different sets of weights, when poststratification or inverse propensity weighting is used for the nonresponse adjustments to the weights. We provide sufficient conditions on the population, sample, and response mechanism for the variance estimators to be consistent, and demonstrate their small-sample properties through a simulation study.
Release date: 2016-12-20
8. Adaptive rectangular sampling: An easy, incomplete, neighbourhood-free adaptive cluster sampling design Archived
Articles and reports: 12-001-X201600214684
Description:
This paper introduces an incomplete adaptive cluster sampling design that is easy to implement, controls the sample size well, and does not need to follow the neighbourhood. In this design, an initial sample is first selected, using one of the conventional designs. If a cell satisfies a prespecified condition, a specified radius around the cell is sampled completely. The population mean is estimated using the \pi-estimator. If all the inclusion probabilities are known, then an unbiased \pi estimator is available; if, depending on the situation, the inclusion probabilities are not known for some of the final sample units, then they are estimated. To estimate the inclusion probabilities, a biased estimator is constructed. However, the simulations show that if the sample size is large enough, the error of the inclusion probabilities is negligible, and the relative \pi-estimator is almost unbiased. This design rivals adaptive cluster sampling because it controls the final sample size and is easy to manage. It rivals adaptive two-stage sequential sampling because it considers the cluster form of the population and reduces the cost of moving across the area. Using real data on a bird population and simulations, the paper compares the design with adaptive two-stage sequential sampling. The simulations show that the design has significant efficiency in comparison with its rival.
Release date: 2016-12-20
9. The Measurement of Firm Entry in the Longitudinal Employment Analysis Program Archived
Articles and reports: 11-633-X2016004
Description:
Understanding the importance of the dynamic entry process in the Canadian economy involves measuring the amount and size of firm entry. The paper presents estimates of the importance of firm entry in Canada. It uses the database underlying the Longitudinal Employment Analysis Program (LEAP), which has produced measures of firm entry and exit since 1988. This paper discusses the methodology used to estimate entry and exit, the issues that had to be resolved and the reasons for choosing the particular solutions that were adopted. It then presents measures that are derived from LEAP. Finally, it analyzes the sensitivity of the estimates associated with LEAP to alternative methods of estimating entry and exit.
Release date: 2016-11-10
10. An Overview of Selected International Business Record Linkage Programs Archived
Articles and reports: 18-001-X2016001
Description:
Although the record linkage of business data is not a completely new topic, the fact remains that the public and many data users are unaware of the programs and practices commonly used by statistical agencies across the world.
This report is a brief overview of the main practices, programs and challenges of record linkage of statistical agencies across the world who answered a short survey on this subject supplemented by publically available documentation produced by these agencies. The document shows that the linkage practices are similar between these statistical agencies; however the main differences are in the procedures in place to access to data along with regulatory policies that govern the record linkage permissions and the dissemination of data.
Release date: 2016-10-27

Journals and periodicals (1)

Journals and periodicals (1) ((1 result))

1. Compendium of Management Practices for Statistical Organizations from Statistics Canada's International Statistical Fellowship Program
Journals and periodicals: 11-634-X
Description:
This publication is a catalogue of strategies and mechanisms that a statistical organization should consider adopting, according to its particular context. This compendium is based on lessons learned and best practices of leadership and management of statistical agencies within the scope of Statistics Canada’s International Statistical Fellowship Program (ISFP). It contains four broad sections including, characteristics of an effective national statistical system; core management practices; improving, modernizing and finding efficiencies; and, strategies to better inform and engage key stakeholders.
Release date: 2016-07-06

Report a problem or mistake on this page

Date modified:: 2024-09-25

Language selection

Search and menus

Search

Analysis

Filter results by

Keyword(s)

Subject

Year of publication

Author(s)

Survey or statistical program

Content

Results

All (69) (0 to 10 of 69 results)

Stats in brief (1) ((1 result))

Articles and reports (67) (0 to 10 of 67 results)

Journals and periodicals (1) ((1 result))

Analysis

Filter results by

Keyword(s)

Subject

Year of publication

Author(s)

Survey or statistical program

Content

Results

All (69) (0 to 10 of 69 results)

Stats in brief (1) ((1 result))

Articles and reports (67) (0 to 10 of 67 results)

Journals and periodicals (1) ((1 result))

How do I use the filters and the search box?

How do I refine my search?

How does the search work?

How are the results ordered?

How are the results ordered?