Survey design

Results

All (298): showing results 1 to 10 of 298

  • Articles and reports: 75F0002M2024005
    Description: The Canadian Income Survey (CIS) has introduced improvements to the methods and data sources used to produce income and poverty estimates with the release of its 2022 reference year estimates. Foremost among these improvements is a significant increase in the sample size for a large subset of the CIS content. The weighting methodology was also improved and the target population of the CIS was changed from persons aged 16 years and over to persons aged 15 years and over. This paper describes the changes made and presents the approximate net result of these changes on the income estimates and data quality of the CIS using 2021 data. The changes described in this paper highlight the ways in which data quality has been improved while having little impact on key CIS estimates and trends.
    Release date: 2024-04-26

  • Journals and periodicals: 75F0002M
    Description: This series provides detailed documentation on income developments, including survey design issues, data quality evaluation and exploratory research.
    Release date: 2024-04-26

  • Articles and reports: 11-522-X202200100010
    Description: Growing Up in Québec is a longitudinal population survey that began in the spring of 2021 at the Institut de la statistique du Québec. Among the children targeted by this longitudinal follow-up, some will experience developmental difficulties at some point in their lives. Those same children often have characteristics associated with higher sample attrition (low-income family, parents with a low level of education). This article describes the two main challenges we encountered when trying to ensure sufficient representativeness of these children, in both the overall results and the subpopulation analyses.
    Release date: 2024-03-25

  • Articles and reports: 12-001-X202300200001
    Description: When a Medicare healthcare provider is suspected of billing abuse, a population of payments X made to that provider over a fixed timeframe is isolated. A certified medical reviewer, in a time-consuming process, can determine the overpayment Y = X - (amount justified by the evidence) associated with each payment. Typically, there are too many payments in the population to examine each with care, so a probability sample is selected. The sample overpayments are then used to calculate a 90% lower confidence bound for the total population overpayment. This bound is the amount demanded for recovery from the provider. Unfortunately, classical methods for calculating this bound sometimes fail to provide the 90% confidence level, especially when using a stratified sample.

    In this paper, 166 redacted samples from Medicare integrity investigations are displayed and described, along with 156 associated payment populations. The 7,588 examined (Y, X) sample pairs show (1) Medicare audits have high error rates: more than 76% of these payments were considered to have been paid in error; and (2) the patterns in these samples support an “All-or-Nothing” mixture model for (Y, X) previously defined in the literature. Model-based Monte Carlo testing procedures for Medicare sampling plans are discussed, as well as stratification methods based on anticipated model moments. In terms of viability (achieving the 90% confidence level), a new stratification method defined here is competitive with the best of the many existing methods tested and seems less sensitive to the choice of operating parameters. In terms of overpayment recovery (equivalent to precision), the new method is also comparable to the best of the many existing methods tested. Unfortunately, no stratification algorithm tested was ever viable for more than about half of the 104 test populations. (A small numerical sketch of the classical stratified lower confidence bound appears after this results list.)
    Release date: 2024-01-03

  • Articles and reports: 12-001-X202300200006
    Description: Survey researchers are increasingly turning to multimode data collection to deal with declines in survey response rates and increasing costs. An efficient approach offers the less costly modes (e.g., web) first, followed by a more expensive mode for a subsample of the units (e.g., households) within each primary sampling unit (PSU). We present two alternatives to this traditional design. One alternative subsamples PSUs rather than units to constrain costs. The second is a hybrid design that includes a clustered (two-stage) sample and an independent, unclustered sample. Using a simulation, we demonstrate that the hybrid design has considerable advantages.
    Release date: 2024-01-03

  • Articles and reports: 12-001-X202300200008
    Description: In this article, we use a slightly simplified version of the method of Fickus, Mixon and Poteet (2013) to define a flexible parameterization of the kernels of determinantal sampling designs with fixed first-order inclusion probabilities. For specific values of the multidimensional parameter, we recover a matrix from the family PII of Loonis and Mary (2019). We conjecture that, among the determinantal designs with fixed inclusion probabilities, the minimum variance of the Horvitz and Thompson (1952) estimator of a variable of interest is expressed relative to PII. We provide experimental R programs that help readers work through various concepts presented in the article, some of which are described as non-trivial by Fickus et al. (2013). A longer version of this article, including proofs and a more detailed presentation of the determinantal designs, is also available. (An elementary sketch of the Horvitz-Thompson estimator itself appears after this results list.)
    Release date: 2024-01-03

  • Articles and reports: 12-001-X202300200010
    Description: Sample coordination methods aim to increase (in positive coordination) or decrease (in negative coordination) the size of the overlap between samples. The samples considered can be from different occasions of a repeated survey and/or from different surveys covering a common population. Negative coordination is used to control the response burden in a given period, because some units do not respond to survey questionnaires if they are selected in many samples. Usually, methods for sample coordination do not take into account any measure of the response burden that a unit has already expended in responding to previous surveys. We introduce such a measure into a new method by adapting a spatially balanced sampling scheme, based on a generalization of Poisson sampling, together with a negative coordination method. The goal is to create a double control of the burden for these units: once by using a measure of burden during the sampling process and once by using a negative coordination method. We evaluate the approach using Monte Carlo simulation and investigate its use for controlling selection “hot-spots” in business surveys at Statistics Netherlands. (A simplified sketch of permanent-random-number coordination appears after this results list.)
    Release date: 2024-01-03

  • Articles and reports: 12-001-X202300200016
    Description: In this discussion, I will present some additional aspects of three major areas of survey theory developed or studied by Jean-Claude Deville: calibration, balanced sampling and the generalized weight-share method.
    Release date: 2024-01-03

  • Articles and reports: 75F0002M2023005
    Description: The Canadian Income Survey (CIS) has introduced improvements to the methods and systems used to produce income estimates with the release of its 2021 reference year estimates. This paper describes the changes and presents the approximate net result of these changes on income estimates using data for 2019 and 2020. The changes described in this paper highlight the ways in which data quality has been improved while producing minimal impact on key CIS estimates and trends.
    Release date: 2023-08-29

  • Articles and reports: 12-001-X202300100009
    Description: In this paper, with- and without-replacement versions of adaptive proportional-to-size sampling are presented. Unbiased estimators are developed for these methods and their properties are studied. In the two versions, the drawing probabilities are adapted during the sampling process based on the observations already selected. To this end, in the with-replacement version, after each draw and observation of the variable of interest, the vector of the auxiliary variable is updated using the observed values of the variable of interest to approximate the exact selection probability proportional to size. For the without-replacement version, first, using an initial sample, we model the relationship between the variable of interest and the auxiliary variable. Then, utilizing this relationship, we estimate the unknown (unobserved) population units. Finally, on these estimated population units, we select a new sample proportional to size without replacement. These approaches can significantly improve the efficiency of designs not only in the case of a positive linear relationship, but also in the case of a non-linear or negative linear relationship between the variables. We investigate the efficiencies of the designs through simulations and real case studies on medicinal flowers and on social and economic data. (A rough sketch of the with-replacement updating idea appears after this results list.)
    Release date: 2023-06-30
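
For the Medicare overpayment entry above (12-001-X202300200001), the sketch below illustrates the classical calculation the abstract refers to: a stratified expansion estimate of the total overpayment and a one-sided 90% lower confidence bound under a normal approximation. The function name and toy data are invented for illustration, and the normal quantile stands in for whatever reference distribution an auditor would actually use; the paper's point is precisely that such classical bounds can fall short of the nominal 90% level.

    # Minimal sketch: stratified estimate of a total overpayment and a one-sided
    # 90% lower confidence bound (normal approximation, illustrative only).
    from statistics import NormalDist, mean, stdev

    def stratified_lower_bound(strata, conf=0.90):
        """strata: list of (N_h, sample_overpayments) pairs, one per stratum."""
        total, var = 0.0, 0.0
        for N_h, y in strata:
            n_h = len(y)
            s2 = stdev(y) ** 2 if n_h > 1 else 0.0
            total += N_h * mean(y)                          # expansion estimate of the total
            var += N_h ** 2 * (1 - n_h / N_h) * s2 / n_h    # with finite-population correction
        z = NormalDist().inv_cdf(conf)                      # about 1.2816 for a 90% one-sided bound
        return total - z * var ** 0.5

    # toy example: two strata of 200 and 50 payments, overpayments observed in a sample
    print(stratified_lower_bound([(200, [0, 50, 120, 0, 80]), (50, [300, 450, 275])]))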
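
The determinantal-design entry above (12-001-X202300200008) centres on the Horvitz-Thompson estimator under fixed first-order inclusion probabilities. The sketch below shows only that estimator, using Poisson sampling as an illustrative stand-in for a determinantal design (which is far more involved to implement); the names and data are invented.

    import random

    def poisson_sample(pi):
        # Independent selections with inclusion probabilities pi; an illustrative
        # stand-in, not the determinantal designs studied in the article.
        return [i for i, p in enumerate(pi) if random.random() < p]

    def horvitz_thompson_total(y, pi, sample):
        # Unbiased for the population total under any design with these first-order
        # inclusion probabilities; the article studies how the design affects its variance.
        return sum(y[i] / pi[i] for i in sample)

    y  = [12.0, 7.5, 30.0, 4.2, 18.0]
    pi = [0.5, 0.2, 0.8, 0.3, 0.6]
    s = poisson_sample(pi)
    print(s, horvitz_thompson_total(y, pi, s))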
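
For the sample-coordination entry above (12-001-X202300200010), the sketch below shows the textbook permanent-random-number (PRN) device on which negative coordination is commonly built, with an ad hoc down-weighting of heavily burdened units. It is a simplified illustration, not the spatially balanced, generalized-Poisson scheme developed in the paper; the burden rule and all names are assumptions.

    import random

    units = {u: random.random() for u in range(1000)}   # one permanent random number per unit
    burden = {u: 0 for u in units}                      # questionnaires completed so far

    def coordinated_poisson_sample(pi, shift):
        # Select unit u if its shifted PRN falls below its (burden-adjusted) probability.
        sample = []
        for u, prn in units.items():
            p = pi / (1 + burden[u])        # ad hoc: lower probability for burdened units
            if (prn + shift) % 1.0 < p:
                sample.append(u)
        return sample

    for survey, shift in enumerate([0.0, 0.3, 0.6]):    # shifting the start point gives negative coordination
        selected = coordinated_poisson_sample(pi=0.1, shift=shift)
        for u in selected:
            burden[u] += 1
        print(f"survey {survey}: {len(selected)} units selected")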
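
For the adaptive proportional-to-size entry above (12-001-X202300100009), the sketch below illustrates only the with-replacement updating idea: after each draw, the selected unit's auxiliary value is replaced by its observed value of the variable of interest, so later draws use probabilities closer to exactly proportional to that variable. The unbiased estimators derived in the paper are not reproduced here, and the data are invented.

    import random

    def adaptive_pps_with_replacement(x, y, n_draws):
        x = list(x)                               # working auxiliary vector, updated as we go
        sample = []
        for _ in range(n_draws):
            total = sum(x)
            probs = [v / total for v in x]
            i = random.choices(range(len(x)), weights=probs, k=1)[0]
            sample.append((i, y[i], probs[i]))    # record unit, observation, draw probability
            x[i] = y[i]                           # replace auxiliary value with observed y
        return sample

    x = [10, 20, 5, 40, 25]                       # auxiliary sizes
    y = [12, 15, 4, 55, 20]                       # variable of interest, observed on selection
    print(adaptive_pps_with_replacement(x, y, n_draws=3))
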
Data (0)

No content available at this time.

Reference (29): showing results 21 to 29 of 29

  • Surveys and statistical programs – Documentation: 11-522-X19980015027
    Description: The disseminated results of annual business surveys inevitably contain statistics that are changing. Since the economic sphere is increasingly dynamic, a simple difference of aggregates between year n-1 and year n is no longer sufficient to provide an overall description of what has happened. The change calculation module in the new generation of annual business surveys divides overall change into various components (births, deaths, inter-industry migration) and calculates change on the basis of a constant field, assigning special importance to restructurings. The main difficulties lie in establishing subsamples, reweighting, calibrating according to calculable changes, and taking account of restructuring. (A toy sketch of this decomposition of change appears after this list.)
    Release date: 1999-10-22

  • Surveys and statistical programs – Documentation: 11-522-X19980015029
    Description: In longitudinal surveys, sample subjects are observed over several time points. This feature typically leads to dependent observations on the same subject, in addition to the customary correlations across subjects induced by the sample design. Much research in the literature has focused on modeling the marginal mean of a response as a function of covariates. Liang and Zeger (1986) used generalized estimating equations (GEE), requiring only correct specification of the marginal mean, and obtained standard errors of regression parameter estimates and associated Wald tests, assuming a "working" correlation structure for the repeated measurements on a sample subject. Rotnitzky and Jewell (1990) developed quasi-score tests and Rao-Scott adjustments to "working" quasi-score tests under marginal models. These methods are asymptotically robust to misspecification of the within-subject correlation structure, but they assume independence of sample subjects, which is not satisfied for complex longitudinal survey data based on stratified multi-stage sampling. We propose asymptotically valid Wald and quasi-score tests for longitudinal survey data, using the Taylor linearization and jackknife methods. Alternative tests, based on Rao-Scott adjustments to naive tests that ignore survey design features and on the Bonferroni-t, are also developed. These tests are particularly useful when the effective degrees of freedom, usually taken as the total number of sample primary units (clusters) minus the number of strata, is small. (A minimal sketch of the "working independence" GEE idea appears after this list.)
    Release date: 1999-10-22

  • Surveys and statistical programs – Documentation: 11-522-X19980015030
    Description: Two-phase sampling designs have been conducted in waves to estimate the incidence of a rare disease such as dementia. Estimation of disease incidence from a longitudinal dementia study has to adjust appropriately for data missing by death as well as for the sampling design used at each study wave. In this paper we adopt a selection model approach to model the data missing by death and use a likelihood approach to derive incidence estimates. A modified EM algorithm is used to deal with data missing by sampling selection. The non-parametric jackknife variance estimator is used to derive variance estimates for the model parameters and the incidence estimates. The proposed approaches are applied to data from the Indianapolis-Ibadan Dementia Study.
    Release date: 1999-10-22

  • Surveys and statistical programs – Documentation: 11-522-X19980015035
    Description: In a longitudinal survey conducted for k periods, some units may be observed for fewer than k of the periods. Examples include surveys designed with partially overlapping subsamples, a pure panel survey with nonresponse, and a panel survey supplemented with additional samples for some of the time periods. Estimators of the regression type are exhibited for such surveys. An application to special studies associated with the National Resources Inventory is discussed.
    Release date: 1999-10-22

  • Notices and consultations: 13F0026M1999001
    Description: The main objectives of a new Canadian survey measuring the asset and debt holdings of families and individuals will be to update wealth information that is over one decade old; to improve the reliability of the wealth estimates; and to provide a primary tool for analysing many important policy issues related to the distribution of assets and debts, future consumption possibilities, and savings behaviour that is of interest to governments, business and communities.

    This paper is the document that launched the development of the new asset and debt survey, subsequently renamed the Survey of Financial Security. It looks at the conceptual framework for the survey, including the appropriate unit of measurement (family, household or person), and discusses measurement issues such as establishing an accounting framework for assets and debts. The variables proposed for inclusion are also identified. The paper poses several questions to readers and asks for comments and feedback.
    Release date: 1999-03-23

  • Notices and consultations: 13F0026M1999002
    Description: This document summarizes the comments and feedback received on an earlier document: Towards a new Canadian asset and debt survey - A content discussion paper. The new asset and debt survey (now called the Survey of Financial Security) is to update the wealth information on Canadian families and unattached individuals. Since the last data collection was conducted in 1984, it was essential to include a consultative process in the development of the survey in order to obtain feedback on issues of concern and to define the conceptual framework for the survey.

    Comments on the content discussion paper are summarized by major theme, and sections indicate how the suggestions are being incorporated into the survey or why they could not be incorporated. This paper also mentions the main objectives of the survey and provides an overview of the survey content, revised according to the feedback from the discussion paper.
    Release date: 1999-03-23

  • Surveys and statistical programs – Documentation: 13F0026M1999003
    Description: This paper presents a proposal for conducting a Canadian asset and debt survey. The first step in preparing this proposal was the release, in February 1997, of a document entitled Towards a new Canadian asset and debt survey, whose intent was to elicit feedback on the initial thinking regarding the content of the survey.

    This paper reviews the conceptual framework for a new asset and debt survey, data requirements, survey design, collection methodology and testing. It also provides an overview of the anticipated data processing system, describes the analysis and dissemination plan (analytical products and microdata files), and identifies the survey costs and major milestones. Finally, it presents the management/coordination approach used.
    Release date: 1999-03-23

  • Surveys and statistical programs – Documentation: 75F0002M1993019
    Description: This paper examines the issues and the procedures designed to maintain a representative sample of the population for the Survey of Labour and Income Dynamics (SLID).
    Release date: 1995-12-30

  • Surveys and statistical programs – Documentation: 75F0002M1994001
    Description: This paper describes the "following rules" of the Survey of Labour and Income Dynamics (SLID), which govern who is traced and who is interviewed. It also outlines the conceptual basis for these procedures.
    Release date: 1995-12-30
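
For the change-calculation entry above (11-522-X19980015027), the toy sketch below shows the basic decomposition it describes: units present in both years form the constant field, and the remaining change is attributed to births and deaths. Inter-industry migration, restructurings, reweighting and calibration are all omitted; the names and figures are invented.

    def decompose_change(year_prev, year_curr):
        # year_prev / year_curr: dicts mapping unit id -> value (e.g., turnover)
        common = year_prev.keys() & year_curr.keys()
        births = sum(v for u, v in year_curr.items() if u not in year_prev)
        deaths = -sum(v for u, v in year_prev.items() if u not in year_curr)
        constant_field = sum(year_curr[u] - year_prev[u] for u in common)
        return {"births": births, "deaths": deaths, "constant field": constant_field,
                "total change": births + deaths + constant_field}

    # unit "c" dies, unit "d" is born; the constant field is {"a", "b"}
    print(decompose_change({"a": 100, "b": 50, "c": 30}, {"a": 110, "b": 40, "d": 25}))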
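
For the longitudinal-survey entry above (11-522-X19980015029), the sketch below shows the simplest "working independence" flavour of the GEE idea for a linear marginal mean: the point estimate ignores within-subject correlation, and a sandwich variance clustered on the subject protects the standard errors against that misspecification. The design-based Wald and quasi-score tests, Taylor linearization and jackknife developed in the paper are not reproduced; numpy is assumed available, and all names and data are invented.

    import numpy as np

    def gee_independence(X, y, subject):
        X, y = np.asarray(X, float), np.asarray(y, float)
        subject = np.asarray(subject)
        beta = np.linalg.lstsq(X, y, rcond=None)[0]      # working-independence estimate (OLS)
        bread = np.linalg.inv(X.T @ X)
        meat = np.zeros((X.shape[1], X.shape[1]))
        for s in np.unique(subject):
            idx = np.where(subject == s)[0]
            u = X[idx].T @ (y[idx] - X[idx] @ beta)      # subject-level score contribution
            meat += np.outer(u, u)
        cov = bread @ meat @ bread                       # sandwich covariance, clustered on subject
        return beta, np.sqrt(np.diag(cov))

    X = [[1, 0], [1, 1], [1, 0], [1, 1], [1, 0], [1, 1]]   # intercept + time indicator
    y = [2.0, 2.9, 1.8, 3.1, 2.2, 2.7]
    subject = [1, 1, 2, 2, 3, 3]
    print(gee_independence(X, y, subject))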