Keyword search

Results

All (171)

All (171) (0 to 10 of 171 results)

1. The Business and Community Newsletter
Journals and periodicals: 11-632-X
Description: The newsletter offers information aimed at three main groups, businesses (small to medium), communities and ethno-cultural groups/communities. Articles and outreach materials will assist their understanding of national and local data from the many relevant sources found on the Statistics Canada website.
Release date: 2024-09-19
2. Heterogeneous causal effects of labour market programs: A machine learning approach Archived
Articles and reports: 11-522-X202200100017
Description: In this paper, we look for presence of heterogeneity in conducting impact evaluations of the Skills Development intervention delivered under the Labour Market Development Agreements. We use linked longitudinal administrative data covering a sample of Skills Development participants from 2010 to 2017. We apply a causal machine-learning estimator as in Lechner (2019) to estimate the individualized program impacts at the finest aggregation level. These granular impacts reveal the distribution of net impacts facilitating further investigation as to what works for whom. The findings suggest statistically significant improvements in labour market outcomes for participants overall and for subgroups of policy interest.
Release date: 2024-06-28
3. Labour productivity measurement at Statistics Canada
Articles and reports: 13-605-X202400100003
Description: The document focuses on the evolution of Statistics Canada's labour productivity program, tracing its historical background, outlining its structure, as well as detailing the methodology and data sources used. It then discusses the diverse applications of provincial productivity data, identifies key users of productivity statistics, and highlights essential considerations for their interpretation. Finally, the document addresses the review process for quarterly and annual productivity measures and recent program improvements.
Release date: 2024-06-05
4. A Model-based Disaggregation Method for Estimation of Adult Competency Archived
Articles and reports: 11-522-X202200100003
Description: Estimation at fine levels of aggregation is necessary to better describe society. Small area estimation model-based approaches that combine sparse survey data with rich data from auxiliary sources have been proven useful to improve the reliability of estimates for small domains. Considered here is a scenario where small area model-based estimates, produced at a given aggregation level, needed to be disaggregated to better describe the social structure at finer levels. For this scenario, an allocation method was developed to implement the disaggregation, overcoming challenges associated with data availability and model development at such fine levels. The method is applied to adult literacy and numeracy estimation at the county-by-group-level, using data from the U.S. Program for the International Assessment of Adult Competencies. In this application the groups are defined in terms of age or education, but the method could be applied to estimation of other equity-deserving groups.
Release date: 2024-03-25
5. Application of sampling variance smoothing methods for small area proportion estimation Archived
Articles and reports: 11-522-X202200100005
Description: Sampling variance smoothing is an important topic in small area estimation. In this paper, we propose sampling variance smoothing methods for small area proportion estimation. In particular, we consider the generalized variance function and design effect methods for sampling variance smoothing. We evaluate and compare the smoothed sampling variances and small area estimates based on the smoothed variance estimates through analysis of survey data from Statistics Canada. The results from real data analysis indicate that the proposed sampling variance smoothing methods work very well for small area estimation.
Release date: 2024-03-25
6. Measuring the number of food aid recipients Archived
Articles and reports: 11-522-X202200100013
Description: Respondents to typical household surveys tend to significantly underreport their potential use of food aid distributed by associations. This underreporting is most likely related to the social stigma felt by people experiencing great financial difficulty. As a result, survey estimates of the number of recipients of that aid are much lower than the direct counts from the associations. Those counts tend to overestimate due to double counting. Through its adapted protocol, the Enquête Aide alimentaire (EAA) collected in late 2021 in France at a sample of sites of food aid distribution associations, controls the biases that affect the other sources and determines to what extent this aid is used.
Release date: 2024-03-25
7. Evaluating sampling methods for ethnic minorities Archived
Articles and reports: 11-522-X202200100014
Description: Ethnic minorities are often underrepresented in survey research, due to the challenges many researchers face in including these populations. While some studies discuss several methods in comparison, few have directly compared these methods empirically, leaving researchers seeking to include ethnic minorities in their studies unsure of their best options. In this article, I briefly review the methodological and ethical reasons for increasing ethnic minority representation in social science research, as well as challenges of doing so. I then present findings from ten studies which empirically compare methods of sampling and/or recruiting ethnic minority individuals. Finally, I discuss some implications for future research.
Release date: 2024-03-25
8. Bayesian model assisted design-based estimators of the size, total and mean of a hard-to-reach population from a link-tracing sample with initial cluster sample Archived
Articles and reports: 11-522-X202200100015
Description: We present design-based Horvitz-Thompson and multiplicity estimators of the population size, as well as of the total and mean of a response variable associated with the elements of a hidden population to be used with the link-tracing sampling variant proposed by Félix-Medina and Thompson (2004). Since the computation of the estimators requires to know the inclusion probabilities of the sampled people, but they are unknown, we propose a Bayesian model which allows us to estimate them, and consequently to compute the estimators of the population parameters. The results of a small numeric study indicate that the performance of the proposed estimators is acceptable.
Release date: 2024-03-25
9. From theory to practice: Lessons learned from implementing the Network Sampling with Memory method Archived
Articles and reports: 11-522-X202200100016
Description: To overcome the traditional drawbacks of chain sampling methods, the sampling method called “network sampling with memory” was developed. Its unique feature is to recreate, gradually in the field, a frame for the target population composed of individuals identified by respondents and to randomly draw future respondents from this frame, thereby minimizing selection bias. Tested for the first time in France between September 2020 and June 2021, for a survey among Chinese immigrants in Île-de-France (ChIPRe), this presentation describes the difficulties encountered during collection—sometimes contextual, due to the pandemic, but mostly inherent to the method.
Release date: 2024-03-25
10. Integration of existing data to develop an ethnicity indicator in the LSDDP Archived
Articles and reports: 11-522-X202200100018
Description: The Longitudinal Social Data Development Program (LSDDP) is a social data integration approach aimed at providing longitudinal analytical opportunities without imposing additional burden on respondents. The LSDDP uses a multitude of signals from different data sources for the same individual, which helps to better understand their interactions and track changes over time. This article looks at how the ethnicity status of people in Canada can be estimated at the most detailed disaggregated level possible using the results from a variety of business rules applied to linked data and to the LSDDP denominator. It will then show how improvements were obtained using machine learning methods, such as decision trees and random forest techniques.
Release date: 2024-03-25

Data (1)

Data (1) ((1 result))

1. General Social Survey, Cycle 22: Social Networks, 2008 Public Use Microdata Files
Public use microdata: 12M0022X
Description:
This package was designed to enable users to access and manipulate the microdata file for Cycle 22 (2008) of the General Social Survey (GSS). It contains information on the objectives, methodology and estimation procedures, as well as guidelines for releasing estimates based on the survey. Cycle 22 collected data from persons 15 years and over living in private households in Canada, excluding residents of the Yukon, Northwest Territories and Nunavut; and full-time residents of institutions. The survey covered a range of topics such as social networks, and social and civic participation. Information was also collected on major changes in respondents' lives in the last 12 months, the resources they used during these transitions and unmet needs for help. Questions were also asked on trust, sense of belonging, volunteering and unpaid work.

Release date: 2010-03-05

Analysis (145)

Analysis (145) (50 to 60 of 145 results)

51. Combining information from multiple complex surveys Archived
Articles and reports: 12-001-X201400214089
Description:
This manuscript describes the use of multiple imputation to combine information from multiple surveys of the same underlying population. We use a newly developed method to generate synthetic populations nonparametrically using a finite population Bayesian bootstrap that automatically accounts for complex sample designs. We then analyze each synthetic population with standard complete-data software for simple random samples and obtain valid inference by combining the point and variance estimates using extensions of existing combining rules for synthetic data. We illustrate the approach by combining data from the 2006 National Health Interview Survey (NHIS) and the 2006 Medical Expenditure Panel Survey (MEPS).
Release date: 2014-12-19
52. Estimation methods on multiple sampling frames in two-stage sample designs Archived
Articles and reports: 12-001-X201400214090
Description:
When studying a finite population, it is sometimes necessary to select samples from several sampling frames in order to represent all individuals. Here we are interested in the scenario where two samples are selected using a two-stage design, with common first-stage selection. We apply the Hartley (1962), Bankier (1986) and Kalton and Anderson (1986) methods, and we show that these methods can be applied conditional on first-stage selection. We also compare the performance of several estimators as part of a simulation study. Our results suggest that the estimator should be chosen carefully when there are multiple sampling frames, and that a simple estimator is sometimes preferable, even if it uses only part of the information collected.
Release date: 2014-12-19
53. Fractional hot deck imputation for robust inference under item nonresponse in survey sampling Archived
Articles and reports: 12-001-X201400214091
Description:
Parametric fractional imputation (PFI), proposed by Kim (2011), is a tool for general purpose parameter estimation under missing data. We propose a fractional hot deck imputation (FHDI) which is more robust than PFI or multiple imputation. In the proposed method, the imputed values are chosen from the set of respondents and assigned proper fractional weights. The weights are then adjusted to meet certain calibration conditions, which makes the resulting FHDI estimator efficient. Two simulation studies are presented to compare the proposed method with existing methods.
Release date: 2014-12-19
54. Chi-squared tests in dual frame surveys Archived
Articles and reports: 12-001-X201400214096
Description:
In order to obtain better coverage of the population of interest and cost less, a number of surveys employ dual frame structure, in which independent samples are taken from two overlapping sampling frames. This research considers chi-squared tests in dual frame surveys when categorical data is encountered. We extend generalized Wald’s test (Wald 1943), Rao-Scott first-order and second-order corrected tests (Rao and Scott 1981) from a single survey to a dual frame survey and derive the asymptotic distributions. Simulation studies show that both Rao-Scott type corrected tests work well and thus are recommended for use in dual frame surveys. An example is given to illustrate the usage of the developed tests.
Release date: 2014-12-19
55. On aligned composite estimates from overlapping samples for growth rates and totals Archived
Articles and reports: 12-001-X201400214097
Description:
When monthly business surveys are not completely overlapping, there are two different estimators for the monthly growth rate of the turnover: (i) one that is based on the monthly estimated population totals and (ii) one that is purely based on enterprises observed on both occasions in the overlap of the corresponding surveys. The resulting estimates and variances might be quite different. This paper proposes an optimal composite estimator for the growth rate as well as the population totals.
Release date: 2014-12-19
56. Low Income Lines, 2012-2013 Archived
Articles and reports: 75F0002M2014003
Description:
In order to provide a holographic or complete picture of low income, Statistics Canada uses three complementary low income lines: the Low Income Cut-offs (LICOs), the Low Income Measures (LIMs) and the Market Basket Measure (MBM). While the first two lines were developed by Statistics Canada, the MBM is based on concepts developed by Employment and Social Development Canada. Though these measures differ from one another, they give a generally consistent picture of low income status over time. None of these measures is the best. Each contributes its own perspective and its own strengths to the study of low income, so that cumulatively, the three provide a better understanding of the phenomenon of low income as a whole. These measures are not measures of poverty, but strictly measures of low income.
Release date: 2014-12-10
57. A nonparametric method to generate synthetic populations to adjust for complex sampling design features Archived
Articles and reports: 12-001-X201400114003
Description:
Outside of the survey sampling literature, samples are often assumed to be generated by simple random sampling process that produces independent and identically distributed (IID) samples. Many statistical methods are developed largely in this IID world. Application of these methods to data from complex sample surveys without making allowance for the survey design features can lead to erroneous inferences. Hence, much time and effort have been devoted to develop the statistical methods to analyze complex survey data and account for the sample design. This issue is particularly important when generating synthetic populations using finite population Bayesian inference, as is often done in missing data or disclosure risk settings, or when combining data from multiple surveys. By extending previous work in finite population Bayesian bootstrap literature, we propose a method to generate synthetic populations from a posterior predictive distribution in a fashion inverts the complex sampling design features and generates simple random samples from a superpopulation point of view, making adjustment on the complex data so that they can be analyzed as simple random samples. We consider a simulation study with a stratified, clustered unequal-probability of selection sample design, and use the proposed nonparametric method to generate synthetic populations for the 2006 National Health Interview Survey (NHIS), and the Medical Expenditure Panel Survey (MEPS), which are stratified, clustered unequal-probability of selection sample designs.
Release date: 2014-06-27
58. Estimation and replicate variance estimation of deciles for complex survey data from positively skewed populations Archived
Articles and reports: 12-001-X201300211868
Description:
Thompson and Sigman (2000) introduced an estimation procedure for estimating medians from highly positively skewed population data. Their procedure uses interpolation over data-dependent intervals (bins). The earlier paper demonstrated that this procedure has good statistical properties for medians computed from a highly skewed sample. This research extends the previous work to decile estimation methods for a positively skewed population using complex survey data. We present three different interpolation methods along with the traditional decile estimation method (no bins) and evaluate each method empirically, using residential housing data from the Survey of Construction and via a simulation study. We found that a variant of the current procedure using the 95th percentile as a scaling factor produces decile estimates with the best statistical properties.
Release date: 2014-01-15
59. An appraisal-based generalized regression estimator of house price change Archived
Articles and reports: 12-001-X201300211869
Description:
The house price index compiled by Statistics Netherlands relies on the Sale Price Appraisal Ratio (SPAR) method. The SPAR method combines selling prices with prior government assessments of properties. This paper outlines an alternative approach where the appraisals serve as auxiliary information in a generalized regression (GREG) framework. An application on Dutch data demonstrates that, although the GREG index is much smoother than the ratio of sample means, it is very similar to the SPAR series. To explain this result we show that the SPAR index is an estimator of our more general GREG index and in practice almost as efficient.
Release date: 2014-01-15
60. Design-based analysis of factorial designs embedded in probability samples Archived
Articles and reports: 12-001-X201300211870
Description:
At national statistical institutes experiments embedded in ongoing sample surveys are frequently conducted, for example to test the effect of modifications in the survey process on the main parameter estimates of the survey, to quantify the effect of alternative survey implementations on these estimates, or to obtain insight into the various sources of non-sampling errors. A design-based analysis procedure for factorial completely randomized designs and factorial randomized block designs embedded in probability samples is proposed in this paper. Design-based Wald statistics are developed to test whether estimated population parameters, like means, totals and ratios of two population totals, that are observed under the different treatment combinations of the experiment are significantly different. The methods are illustrated with a real life application of an experiment embedded in the Dutch Labor Force Survey.
Release date: 2014-01-15

Reference (21)

Reference (21) (0 to 10 of 21 results)

1. Methods for Constructing Life Tables for Canada, Provinces and Territories
Surveys and statistical programs – Documentation: 84-538-X
Geography: Canada
Description: This electronic publication presents the methodology underlying the production of the life tables for Canada, provinces and territories.
Release date: 2023-08-28
2. Guide to the Census of Population
Surveys and statistical programs – Documentation: 98-304-X
Description:
The Guide to the Census of Population is a reference document that describes the various phases of the 2021 Census of Population. The guide provides an overview of content determination, sampling design, collection, data processing, data quality assessment, confidentiality guidelines and dissemination. It also includes response rates and other data quality information. This product may be useful to both new and experienced users who wish to familiarize themselves with and find specific information about the 2021 Census of Population.

The Guide to the Census of Population combines information previously available in the Overview of the Census, National Household Survey User Guide and the Data Quality and Confidentiality Standards and Guidelines from 2011.

Release date: 2022-11-30
3. Longitudinal Immigration Database (IMDB) Technical Report, 2019 Archived
Surveys and statistical programs – Documentation: 11-633-X2021002
Description:
The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 35 years. The IMDB includes Immigration, Refugees and Citizenship Canada (IRCC) administrative records which contain exhaustive information about immigrants who were admitted to Canada since 1952. It also includes data about non-permanent residents who have been issued temporary resident permits since 1980. This report will discuss the IMDB data sources, concepts and variables, record linkage, data processing, dissemination, data evaluation and quality indicators, comparability with other immigration datasets, and the analyses possible with the IMDB.

Release date: 2021-02-01
4. First-time homebuyer concept: Technical reference note Archived
Surveys and statistical programs – Documentation: 75F0002M2020001
Description:
This note provides the definition of a first-time homebuyer concept used in the 2018 Canadian Housing Survey (CHS). It also includes the methodology used to identify first-time homebuyers and provides sensitivity analysis under alternative methodologies.
Release date: 2020-01-15
5. Census of Agriculture Content Consultation Report
Notices and consultations: 95-635-X
Description: To stay relevant, preparing for a new Census of Agriculture requires a thorough evaluation of data requirements. Before each census, Statistics Canada conducts consultations to solicit input and feedback on the Census of Agriculture's content. This report describes those consultations and the process that was followed to test and determine which topics could be potentially retained for the next census.
Release date: 2019-10-02
6. Input-Output Model Simulations (National Model)
Surveys and statistical programs – Documentation: 15F0004X
Description:
The input-output (IO) models are generally used to simulate the economic impacts of an expenditure on a given basket of goods and services or the output of one or several industries. The simulation results from a "shock" to an IO model will show the direct, indirect and induced impacts on GDP, which industries benefit the most, the number of jobs created, estimates of indirect taxes and subsidies generated, etc. For more details, ask us for the Guide to using the input-output simulation model, available free of charge upon request.
At various times, clients have requested the use of IO price, energy, tax and market models. Given their availability, arrangements can be made to use these models on request.
The national IO model was not released in 2015 or 2016.
Release date: 2019-04-04
7. Input-Output Model Simulations (Interprovincial Model)
Surveys and statistical programs – Documentation: 15F0009X
Description:
The input-output (IO) models are generally used to simulate the economic impacts of an expenditure on a given basket of goods and services or the output of one or several industries. The simulation results from a "shock" to an IO model will show the direct, indirect and induced impacts on GDP, which industries benefit the most, the number of jobs created, estimates of indirect taxes and subsidies generated, etc. For more details, ask us for the Guide to using the input-output simulation model, available free of charge upon request.
At various times, clients have requested the use of IO price, energy, tax and market models. Given their availability, arrangements can be made to use these models on request.
The interprovincial IO model was not released in 2015 or 2016.
Release date: 2019-04-04
8. Longitudinal Immigration Database (IMDB) Technical Report, 2016 Archived
Surveys and statistical programs – Documentation: 11-633-X2018019
Description:
The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 30 years. The IMDB combines administrative files on immigrant admissions and non-permanent resident permits from Immigration, Refugees and Citizenship Canada (IRCC) with tax files from the Canadian Revenue Agency (CRA). Information is available for immigrant taxfilers admitted since 1980. Tax records for 1982 and subsequent years are available for immigrant taxfilers. This report will discuss the IMDB data sources, concepts and variables, record linkage, data processing, dissemination, data evaluation and quality indicators, comparability with other immigration datasets, and the analyses possible with the IMDB.
Release date: 2018-12-10
9. Longitudinal Immigration Database (IMDB) Technical Report, 2015 Archived
Surveys and statistical programs – Documentation: 11-633-X2018011
Description:
The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 30 years. The IMDB combines administrative files on immigrant admissions and non-permanent resident permits from Immigration, Refugees and Citizenship Canada (IRCC) with tax files from the Canadian Revenue Agency (CRA). Information is available for immigrant taxfilers admitted since 1980. Tax records for 1982 and subsequent years are available for immigrant taxfilers.
This report will discuss the IMDB data sources, concepts and variables, record linkage, data processing, dissemination, data evaluation and quality indicators, comparability with other immigration datasets, and the analyses possible with the IMDB.
Release date: 2018-01-08
10. Census Test: Content Analysis Report
Notices and consultations: 92-140-X
Description:
Before each Census of Population, Statistics Canada carries out a three- to four-year process to review the content of the census questionnaires in consultation with census data users, performing tests and developing questionnaire content to ensure that it takes into account the evolution of Canadian society. Factors considered in developing the content include legislative requirements regarding information, program and policy requirements; the burden placed on respondents to respond to questions; concerns about privacy; feedback from consultations and tests; data quality; costs and operational considerations; the comparability of data with earlier data and the availability of alternative data sources. Before each census, Statistics Canada tests the questionnaire content through an extensive test. The content report presents the analyses conducted from the data collected from this test and the results that are used to fine tune the questionnaires, the methodology and the systems used for the Census Program.

Release date: 2016-04-01

Report a problem or mistake on this page

Date modified:: 2024-09-23

Language selection

Search and menus

Search

Keyword search

Filter results by

Keyword(s)

Subject

Type

Year of publication

Geography

Survey or statistical program

Portal

Content

Results

All (171) (0 to 10 of 171 results)

Data (1) ((1 result))

Analysis (145) (50 to 60 of 145 results)

Reference (21) (0 to 10 of 21 results)

Keyword search

Filter results by

Keyword(s)

Subject

Type

Year of publication

Geography

Survey or statistical program

Portal

Content

Results

All (171) (0 to 10 of 171 results)

Data (1) ((1 result))

Analysis (145) (50 to 60 of 145 results)

Reference (21) (0 to 10 of 21 results)

How do I use the filters and the search box?

How do I refine my search?

How does the search work?

How are the results ordered?

How are the results ordered?