Statistical methods

Results

All (2,481) (50 to 60 of 2,481 results)

51. Synthetic Data Disclosure Risk Assessment: A Literature Review Archived
Articles and reports: 11-522-X202500100016
Description: The adoption of synthetic data generation as a confidentiality measure is increasing in statistical agencies worldwide, including at Statistics Canada. This approach provides an alternative to the traditional dissemination of anonymized public microdata files, offering both privacy protection and data utility. However, the creation of synthetic data presents challenges in assessing and mitigating disclosure risks. This paper reviews the different types of disclosure risk, namely attribute, membership and identity disclosure, and presents some of the associated methods for measuring risk. The paper presents prominent risk assessment metrics and discusses practical methods for disclosure control in data synthesis. Methods for assessing disclosure risks usually produce a metric that can be used to gauge the risk, but there is little consensus on threshold values for these metrics. Balancing utility and confidentiality is also important and needs further discussion in the context of these methods. The paper concludes by offering insights and recommendations about managing disclosure risk while creating synthetic data, as well as some ideas on future directions for research and practical implications for managing disclosure risks in synthetic data.
Release date: 2025-09-08
52. Exploration of Deep Learning Synthetic Data Generation for Sensitive Utility Data Sharing Archived
Articles and reports: 11-522-X202500100017
Description: Utilities hold crucial information about energy usage and building characteristics which can be utilized by government agencies to improve their corresponding analytics. However, this data is associated with private customer records and thus the building data and energy usage may be too sensitive to share. Often, high-level aggregated versions of this data are shared through robust contracts, limiting the statistics that can be derived. With the advancement of generative machine learning techniques, Statistics Canada and Natural Resources Canada have explored the feasibility of using these models to produce synthetic versions of utility data which may be shared in full to requesting organizations. These synthetic datasets can be created by a utility company through a locally run program and the outputs can be approved before being sent. This work has identified that certain generative models can feasibly be used by utilities to generate new versions of a dataset and has identified the issues which must be addressed prior to implementing this in practice. Both tabular and time-series models have been tested for different data sharing scenarios, where the TimeGAN model successfully captured the general energy peaks and valleys over a given day with reasonable computational requirements. Although this process takes days for annual energy amounts over thousands of customer records, this can enable new data sharing initiatives between utilities and National Statistical Offices while managing privacy risks. As work progresses in future phases with real utility partners, trust can be built for these approaches, and they can begin being tested on real data by actual data holders.
Release date: 2025-09-08
53. Survey-admin Hybrid Measure of Persistent Child Poverty in New Zealand Archived
Articles and reports: 11-522-X202500100018
Description: The Child Poverty Reduction Act (2018) outlines a need for the New Zealand Government to set three- and ten-yearly persistent child poverty reduction targets by the end of 2024. In the absence of longitudinal survey data, this paper outlines a survey-administrative data hybrid method that will facilitate the production of these reduction targets and of official estimates of persistent child poverty once reporting is required, from the 2025/2026 financial year onwards. This hybrid approach leverages the cross-sectional Household Economic Survey (HES), administrative-based beneficiary's family data, and recent advances developed for the construction of households within the Administrative Population Census (APC) at Statistics New Zealand. With increasing data collection challenges due to rising non-response and costs, this survey-admin hybrid method represents an alternative to longitudinal survey data collection, ensuring ongoing sustainable and quality statistics to produce persistent child poverty estimates.
Release date: 2025-09-08
54. Efficient Record Linkage for Large Datasets by Business Names Archived
Articles and reports: 11-522-X202500100019
Description: Accurate and efficient record linkage is crucial for maintaining a comprehensive and current Statistical Business Register (SBR) at Statistics Canada. Linking external business lists to the SBR by name presents computational and methodological challenges, especially as data volumes grow. This paper describes a scalable methodology that employs blocking techniques to constrain the computational search space and integrates multiple similarity measures—from edit distances and n-gram overlaps to embedding-based methods using Sentence-BERT (SBERT)—to identify likely matches. By combining simple character-level comparisons with more advanced semantic embedding methods, the approach can adapt to various naming conventions and complexities. While it does not guarantee superior accuracy in all circumstances, it offers a pragmatic balance between computational feasibility and linkage quality.
Release date: 2025-09-08
55. Evaluating the Accuracy when Linking Records in Waves Archived
Articles and reports: 11-522-X202500100020
Description: At Statistics Canada, many data sets are linked with quasi-identifiers such as the first name, last name, or address. In such cases, linkage errors are a potential concern and must be measured. In that regard, previous studies have shown that the evaluation may be based on modeling the number of links from a given record while accounting for all the interactions among the linkage variables and dispensing with clerical reviews, so long as the decision to link two records does not involve other records. In this communication, the methodology is adapted for a class of practical strategies, which violate this constraint by linking the records in consecutive waves, where a given wave links a subset of the records that are not linked in previous waves. In particular, the linkage may be based on a deterministic wave followed by a probabilistic one.
Release date: 2025-09-08
56. Model-Based Threshold Selection for Agricultural Linkages Archived
Articles and reports: 11-522-X202500100021
Description: Optimal threshold selection is a critical challenge in probabilistic linkage, with significant implications for the accuracy and reliability of linked datasets. This paper analyzes the performance of the neighbour model, a recently proposed error model which models linkage errors by the number of links from each record. Three threshold selection algorithms utilizing the neighbour model were assessed, highlighting the strengths and limitations of each. Their performance was assessed through simulation studies, which demonstrated that methods using the neighbour model achieved lower relative bias compared to two established methods for threshold selection. Additionally, the practical utility was validated through goodness-of-fit tests conducted on four agricultural datasets, showing the potential of the model for use in real-world applications.
Release date: 2025-09-08
57. T1 Redesign: T1 Partnership Identification Process Archived
Articles and reports: 11-522-X202500100022
Description: In Canada, T1 tax forms are used to report personal income, whether earned as an employee or through self-employment. Income from self-employment, or "T1 Business Income", is reported by sole proprietorships or partnerships. A T1 partnership involves two or more legal entities jointly filing for a shared business. T1 business data is received as individual filings, meaning that a partnership's data is received separately for each partner. Internal record linkage within the T1 business database is performed to identify partnerships and prevent overcoverage within the final population of T1 businesses. This new T1 partnership identification process takes advantage of newer algorithms, such as DBSCAN numerical clustering fuzzy matching, to identify internal linkages. Graph theory is used to construct the list of partnerships from the row-pairs identified in the linkage process.
Release date: 2025-09-08
58. Development of Linkage-Adjusted Weights Accounting for Gender for the 2021 Canadian Census Health and Environment Cohort Archived
Articles and reports: 11-522-X202500100023
Description: The latest Canadian Census Health and Environment Cohort (CanCHEC) continues a series of population-based microdata linkages focused on population health research by demographic, social and economic characteristics. The 2021 CanCHEC consists of 95.5% of the 2021 Census long-form sample survey records. The records of survey respondents that could not be linked to the Derived Record Depository and those presumed to be duplicates account for the remaining 4.5%. Linkage-adjusted main and replicate weights allow researchers to estimate and evaluate the variance of summary measures about population health in the presence of missed linked pairs to better understand the experiences of diverse population groups.
Release date: 2025-09-08
59. The Future of National Statistical Organisations: The Longer-Term Role and Shape of NSOs Archived
Articles and reports: 11-522-X202500100024
Description: This paper explores a vision for the future of National Statistics Offices (NSOs). It analyses the history and role of NSOs, then explores their current and future challenges and opportunities, and finally outlines a future where NSOs become more agile, open, and collaborative while maintaining their high level of trust in the community, thereby allowing them to fulfil their new role as data stewards in a rapidly evolving data landscape.
Release date: 2025-09-08
60. Statistical Inference for a Finite Population Mean with Machine Learning-Based Imputation for Missing Survey Data Archived
Articles and reports: 11-522-X202500100025
Description: National statistical offices have increasingly adopted machine learning (ML) for its potential to improve survey estimates. ML techniques offer significant advantages, notably the ability to manage high-dimensional data and to capture complex, nonlinear relationships, thereby enhancing the overall quality of survey statistics. In this article, following the approach of Chernozhukov et al. (2018), we describe a double debiased machine learning framework that enables valid statistical inference when imputed estimators are derived from ML procedures. Simulation results suggest that the proposed framework performs well in a wide range of scenarios.
Release date: 2025-09-08
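
Two linkage ideas from the items above can be sketched together: the blocking and string-similarity scoring described in item 54, and the use of graph theory to turn linked row pairs into groups described in item 57. The sketch below is only an illustration under stated assumptions, not the method used in either paper. The blocking key (first three characters of the normalized name), the 0.8 threshold, the averaged score, and all function names are hypothetical. Python's difflib ratio stands in for an edit-distance measure, and the SBERT embeddings and DBSCAN clustering mentioned in the abstracts are left out.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def normalize(name):
    """Lowercase the name and replace punctuation with single spaces."""
    cleaned = "".join(ch if ch.isalnum() else " " for ch in name.lower())
    return " ".join(cleaned.split())


def trigrams(text):
    """Set of character 3-grams of an already normalized string."""
    return {text[i:i + 3] for i in range(len(text) - 2)}


def similarity(a, b):
    """Average of trigram Jaccard overlap and difflib's matching ratio."""
    a, b = normalize(a), normalize(b)
    ta, tb = trigrams(a), trigrams(b)
    jaccard = len(ta & tb) / len(ta | tb) if (ta | tb) else 0.0
    return (jaccard + SequenceMatcher(None, a, b).ratio()) / 2


def linked_pairs(names, threshold=0.8):
    """Compare names only within blocks; return index pairs scoring >= threshold."""
    blocks = defaultdict(list)
    for idx, name in enumerate(names):
        blocks[normalize(name)[:3]].append(idx)  # hypothetical blocking key
    pairs = []
    for members in blocks.values():
        for i, j in combinations(members, 2):
            if similarity(names[i], names[j]) >= threshold:
                pairs.append((i, j))
    return pairs


def connected_groups(pairs):
    """Union-find over linked pairs; each connected component is one group."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = defaultdict(set)
    for node in list(parent):
        groups[find(node)].add(node)
    return sorted(sorted(g) for g in groups.values())
```

Calling connected_groups(linked_pairs(names)) on a list of business names returns lists of row indices that are transitively linked. Blocking keeps the number of comparisons small, at the cost of missing matches whose names differ in the first three characters.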

Data (10) (10 results)

1. Social Policy Simulation Database and Model (SPSD/M)
Public use microdata: 89F0002X
Description: The SPSD/M is a static microsimulation model designed to analyse financial interactions between governments and individuals in Canada. It can compute taxes paid to and cash transfers received from government. It is comprised of a database, a series of tax/transfer algorithms and models, analytical software and user documentation.
Release date: 2026-02-12
2. National Address Register
Profile of a community or region: 46-26-0002
Description: The National Address Register (NAR) is a list of commercial and residential addresses in Canada that are extracted from Statistics Canada's Building Register and deemed non-confidential.
Release date: 2025-12-19
3. PASSAGES microsimulation model
Table: 89-26-0006
Description: PASSAGES is an open-source dynamic microsimulation model aimed at supporting policy analysis and research relating to Canadian retirement income system outcomes at the individual and family level. The publicly available version includes a synthetic starting database, a model, and documentation. A confidential starting database is also available.
Release date: 2025-03-12
4. Canadian Statistical Geospatial Explorer Hub Archived
Data Visualization: 71-607-X2020010
Description: The Canadian Statistical Geospatial Explorer empowers users to discover geo enabled data holdings of Statistics Canada at various levels of geography including at the neighbourhood level. Users are able to visualize, thematically map, spatially explore and analyze, export and consume data in various formats. Users can also view the data superimposed on satellite imagery, topographic and street layers.
Release date: 2024-08-21
5. Income divergence index (D-index) by census tract
Table: 11-10-0074-01
Geography: Census tract
Frequency: Occasional
Description: The divergence index (D-index) describes the degree that families with different income levels are mixing together in neighbourhoods. It compares neighbourhood (census tract, CT) discrete income distributions to a base distribution, which is the income quintiles of the neighbourhood’s census metropolitan area (CMA).
Release date: 2020-06-22
6. Housing Data Viewer Archived
Data Visualization: 71-607-X2019010
Description: The Housing Data Viewer is a visualization tool that allows users to explore Statistics Canada data on a map. Users can use the tool to navigate, compare and export data.
Release date: 2019-10-30
7. Findings of the Canadian Vehicle Fuel Pilot Survey Archived
Table: 53-500-X
Description:
This report presents the results of a pilot survey conducted by Statistics Canada to measure the fuel consumption of on-road motor vehicles registered in Canada. This study was carried out in connection with the Canadian Vehicle Survey (CVS) which collects information on road activity such as distance traveled, number of passengers and trip purpose.
Release date: 2004-10-21
8. National Tourism Indicators, Historical Estimates Archived
Table: 13-220-X
Description: In the 1997 edition, new and revised benchmarks were introduced for 1992 and 1988. The indicators are used to monitor supply, demand and employment for tourism in Canada on a timely basis. The annual tables are derived using the National Income and Expenditure Accounts (NIEA) and various industry and travel surveys. Tables providing actual data and percentage changes, for seasonally adjusted current and constant price estimates are included. In addition, an analytical section provides graphs, and time series of first differences, percentage changes, and seasonal factors for selected indicators. Data are published from 1987 and the publication will be available on the day of release. New data are included in the demand tables for non-tourism commodities produced by non-tourism industries and in the employment tables covering direct tourism employment generated by non-tourism industries. This product was commissioned by the Canadian Tourism Commission to provide annual updates for the Tourism Satellite Account.
Release date: 2003-01-08
9. Historical Statistics of Canada Archived
Table: 11-516-X
Description:
The second edition of Historical statistics of Canada was jointly produced by the Social Science Federation of Canada and Statistics Canada in 1983. This volume contains about 1,088 statistical tables on the social, economic and institutional conditions of Canada from the start of Confederation in 1867 to the mid-1970s. The tables are arranged in sections with an introduction explaining the content of each section, the principal sources of data for each table, and general explanatory notes regarding the statistics. In most cases, there is sufficient description of the individual series to enable the reader to use them without consulting the numerous basic sources referenced in the publication.
The electronic version of this historical publication is accessible on the Internet site of Statistics Canada as a free downloadable document: text as HTML pages and all tables as individual spreadsheets in a comma delimited format (CSV) (which allows online viewing or downloading).
Release date: 1999-07-29
10. National Population Health Survey Overview Archived
Table: 82-567-X
Description:
The National Population Health Survey (NPHS) is designed to enhance the understanding of the processes affecting health. The survey collects cross-sectional as well as longitudinal data. In 1994/95 the survey interviewed a panel of 17,276 individuals, then returned to interview them a second time in 1996/97. The response rate for these individuals was 96% in 1996/97. Data collection from the panel will continue for up to two decades. For cross-sectional purposes, data were collected for a total of 81,000 household residents in all provinces (except people on Indian reserves or on Canadian Forces bases) in 1996/97.
This overview illustrates the variety of information available by presenting data on perceived health, chronic conditions, injuries, repetitive strains, depression, smoking, alcohol consumption, physical activity, consultations with medical professionals, use of medications and use of alternative medicine.
Release date: 1998-07-29
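
Item 5 above describes the D-index as a comparison between a census tract's discrete income distribution and a base distribution given by the CMA's income quintiles. The description does not state the formula, so the sketch below assumes the comparison is a Kullback-Leibler divergence, which is one common form of a divergence index. The function name and the input validation are hypothetical. Because the base groups are quintiles, each base share is 0.2 by construction.

```python
import math


def divergence_index(tract_shares, base_shares=(0.2, 0.2, 0.2, 0.2, 0.2)):
    """Kullback-Leibler divergence of tract income shares from base shares.

    Assumed form of the index; the catalogue entry does not give the formula.
    """
    if len(tract_shares) != len(base_shares):
        raise ValueError("distributions must have the same number of groups")
    if abs(sum(tract_shares) - 1.0) > 1e-9:
        raise ValueError("tract shares must sum to 1")
    # Groups with a zero tract share contribute nothing (0 * log 0 is taken as 0).
    return sum(
        p * math.log(p / q)
        for p, q in zip(tract_shares, base_shares)
        if p > 0
    )
```

Under this assumed form, a tract whose families are spread across the CMA quintiles exactly like the CMA as a whole scores 0, and the score grows as the tract's income mix departs from the CMA's.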

Analysis (2,037) (60 to 70 of 2,037 results)

61. Statistics Canada International Symposium Series: Proceedings
Journals and periodicals: 11-522-X
Description: Since 1984, an annual international symposium on methodological issues has been sponsored by Statistics Canada. Proceedings have been available since 1987.
Release date: 2025-09-08
62. A conversation with Geoffrey Hole
Articles and reports: 12-001-X202500100001
Description: Geoffrey J.C. Hole (or Geoff, as he likes to be called) was born on January 24, 1940 at Shardeloes, Amersham, Buckinghamshire, England, to Charles William Hole and Sybil Winifred Hole, formerly Morge. He completed a BSc Honours in Mathematics in 1961, and a Postgraduate Diploma in Statistics at Manchester University the following year. He started his career as a mathematical statistician in London, England, working successively for the National Coal Board (1962-63), the Central Electricity Generating Board (1963-66), and the Electricity Council (1966-67), where his title was Economist. He moved to Canada in 1967 to join the Dominion Bureau of Statistics (DBS) as a survey methodologist. In 1971-72, he was Chief of Census Operations, Methodology and Quality Control Section, and Assistant Coordinator, Socio-Economic Survey Methods Section. He then took a one-year leave of absence to complete an MSc (Econ) in Statistics at the London School of Economics. In 1973, Geoff returned to the DBS, which had become Statistics Canada, as Chief, Methodology Group V, Business Survey Methods Division. In 1974, he was appointed Director, Institutions and Agriculture Survey Methods Division, and, as of 1986, Director, Business Survey Methods Division. His career culminated when he became Director, Social Survey Methods Division, in 1987. He held that position until his retirement, on September 29, 2004. In addition to his long-term involvement at Statistics Canada, including as a member of the Editorial Board of Survey Methodology between 1983 and 1987, Geoff was very active in the Statistical Society of Canada (SSC), serving among others as Chair of the Program Committee for the 1986 Annual Meeting at the Banff Centre, in Alberta, and President of the SSC in 1989-90. 
He was also Program Chair for a joint conference of the International Association of Survey Statisticians and the International Association for Official Statistics which was held in Aguascalientes, Mexico, in 1998.
Release date: 2025-06-30
63. A conversation with Dr. Ivan P. Fellegi
Articles and reports: 12-001-X202500100002
Description: Ivan Fellegi is an expert in statistical science and a public servant who was the Chief Statistician of Canada from 1985 to 2008. This article briefly recounts his early life, long career and influential research contributions. It includes an interview conducted in February 2017 to mark Ivan Fellegi’s 60th year of service at Statistics Canada.
Release date: 2025-06-30
64. On the use of machine learning methods for the treatment of unit nonresponse in surveys
Articles and reports: 12-001-X202500100003
Description: In recent years, there has been a significant interest in machine learning in national statistical offices. Thanks to their flexibility, these methods may prove useful at the nonresponse treatment stage. In this article, we conduct an empirical investigation in order to compare several machine learning procedures in terms of bias and efficiency. In addition to the classical machine learning procedures, we assess the performance of ensemble approaches that make use of different machine learning procedures to produce a set of weights adjusted for nonresponse.
Release date: 2025-06-30
65. Imputation of nonignorable missing data in surveys using auxiliary margins via hot deck and sequential imputation
Articles and reports: 12-001-X202500100004
Description: Survey data collection often is plagued by unit and item nonresponse. To reduce reliance on strong assumptions about the missingness mechanisms, statisticians can use information about population marginal distributions known, for example, from censuses or administrative databases. One approach that does so is the Missing Data with Auxiliary Margins, or MD-AM, framework, which uses multiple imputation for both unit and item nonresponse so that survey-weighted estimates accord with the known marginal distributions. However, this framework relies on specifying and estimating a joint distribution for the survey data and nonresponse indicators, which can be computationally and practically daunting in data with many variables of mixed types. We propose two adaptations to the MD-AM framework to simplify the imputation task. First, rather than specifying a joint model for unit respondents’ data, we use random hot deck imputation while still leveraging the known marginal distributions. Second, instead of sampling from conditional distributions implied by the joint model for the missing data due to item nonresponse, we apply multiple imputation by chained equations for item nonresponse before imputation for unit nonresponse. Using simulation studies with nonignorable missingness mechanisms, we demonstrate that the proposed approach can provide more accurate point and interval estimates than models that do not leverage the auxiliary information. We illustrate the approach using data on voter turnout from the U.S. Current Population Survey.
Release date: 2025-06-30
66. Mean squared prediction error estimators of the empirical best linear unbiased predictor of a small area mean under a semi-parametric Fay-Herriot model
Articles and reports: 12-001-X202500100005
Description: In this paper, we derive a second-order unbiased (or nearly unbiased) mean squared prediction error (MSPE) estimator of the empirical best linear unbiased predictor (EBLUP) of a small area mean for a semi-parametric extension to the well-known Fay-Herriot model. Specifically, we derive our MSPE estimator essentially assuming certain moment conditions on both the sampling errors and random effects distributions. The normality-based Prasad-Rao MSPE estimator has a surprising robustness property in that it remains second-order unbiased under the non-normality of random effects when a simple Prasad-Rao method-of-moments estimator is used for the variance component and the sampling error distribution is normal. We show that the normality-based MSPE estimator is no longer second-order unbiased when the sampling error distribution has non-zero kurtosis or when the Fay-Herriot moment method is used to estimate the variance component, even when the sampling error distribution is normal. Interestingly, when the simple method-of-moments estimator is used for the variance component, our proposed MSPE estimator does not require the estimation of kurtosis of the random effects. Results of a simulation study on the accuracy of the proposed MSPE estimator, under non-normality of both sampling and random effects distributions, are also presented.
Release date: 2025-06-30
67. sCHAID: A tool for constructing nonresponse adjustment cells under a design-based framework
Articles and reports: 12-001-X202500100006
Description: Survey practitioners have increasingly embraced the benefits of modern machine learning techniques, including classification and regression tree algorithms, in the development of nonresponse adjustments. These methods, which do not require a predefined functional relationship between outcomes and predictors, offer a practical means of conducting variable selection and deriving interpretable structures that link response propensity with explanatory variables. However, when applying these algorithms to survey data, it is common to overlook crucial factors like sampling weights, as well as sample design features such as stratification and clustering. To bridge this shortcoming, we propose an extension of the Chi-square Automatic Interaction Detector (CHAID) approach, and we describe the design-based asymptotic properties of the resulting “survey CHAID” (sCHAID) method. To facilitate the practical use of sCHAID, we incorporate a Rao-Scott correction into the splitting criterion, accounting for the survey design. Using data from the U.S. American Community Survey, we illustrate the use of the method and evaluate its performance through comparisons with existing weighted and unweighted algorithms.
Release date: 2025-06-30
68. Model-assisted calibration estimation using generalized entropy calibration in survey sampling
Articles and reports: 12-001-X202500100007
Description: We introduce a novel approach to model-assisted calibration estimation in survey sampling using generalized entropy. The method builds upon recent work by Kwon, Kim and Qiu (2024) and extends it to a model-assisted framework. Unlike traditional calibration techniques, this approach employs a generalized entropy function as the objective for optimization and incorporates a debiasing calibration constraint to ensure design consistency. The proposed estimator is shown to be asymptotically equivalent to an augmented generalized regression (GREG) estimator. It allows for unequal model variance, potentially improving efficiency when the sampling design is informative. The paper presents both design-based and model-based justifications for the method, along with asymptotic properties and variance estimation techniques. Computational aspects are discussed, including an unconstrained optimization approach that facilitates implementation, especially for high-dimensional auxiliary variables. The method’s performance is evaluated through a simulation study, demonstrating its effectiveness in improving estimation efficiency, particularly when the sampling design is informative.
Release date: 2025-06-30
69. Use of nonprobability samples for official statistics, state of the art
Articles and reports: 12-001-X202500100008
Description: Tightened budgets, a continuing decrease in response rates in traditional probability surveys and increasing pressure from users for more timely data have stimulated research on the use of nonprobability sample data, such as administrative records, web scraping, mobile phone data and voluntary internet surveys, for inference on finite population parameters like means and totals. These data are often easier, faster and cheaper to collect than traditional probability samples. However, a major concern with the use of this kind of data for official statistics is their nonrepresentativeness due to possible selection bias, which, if not accounted for properly, could bias the inference. In this article, we review and discuss methods considered in the literature to deal with this problem and propose new methods, distinguishing between methods based on integration of the nonprobability sample with an appropriate probability sample, and methods that base the inference solely on the nonprobability sample. Empirical illustrations, based on simulated data, are provided.
Release date: 2025-06-30
70. Bridging BigData and sampling methodology: What is big and where is the bridge?
Articles and reports: 12-001-X202500100009
Description: BigData users and the BigData research community are expanding rapidly, while statisticians at large are seemingly becoming divided between those who are enthusiastic and those who are concerned, if not downright hostile. Is BigData also a big step ahead, truly advancing our ability to extract meaningful information and actual knowledge from data? Is BigData underplaying traditional statistical inference as we know it, supplanting survey methodology as a low-cost futuristic option? In this paper I will attempt to unravel the multifaceted relationship bridging BigData to sampling methodology. Starting by reasoning why it should be interesting to look at BigData from a sampling statistician’s perspective, I will delve deeper into the somewhat ambiguous definition of BigData and share some very personal considerations and views on the matter. In the process, several open questions will arise while discussing a personal selection of insights that are traceable through the vast body of statistical literature around BigData and sampling methodology. The discussion will take various angles explored across nine key points, and it will conclude with a forward-looking perspective on a main challenge for future research: addressing the strong assumptions needed to manage deviations from purely randomized data collection.
Release date: 2025-06-30

Reference (382) (40 to 50 of 382 results)

41. National Household Survey: Aboriginal Peoples
Surveys and statistical programs – Documentation: 99-011-X
Description:
This topic presents data on the Aboriginal peoples of Canada and their demographic characteristics. Depending on the application, estimates using any of the following concepts may be appropriate for the Aboriginal population: (1) Aboriginal identity, (2) Aboriginal ancestry, (3) Registered or Treaty Indian status and (4) Membership in a First Nation or Indian band. Data from the 2011 National Household Survey are available for the geographical locations where these populations reside, including 'on reserve' census subdivisions and Inuit communities of Inuit Nunangat as well as other geographic areas such as the national (Canada), provincial and territorial levels.
Analytical products
The analytical document provides analysis on the key findings and trends in the data, and is complemented by the short articles found in NHS in Brief and the NHS Focus on Geography Series.
Data products
The NHS Profile is one data product that provides a statistical overview of user selected geographic areas based on several detailed variables and/or groups of variables. Other data products include data tables which represent a series of cross tabulations ranging in complexity and are available for various levels of geography.
Release date: 2019-10-29
42. Classifying Cannabis in the Canadian Statistical System Archived
Surveys and statistical programs – Documentation: 11-621-M2018105
Description:
Statistics Canada needs to respond to the legalization of cannabis for non-medical use by measuring various aspects of the introduction of cannabis in the Canadian economy and society. An important part of measuring the economy and society is using statistical classifications. Classifications are commonly updated and revised as new industries, products, occupations and educational programs are introduced into the Canadian economy and society. This paper describes the changes to the various statistical classifications used by Statistics Canada in order to measure the introduction of legal non-medical cannabis.
Release date: 2019-07-24
43. Analytical Studies Branch Annual Consolidated Plan for Research, Data Development and Modelling, 2019/2020 Archived
Surveys and statistical programs – Documentation: 11-633-X2019001
Description:
The mandate of the Analytical Studies Branch (ASB) is to provide high-quality, relevant and timely information on economic, health and social issues that are important to Canadians. The branch strategically makes use of expert knowledge and a large range of statistical sources to describe, draw inferences from, and make objective and scientifically supported deductions about the evolving nature of the Canadian economy and society. Research questions are addressed by applying leading-edge methods, including microsimulation and predictive analytics using a range of linked and integrated administrative and survey data. In supporting greater access to data, ASB linked data are made available to external researchers and policy makers to support evidence-based decision making. Research results are disseminated by the branch using a range of mediums (i.e., research papers, studies, infographics, videos, and blogs) to meet user needs. The branch also provides analytical support and training, feedback, and quality assurance to the wide range of programs within and outside Statistics Canada.
Release date: 2019-05-29
44. Effective Income Tax and Transfer Rates: Technical Reference Note
Notices and consultations: 75F0002M2019006
Description:
In 2018, Statistics Canada released two new data tables with estimates of effective tax and transfer rates for individual tax filers and census families. These estimates are derived from the Longitudinal Administrative Databank. This publication provides a detailed description of the methods used to derive the estimates of effective tax and transfer rates.
Release date: 2019-04-16
45. Transition of Labour Force Survey Data Processing to the Social Survey Processing Environment (SSPE) Archived
Surveys and statistical programs – Documentation: 75-005-M2019001
Description:
The production of statistics from the Labour Force Survey (LFS) involves many activities, one of which is data processing. This step involves the verification and correction of survey data when required in order to produce microdata files. Beginning in January 2019, LFS processing will be transitioned to a new system, the Social Survey Processing Environment. This document describes the development and testing that preceded the implementation of the new system, and demonstrates that the transition is expected to have minimal impact on LFS estimates and be transparent to users of LFS data.
Release date: 2019-02-08
46. Longitudinal Immigration Database (IMDB) Technical Report, 2016 Archived
Surveys and statistical programs – Documentation: 11-633-X2018019
Description:
The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 30 years. The IMDB combines administrative files on immigrant admissions and non-permanent resident permits from Immigration, Refugees and Citizenship Canada (IRCC) with tax files from the Canada Revenue Agency (CRA). Information is available for immigrant taxfilers admitted since 1980. Tax records for 1982 and subsequent years are available for immigrant taxfilers. This report will discuss the IMDB data sources, concepts and variables, record linkage, data processing, dissemination, data evaluation and quality indicators, comparability with other immigration datasets, and the analyses possible with the IMDB.
Release date: 2018-12-10
47. Longitudinal Immigration Database (IMDB) Technical Report, 2015 Archived
Surveys and statistical programs – Documentation: 11-633-X2018011
Description:
The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 30 years. The IMDB combines administrative files on immigrant admissions and non-permanent resident permits from Immigration, Refugees and Citizenship Canada (IRCC) with tax files from the Canada Revenue Agency (CRA). Information is available for immigrant taxfilers admitted since 1980. Tax records for 1982 and subsequent years are available for immigrant taxfilers.
This report will discuss the IMDB data sources, concepts and variables, record linkage, data processing, dissemination, data evaluation and quality indicators, comparability with other immigration datasets, and the analyses possible with the IMDB.
Release date: 2018-01-08
48. Methodology of the Canadian Labour Force Survey
Surveys and statistical programs – Documentation: 71-526-X
Description:
The Canadian Labour Force Survey (LFS) is the official source of monthly estimates of total employment and unemployment. Following the 2011 census, the LFS underwent a sample redesign to account for the evolution of the population and labour market characteristics, to adjust to changes in the information needs and to update the geographical information used to carry out the survey. The redesign program following the 2011 census culminated with the introduction of a new sample at the beginning of 2015. This report is a reference on the methodological aspects of the LFS, covering stratification, sampling, collection, processing, weighting, estimation, variance estimation and data quality.
Release date: 2017-12-21
49. Data Quality Toolkit
Surveys and statistical programs – Documentation: 12-606-X
Description: This is a toolkit intended to aid data producers and data users external to Statistics Canada.
Release date: 2017-09-27
50. Comparison of Place of Residence between the T1 Family File and the Census: Evaluation using record linkage Archived
Surveys and statistical programs – Documentation: 91F0015M2017013
Description:
Using record linkage, this article compares the place of residence in the 2011 Census to that of the 2010 T1 Family File (T1FF). The main result is that although the overall level of consistency in the place of residence is relatively high, it decreases, sometimes substantially, for some segments of the population.
Release date: 2017-09-26

Date modified: 2026-06-17
