Statistical methods

Key indicators

Selected geographical area:Canada

Investment in new housing construction - Canada
(August 2018)

$5,106.5 million

-2.2%

(12-month change)
Residential construction investment - Canada
(Second quarter 2018)

$36,023.7 million

7.8%

(year-over-year change)

Results

All (2,478)

All (2,478) (2,470 to 2,480 of 2,478 results)

2,471. Non-interview patterns in the Canadian Labour Force Survey Archived
Articles and reports: 12-001-X197500100006
Description: This paper summarizes the results of a project conducted to study non-interviews in the Canadian Labour Force Survey. Temporarily absent (32.7%), no-one-home (31.4%), and refusal (25.5%) are the major components of non-response. The impact of these components to the total non-response in Surveys from July 1972 to June 1973 is discussed in detail.
A detailed analysis of refusal households showed that existing field follow-up procedures were not quite successful in reducing the refusal component. As expected, non-response was found to be related to the length of tenure of households in the sample. Non-response among households enumerated for the first time was generally higher than those households already in the sample.
Release date: 1975-06-16
2,472. Components of variance model in multi-stage stratified samples Archived
Articles and reports: 12-001-X197500100007
Description: There are several multi-stage sample designs in various countries, such as the Current Population Survey in U.S.A., Labour Survey in Sweden, and the General Household Survey in United Kingdom. From each survey, estimated totals of Employed, Unemployed, and other characteristics may be obtained.
The Canadian Labour Force Survey is a monthly household survey in which the dwelling is the ultimate unit of sampling requiring two to four stages of selection. Each province is split up into strata and sampling units at various stages so that the sampling variance contains up to four components of variance whose actual formulae and estimation formulae are derived, utilizing those formerly derived by Yates and Grundy [12]. Ratio estimation is employed and the formulas are modified accordingly. To analyze the components of variance, it is necessary to express them in terms of components of sampling ratios and the sizes of sampling units at the various stages at provincial and national levels and approximate variance functions are thus derived.
Release date: 1975-06-16
2,473. A strategy for up-dating continuous surveys Archived
Articles and reports: 12-001-X197500100008
Description: The need for regular up-dating of the selection probabilities in continuous surveys is emphasized in this paper. A simple strategy (selection method for the initial sample with the revision procedure) is presented and its application to the Canadian Labour Force Survey is discussed.
Release date: 1975-06-16
2,474. Reinterview programs and response errors Archived
Articles and reports: 12-001-X197500100009
Description: This paper discusses several reinterview techniques and their use in relation to Response Variance, Response Bias, Interviewer Training, and the monitoring of various elements of the interview process. Using the Canadian Labour Force Survey as a case study the article describes how reinterview techniques were developed as the survey evolved and briefly describes the strategy being followed in the present reinterview program.
Release date: 1975-06-16
2,475. Data Inventory Project
Surveys and statistical programs – Documentation: 5190
Description: The Data Inventory Project is a government-wide stock-taking of federal data holdings within departments that are part of the Policy Research Data Group to determine the broad range of data holdings that could address the medium to longer-term priorities. The inventory is comprised of the metadata on datasets held within the various departments and will be linked, when possible, to specific key policy issues.
2,476. Internet Pilot Survey on Caregiving
Surveys and statistical programs – Documentation: 5192
Description: The purpose of this pilot is to provide Statistics Canada with information on key aspects of E-questionnaire data collection as well as measuring the impact of Internet collection on estimates.
2,477. Household Survey Frame Service - Global Positioning System (GPS) and digital mapping pilot test
Surveys and statistical programs – Documentation: 5241
Description: The SRGD is conducting a Global Positioning System (GPS) and digital mapping test to improve Statistic Canada's rural dwelling inventory by collecting dwelling identifiers to be used by field collection staff. In rural areas dwelling identification can be difficult where there is an absence of civic style addresses. The test is evaluating alternative methods for dwelling identification including the collection of GPS coordinates and digital photos using a mapping application and a digital tablet
2,478. Respondent Selection Study for the General Social Survey
Surveys and statistical programs – Documentation: 8014
Description: This study will be used to determine which method would be the most effective to select households in Canada for any given survey that is conducted by Statistics Canada.

Data (10)

Data (10) ((10 results))

1. Social Policy Simulation Database and Model (SPSD/M)
Public use microdata: 89F0002X
Description: The SPSD/M is a static microsimulation model designed to analyse financial interactions between governments and individuals in Canada. It can compute taxes paid to and cash transfers received from government. It is comprised of a database, a series of tax/transfer algorithms and models, analytical software and user documentation.
Release date: 2026-02-12
2. National Address Register
Profile of a community or region: 46-26-0002
Description: The National Address Register (NAR) is a list of commercial and residential addresses in Canada that are extracted from Statistics Canada's Building Register and deemed non-confidential.
Release date: 2025-12-19
3. PASSAGES microsimulation model
Table: 89-26-0006
Description: PASSAGES is an open-source dynamic microsimulation model aimed at supporting policy analysis and research relating to Canadian retirement income system outcomes at the individual and family level. The publicly available version includes a synthetic starting database, a model, and documentation. A confidential starting database is also available.
Release date: 2025-03-12
4. Canadian Statistical Geospatial Explorer Hub Archived
Data Visualization: 71-607-X2020010
Description: The Canadian Statistical Geospatial Explorer empowers users to discover geo enabled data holdings of Statistics Canada at various levels of geography including at the neighbourhood level. Users are able to visualize, thematically map, spatially explore and analyze, export and consume data in various formats. Users can also view the data superimposed on satellite imagery, topographic and street layers.
Release date: 2024-08-21
5. Income divergence index (D-index) by census tract
Table: 11-10-0074-01
Geography: Census tract
Frequency: Occasional
Description:
The divergence index (D-index) describes the degree that families with different income levels are mixing together in neighbourhoods. It compares neighbourhood (census tract, CT) discrete income distributions to a base distribution, which is the income quintiles of the neighbourhood’s census metropolitan area (CMA).

Release date: 2020-06-22
6. Housing Data Viewer Archived
Data Visualization: 71-607-X2019010
Description: The Housing Data Viewer is a visualization tool that allows users to explore Statistics Canada data on a map. Users can use the tool to navigate, compare and export data.
Release date: 2019-10-30
7. Findings of the Canadian Vehicle Fuel Pilot Survey Archived
Table: 53-500-X
Description:
This report presents the results of a pilot survey conducted by Statistics Canada to measure the fuel consumption of on-road motor vehicles registered in Canada. This study was carried out in connection with the Canadian Vehicle Survey (CVS) which collects information on road activity such as distance traveled, number of passengers and trip purpose.
Release date: 2004-10-21
8. National Tourism Indicators, Historical Estimates Archived
Table: 13-220-X
Description: In the 1997 edition, new and revised benchmarks were introduced for 1992 and 1988. The indicators are used to monitor supply, demand and employment for tourism in Canada on a timely basis. The annual tables are derived using the National Income and Expenditure Accounts (NIEA) and various industry and travel surveys. Tables providing actual data and percentage changes, for seasonally adjusted current and constant price estimates are included. In addition, an analytical section provides graphs, and time series of first differences, percentage changes, and seasonal factors for selected indicators. Data are published from 1987 and the publication will be available on the day of release. New data are included in the demand tables for non-tourism commodities produced by non-tourism industries and in the employment tables covering direct tourism employment generated by non-tourism industries. This product was commissioned by the Canadian Tourism Commission to provide annual updates for the Tourism Satellite Account.
Release date: 2003-01-08
9. Historical Statistics of Canada Archived
Table: 11-516-X
Description:
The second edition of Historical statistics of Canada was jointly produced by the Social Science Federation of Canada and Statistics Canada in 1983. This volume contains about 1,088 statistical tables on the social, economic and institutional conditions of Canada from the start of Confederation in 1867 to the mid-1970s. The tables are arranged in sections with an introduction explaining the content of each section, the principal sources of data for each table, and general explanatory notes regarding the statistics. In most cases, there is sufficient description of the individual series to enable the reader to use them without consulting the numerous basic sources referenced in the publication.
The electronic version of this historical publication is accessible on the Internet site of Statistics Canada as a free downloadable document: text as HTML pages and all tables as individual spreadsheets in a comma delimited format (CSV) (which allows online viewing or downloading).
Release date: 1999-07-29
10. National Population Health Survey Overview Archived
Table: 82-567-X
Description:
The National Population Health Survey (NPHS) is designed to enhance the understanding of the processes affecting health. The survey collects cross-sectional as well as longitudinal data. In 1994/95 the survey interviewed a panel of 17,276 individuals, then returned to interview them a second time in 1996/97. The response rate for these individuals was 96% in 1996/97. Data collection from the panel will continue for up to two decades. For cross-sectional purposes, data were collected for a total of 81,000 household residents in all provinces (except people on Indian reserves or on Canadian Forces bases) in 1996/97.
This overview illustrates the variety of information available by presenting data on perceived health, chronic conditions, injuries, repetitive strains, depression, smoking, alcohol consumption, physical activity, consultations with medical professionals, use of medications and use of alternative medicine.
Release date: 1998-07-29

Analysis (2,036)

Analysis (2,036) (30 to 40 of 2,036 results)

31. Recruitment and Collection of Web Panels at Statistics Canada Archived
Articles and reports: 11-522-X202500100008
Description: In 2020, Statistics Canada started to use probabilistic web panels as an alternate method of collecting official statistics. In a web panel, respondents to another survey are asked for contact information to participate in future short surveys. This paper will highlight Statistics Canada's experience with panels after 4 years, including what has been learned about the recruitment of panel participants and how to subsequently collect data using panel surveys. The ways in which recruitment questions are presented can result in very different rates of participation. Moreover, the wealth of auxiliary information available on the recruitment survey can be used to actively manage panel collection operations, by predicting the probability of response and using this information to target follow-up efforts.
Release date: 2025-09-08
32. A Bias Evaluation for Probabilistic Web Panels at Statistics Canada Archived
Articles and reports: 11-522-X202500100009
Description: Three series of web panels were implemented at Statistics Canada from 2020 to 2024. Participants for these web panel series were recruited from respondents of large probabilistic social surveys (recruitment surveys), and subsequently were invited to complete a series of short online surveys. Estimates of recruitment survey variables were calculated using both recruitment survey weights and web panel weights, and these were compared; differences signal the possibility of residual bias that was not corrected by the web panel weighting process. This investigation found more significant differences than would be expected if the web panel estimator fully corrected for the bias resulting from the web panel response process. Questions related to certain topics such as politics and voting, sense of belonging, and media consumption were found to have the most significant differences between web panel estimates and recruitment survey estimates.
Release date: 2025-09-08
33. Life in the FastText Lane: Harnessing Linear Programming Constrained Machine Learning for Classifications Revision Archived
Articles and reports: 11-522-X202500100010
Description: Statistics Canada's Labour Force Survey (LFS) plays an essential role in the estimation of labour market conditions in Canada. Periodically, LFS revises its data to the most recent industry and occupational classification versions. Differences in versions can be extensive, including high-level and unit-group structural changes, creations, deletions, split-offs and combination of classification units (classes). Historically, to reconcile split-off classes - where one class splits into multiple classes - a sample of LFS split-off records would be manually recoded to the new classification version. Based on the split-off proportion observed in the recoded sample, a random allocation method would be applied on all data to reflect the changing Canadian labour market over time. This article proposes using machine learning (fastText), constrained to split-off proportions using linear programming, to revise industry and occupation classifications in LFS. The hybrid framework benefits from a text-based revision mechanism while adhering to traditional proportions driven estimates, thus ensuring a minimal impact on the comparability of published labour market indicators.
Release date: 2025-09-08
34. Data-driven Imputation Strategies and their Associated Quality Indicators in Economic Surveys Archived
Articles and reports: 11-522-X202500100011
Description: The use of modern "data"-driven imputation methods to treat non-response in the context of surveys processed in the Integrated Business Statistics Program at Statistics Canada has previously been explored. It was observed that these methods can lead to high quality imputation and further have the potential to result in broad efficiencies when setting up a particular survey's edit and imputation strategy. However, estimation of the associated total variance, more specifically the component due to imputation, remains a challenge. In this article, two methods for estimation of total variance are proposed and show preliminary results that have motivated us to pursue further research in this area.
Release date: 2025-09-08
35. The challenges of conducting a survey of youth in Nunavik UVIKKAVUT QANUIPPAT? Archived
Articles and reports: 11-522-X202500100012
Description: In 2022, the Institut de la statistique du Québec conducted a survey of high school students in Nunavik, a unique, remote region of Quebec. The survey aimed to develop a portrait of the state of the students' physical and mental health, their lifestyle habits and their environment. This article describes the challenges encountered during the survey and the solutions put in place to overcome them.
Release date: 2025-09-08
36. Advancing Equitable Data Collection: Insights from Statistics Canada's Statistical Integration Methods Division Disaggregated Data Action Plan Research Project Archived
Articles and reports: 11-522-X202500100013
Description: As part of answering the call to action for the United Nations' (UN) 17 Sustainable Development Goals, as well as addressing social, economic, and equity challenges within Canada, Statistics Canada's five-year development phase for the Disaggregated Data Action Plan (DDAP) was funded in 2021 to support data driven decision around these challenges. In turn, the document "Guiding Principles: Leveraging the 2021 Census of Populations Data for DDAP Groups of Interest" were created. The guiding principles document explains the organizational framework of the DDAP in the Agency, describes existing data sources, addresses ethical and privacy concerns, and centralizes sampling methods tailored for DDAP initiatives while accounting for characteristics which can complicate sampling and data collection procedures.
Release date: 2025-09-08
37. On the Interplay of Legal Requirements, Quality Aspects and Ethical Risks when using Machine Learning in German Official Statistics Archived
Articles and reports: 11-522-X202500100014
Description: Artificial intelligence (AI) with its subfield machine learning (ML) has found its way into administration in general and also into official statistics in Germany in particular. This paper highlights the ethical issues that may arise when using AI/ML in official statistics and examines whether a separate ethical framework is needed to deal with these issues appropriately, as is proposed by institutions of other countries and intergovernmental institutions related to official statistics. The results of the study are presented to show that the implementation of the requirements of the existing and mostly non-AI/ML-specific frames of reference such as law and quality is already sufficient to adequately address the ethical issues based on risk scenarios.
Release date: 2025-09-08
38. Statistical Disclosure Control Analysis for Small Area Estimation Archived
Articles and reports: 11-522-X202500100015
Description: Currently, Statistics Canada has no official guidance on confidentiality rules for releasing small area estimate. In recent years, there has been increasing demand from Research Data Centre (RDC) researchers for comprehensive confidentiality guidelines such that they can publish small area estimates in their research. This confidentiality analysis applies to area-level small area estimation.
Release date: 2025-09-08
39. Synthetic Data Disclosure Risk Assessment: A Literature Review Archived
Articles and reports: 11-522-X202500100016
Description: The adoption of synthetic data generation as a confidentiality measure is increasing in statistical agencies worldwide, including at Statistics Canada. This approach provides an alternative to the traditional dissemination of anonymized public microdata files, offering both privacy protection and data utility. However, the creation of synthetic data presents challenges in assessing and mitigating disclosure risks. This paper reviews the different types of disclosure risks, that being attribute, membership and identity disclosure, and presents some of the associated methods for measuring risk. The paper presents prominent risk assessment metrics and discusses practical methods for disclosure control in data synthesis. Methods for assessing disclosure risks usually produce a metric that can be used to gauge the risk, but there is little consensus on threshold values for these metrics. It is also important to focus on importance of balancing utility and confidentiality, which needs further discussion in context of these methods. The paper concludes by offering insights and recommendations about managing disclosure risk while creating synthetic data as well as providing some ideas on future directions for research and practical implications for managing disclosure risks in synthetic data.
Release date: 2025-09-08
40. Exploration of Deep Learning Synthetic Data Generation for Sensitive Utility Data Sharing Archived
Articles and reports: 11-522-X202500100017
Description: Utilities hold crucial information about energy usage and building characteristics which can be utilized by government agencies to improve their corresponding analytics. However, this data is associated with private customer records and thus the building data and energy usage may be too sensitive to share. Often, high-level aggregated versions of this data are shared through robust contracts, limiting the statistics that can be derived. With the advancement of generative machine learning techniques, Statistics Canada and Natural Resources Canada have explored the feasibility of using these models to produce synthetic versions of utility data which may be shared in full to requesting organizations. These synthetic datasets can be created by a utility company through a locally run program and the outputs can be approved before being sent. This work has identified that certain generative models can feasibly be used by utilities to generate new versions of a dataset and has identified the issues which must be addressed prior to implementing this in practice. Both tabular and time-series models have been tested for different data sharing scenarios, where the TimeGAN model successfully captured the general energy peaks and valleys over a given day with reasonable computational requirements. Although this process takes days for annual energy amounts over thousands of customer records, this can enable new data sharing initiatives between utilities and National Statistical Offices while managing privacy risks. As work progresses in future phases with real utility partners, trust can be built for these approaches, and they can begin being tested on real data by actual data holders.
Release date: 2025-09-08

Reference (380)

Reference (380) (360 to 370 of 380 results)

361. The Survey of Labour and Income Dynamics (SLID) Coding Structure: Mother Tongue, Country of Birth and Ethnic Background Archived
Surveys and statistical programs – Documentation: 75F0002M1995001
Description:
This paper presents the Survey of Labour and Income Dynamics (SLID) coding structure for the variables on country of birth, mother tongue and ethnic background.
Release date: 1995-12-30
362. The Survey of Labour and Income Dynamics (SLID) Coding Structure: Major Field of Study for Postsecondary Graduates Archived
Surveys and statistical programs – Documentation: 75F0002M1995002
Description:
This paper presents the Survey of Labour and Income Dynamics (SLID) coding structure for the major fields of study for postsecondary graduates. It uses data collected in the 1991 Census of Population.
Release date: 1995-12-30
363. The Survey of Labour and Income Dynamics (SLID) Labour Interview Questionnaire: January 1995 Archived
Surveys and statistical programs – Documentation: 75F0002M1995003
Description:
This paper presents the structure and questions of the January 1995 labour interview. It also discusses changes made to the labour interview between 1994 and 1995.
Release date: 1995-12-30
364. The Survey of Labour and Income Dynamics (SLID) Questionnaire for Demographics and Contact: 1995 Archived
Surveys and statistical programs – Documentation: 75F0002M1995004
Description:
This paper presents information on the demographics and contact module questionnaire for the 1995 Survey of Labour and Income Dynamics (SLID).
Release date: 1995-12-30
365. 1995 Preliminary Interview Questionnaire Archived
Surveys and statistical programs – Documentation: 75F0002M1995005
Description:
This paper presents the questions, responses and question flow for the 1995 Survey of Labour and Income Dynamics (SLID) preliminary interview.
Release date: 1995-12-30
366. Some Effects of Computer-assisted Interviewing on the Data Quality of the Survey of Labour and Income Dynamics Archived
Surveys and statistical programs – Documentation: 75F0002M1995007
Description:
This paper describes the impact of computer-assisted interviewing (CAI) on the quality of the Survey of Labour and Income Dynamics (SLID) data in three content areas: labour force activity, respondent-sensitive sources of income, and household relationships.
Release date: 1995-12-30
367. Questionnaire and Collection Procedures for the Survey of Labour and Income Dynamics (SLID) Income Data Collection: May 1995 Archived
Surveys and statistical programs – Documentation: 75F0002M1995012
Description:
This paper describes the Survey of Labour and Income Dynamics (SLID) income data collection procedures and provides an overview of the interview process. May 1995 was the first year respondents could choose to carry out the interview as in the previous year, or they could grant permission for Statistics Canada to access their income tax returns from Revenue Canada and forego the interview.
Release date: 1995-12-30
368. Surfing the Survey of Labour and Income Dynamics (SLID) Application Archived
Surveys and statistical programs – Documentation: 75F0002M1995015
Description:
This research paper represents a reference guide for the contact and demographic modules of the Survey of Labour and Income Dynamics (SLID).
Release date: 1995-12-30
369. Summary of Interviewers Debriefing Questionnaires: 1995 May Income Interview Archived
Surveys and statistical programs – Documentation: 75F0002M1995017
Description:
This paper presents comments from a sample of interviewers from each regional office who were selected to complete a debriefing questionnaire to evaluate the Survey of Labour and Income Dynamics (SLID) 1995 income interview.
Release date: 1995-12-30
370. First Microdata File of the Survey of Labour and Income Dynamics: Expected Content Archived
Surveys and statistical programs – Documentation: 75F0002M1995018
Description:
This paper presents a preview of the variables on the first microdata file of the Survey of Labour and Income Dynamics.
Release date: 1995-12-30

Date modified:: 2026-05-14

Language selection

WxT Language switcher

Search and menus

WxT Search form

Statistical methods

Key indicators

Selected geographical area:Canada

Selected geographical area:Newfoundland and Labrador

Selected geographical area:Prince Edward Island

Selected geographical area:Nova Scotia

Selected geographical area:New Brunswick

Selected geographical area:Quebec

Selected geographical area:Ontario

Selected geographical area:Manitoba

Selected geographical area:Saskatchewan

Selected geographical area:Alberta

Selected geographical area:British Columbia

Selected geographical area:Yukon

Selected geographical area:Northwest Territories

Selected geographical area:Nunavut

Filter results by

Keyword(s)

Subject

Results

All (2,478) (2,470 to 2,480 of 2,478 results)

Data (10) ((10 results))

Analysis (2,036) (30 to 40 of 2,036 results)

Reference (380) (360 to 370 of 380 results)

Statistical methods

Key indicators

Selected geographical area:Canada

Selected geographical area:Newfoundland and Labrador

Selected geographical area:Prince Edward Island

Selected geographical area:Nova Scotia

Selected geographical area:New Brunswick

Selected geographical area:Quebec

Selected geographical area:Ontario

Selected geographical area:Manitoba

Selected geographical area:Saskatchewan

Selected geographical area:Alberta

Selected geographical area:British Columbia

Selected geographical area:Yukon

Selected geographical area:Northwest Territories

Selected geographical area:Nunavut

Filter results by

Keyword(s)

Subject

Results

All (2,478) (2,470 to 2,480 of 2,478 results)

Data (10) ((10 results))

Analysis (2,036) (30 to 40 of 2,036 results)

Reference (380) (360 to 370 of 380 results)

How are the results ordered?

How are the results ordered?

How do I use the filters and the search box?

How do I refine my search?

How does the search work?