Statistical methods
Results
All (2,478) (40 to 50 of 2,478 results)
- Articles and reports: 11-522-X202500100009
  Description: Three series of web panels were implemented at Statistics Canada from 2020 to 2024. Participants for these web panel series were recruited from respondents of large probabilistic social surveys (recruitment surveys), and subsequently were invited to complete a series of short online surveys. Estimates of recruitment survey variables were calculated using both recruitment survey weights and web panel weights, and these were compared; differences signal the possibility of residual bias that was not corrected by the web panel weighting process. This investigation found more significant differences than would be expected if the web panel estimator fully corrected for the bias resulting from the web panel response process. Questions related to certain topics such as politics and voting, sense of belonging, and media consumption were found to have the most significant differences between web panel estimates and recruitment survey estimates.
  Release date: 2025-09-08
- 42. Life in the FastText Lane: Harnessing Linear Programming Constrained Machine Learning for Classifications Revision (Archived)
  Articles and reports: 11-522-X202500100010
  Description: Statistics Canada's Labour Force Survey (LFS) plays an essential role in the estimation of labour market conditions in Canada. Periodically, LFS revises its data to the most recent industry and occupational classification versions. Differences in versions can be extensive, including high-level and unit-group structural changes, creations, deletions, split-offs and combinations of classification units (classes). Historically, to reconcile split-off classes - where one class splits into multiple classes - a sample of LFS split-off records would be manually recoded to the new classification version. Based on the split-off proportion observed in the recoded sample, a random allocation method would be applied on all data to reflect the changing Canadian labour market over time. This article proposes using machine learning (fastText), constrained to split-off proportions using linear programming, to revise industry and occupation classifications in LFS. The hybrid framework benefits from a text-based revision mechanism while adhering to traditional proportion-driven estimates, thus ensuring a minimal impact on the comparability of published labour market indicators.
  Release date: 2025-09-08
- 43. Data-driven Imputation Strategies and their Associated Quality Indicators in Economic Surveys (Archived)
  Articles and reports: 11-522-X202500100011
  Description: The use of modern data-driven imputation methods to treat non-response in the context of surveys processed in the Integrated Business Statistics Program at Statistics Canada has previously been explored. It was observed that these methods can lead to high quality imputation and further have the potential to result in broad efficiencies when setting up a particular survey's edit and imputation strategy. However, estimation of the associated total variance, more specifically the component due to imputation, remains a challenge. In this article, two methods for estimating total variance are proposed, along with preliminary results that have motivated us to pursue further research in this area.
  Release date: 2025-09-08
- Articles and reports: 11-522-X202500100012
  Description: In 2022, the Institut de la statistique du Québec conducted a survey of high school students in Nunavik, a unique, remote region of Quebec. The survey aimed to develop a portrait of the state of the students' physical and mental health, their lifestyle habits and their environment. This article describes the challenges encountered during the survey and the solutions put in place to overcome them.
  Release date: 2025-09-08
- Articles and reports: 11-522-X202500100013
  Description: As part of answering the call to action for the United Nations' (UN) 17 Sustainable Development Goals, as well as addressing social, economic, and equity challenges within Canada, Statistics Canada's five-year development phase for the Disaggregated Data Action Plan (DDAP) was funded in 2021 to support data-driven decision making around these challenges. In turn, the document "Guiding Principles: Leveraging the 2021 Census of Populations Data for DDAP Groups of Interest" was created. The guiding principles document explains the organizational framework of the DDAP in the Agency, describes existing data sources, addresses ethical and privacy concerns, and centralizes sampling methods tailored for DDAP initiatives while accounting for characteristics which can complicate sampling and data collection procedures.
  Release date: 2025-09-08
- Articles and reports: 11-522-X202500100014
  Description: Artificial intelligence (AI) with its subfield machine learning (ML) has found its way into administration in general and also into official statistics in Germany in particular. This paper highlights the ethical issues that may arise when using AI/ML in official statistics and examines whether a separate ethical framework is needed to deal with these issues appropriately, as is proposed by institutions of other countries and intergovernmental institutions related to official statistics. The results of the study are presented to show that the implementation of the requirements of the existing and mostly non-AI/ML-specific frames of reference such as law and quality is already sufficient to adequately address the ethical issues based on risk scenarios.
  Release date: 2025-09-08
- Articles and reports: 11-522-X202500100015
  Description: Currently, Statistics Canada has no official guidance on confidentiality rules for releasing small area estimates. In recent years, there has been increasing demand from Research Data Centre (RDC) researchers for comprehensive confidentiality guidelines so that they can publish small area estimates in their research. This confidentiality analysis applies to area-level small area estimation.
  Release date: 2025-09-08
- Articles and reports: 11-522-X202500100016
  Description: The adoption of synthetic data generation as a confidentiality measure is increasing in statistical agencies worldwide, including at Statistics Canada. This approach provides an alternative to the traditional dissemination of anonymized public microdata files, offering both privacy protection and data utility. However, the creation of synthetic data presents challenges in assessing and mitigating disclosure risks. This paper reviews the different types of disclosure risk, namely attribute, membership and identity disclosure, and presents some of the associated methods for measuring risk. The paper presents prominent risk assessment metrics and discusses practical methods for disclosure control in data synthesis. Methods for assessing disclosure risks usually produce a metric that can be used to gauge the risk, but there is little consensus on threshold values for these metrics. It is also important to balance utility and confidentiality, which needs further discussion in the context of these methods. The paper concludes by offering insights and recommendations about managing disclosure risk while creating synthetic data, as well as some ideas on future directions for research and practical implications for managing disclosure risks in synthetic data.
  Release date: 2025-09-08
- 49. Exploration of Deep Learning Synthetic Data Generation for Sensitive Utility Data Sharing (Archived)
  Articles and reports: 11-522-X202500100017
  Description: Utilities hold crucial information about energy usage and building characteristics which can be utilized by government agencies to improve their corresponding analytics. However, this data is associated with private customer records and thus the building data and energy usage may be too sensitive to share. Often, high-level aggregated versions of this data are shared through robust contracts, limiting the statistics that can be derived. With the advancement of generative machine learning techniques, Statistics Canada and Natural Resources Canada have explored the feasibility of using these models to produce synthetic versions of utility data which may be shared in full with requesting organizations. These synthetic datasets can be created by a utility company through a locally run program and the outputs can be approved before being sent. This work has identified that certain generative models can feasibly be used by utilities to generate new versions of a dataset and has identified the issues which must be addressed prior to implementing this in practice. Both tabular and time-series models have been tested for different data sharing scenarios, where the TimeGAN model successfully captured the general energy peaks and valleys over a given day with reasonable computational requirements. Although this process takes days for annual energy amounts over thousands of customer records, this can enable new data sharing initiatives between utilities and National Statistical Offices while managing privacy risks. As work progresses in future phases with real utility partners, trust can be built for these approaches, and they can begin being tested on real data by actual data holders.
  Release date: 2025-09-08
- Articles and reports: 11-522-X202500100018
  Description: The Child Poverty Reduction Act (2018) outlines a need for the New Zealand Government to set three- and ten-yearly persistent child poverty reduction targets by the end of 2024. In the absence of longitudinal survey data, this article outlines a survey-administrative data hybrid method that will facilitate the production of these reduction targets and of official estimates of persistent child poverty once reporting is required from the 2025/2026 financial year onwards. This hybrid approach leverages the cross-sectional Household Economic Survey (HES), administrative-based beneficiary's family data, and recent advances developed for the construction of households within the Administrative Population Census (APC) at Statistics New Zealand. With increasing data collection challenges due to rising non-response and costs, this survey-admin hybrid method represents an alternative to longitudinal survey data collection, ensuring ongoing sustainable and quality statistics to produce persistent child poverty estimates.
  Release date: 2025-09-08
Data (10)
- Public use microdata: 89F0002X
  Description: The SPSD/M is a static microsimulation model designed to analyse financial interactions between governments and individuals in Canada. It can compute taxes paid to and cash transfers received from government. It is comprised of a database, a series of tax/transfer algorithms and models, analytical software and user documentation.
  Release date: 2026-02-12
- Profile of a community or region: 46-26-0002
  Description: The National Address Register (NAR) is a list of commercial and residential addresses in Canada that are extracted from Statistics Canada's Building Register and deemed non-confidential.
  Release date: 2025-12-19
- Table: 89-26-0006
  Description: PASSAGES is an open-source dynamic microsimulation model aimed at supporting policy analysis and research relating to Canadian retirement income system outcomes at the individual and family level. The publicly available version includes a synthetic starting database, a model, and documentation. A confidential starting database is also available.
  Release date: 2025-03-12
- 4. Canadian Statistical Geospatial Explorer Hub (Archived)
  Data Visualization: 71-607-X2020010
  Description: The Canadian Statistical Geospatial Explorer empowers users to discover geo-enabled data holdings of Statistics Canada at various levels of geography, including at the neighbourhood level. Users are able to visualize, thematically map, spatially explore and analyze, export and consume data in various formats. Users can also view the data superimposed on satellite imagery, topographic and street layers.
  Release date: 2024-08-21
- Table: 11-10-0074-01
  Geography: Census tract
  Frequency: Occasional
  Description: The divergence index (D-index) describes the degree to which families with different income levels are mixing together in neighbourhoods. It compares neighbourhood (census tract, CT) discrete income distributions to a base distribution, which is the income quintiles of the neighbourhood’s census metropolitan area (CMA).
  Release date: 2020-06-22
- 6. Housing Data Viewer (Archived)
  Data Visualization: 71-607-X2019010
  Description: The Housing Data Viewer is a visualization tool that allows users to explore Statistics Canada data on a map. Users can use the tool to navigate, compare and export data.
  Release date: 2019-10-30
- Table: 53-500-X
  Description: This report presents the results of a pilot survey conducted by Statistics Canada to measure the fuel consumption of on-road motor vehicles registered in Canada. This study was carried out in connection with the Canadian Vehicle Survey (CVS) which collects information on road activity such as distance traveled, number of passengers and trip purpose.
  Release date: 2004-10-21
- Table: 13-220-X
  Description: In the 1997 edition, new and revised benchmarks were introduced for 1992 and 1988. The indicators are used to monitor supply, demand and employment for tourism in Canada on a timely basis. The annual tables are derived using the National Income and Expenditure Accounts (NIEA) and various industry and travel surveys. Tables providing actual data and percentage changes, for seasonally adjusted current and constant price estimates are included. In addition, an analytical section provides graphs, and time series of first differences, percentage changes, and seasonal factors for selected indicators. Data are published from 1987 and the publication will be available on the day of release. New data are included in the demand tables for non-tourism commodities produced by non-tourism industries and in the employment tables covering direct tourism employment generated by non-tourism industries. This product was commissioned by the Canadian Tourism Commission to provide annual updates for the Tourism Satellite Account.
  Release date: 2003-01-08
- 9. Historical Statistics of Canada (Archived)
  Table: 11-516-X
  Description: The second edition of Historical statistics of Canada was jointly produced by the Social Science Federation of Canada and Statistics Canada in 1983. This volume contains about 1,088 statistical tables on the social, economic and institutional conditions of Canada from the start of Confederation in 1867 to the mid-1970s. The tables are arranged in sections with an introduction explaining the content of each section, the principal sources of data for each table, and general explanatory notes regarding the statistics. In most cases, there is sufficient description of the individual series to enable the reader to use them without consulting the numerous basic sources referenced in the publication.
  The electronic version of this historical publication is accessible on the Internet site of Statistics Canada as a free downloadable document: text as HTML pages and all tables as individual spreadsheets in a comma delimited format (CSV) (which allows online viewing or downloading).
  Release date: 1999-07-29
- 10. National Population Health Survey Overview (Archived)
  Table: 82-567-X
  Description: The National Population Health Survey (NPHS) is designed to enhance the understanding of the processes affecting health. The survey collects cross-sectional as well as longitudinal data. In 1994/95 the survey interviewed a panel of 17,276 individuals, then returned to interview them a second time in 1996/97. The response rate for these individuals was 96% in 1996/97. Data collection from the panel will continue for up to two decades. For cross-sectional purposes, data were collected for a total of 81,000 household residents in all provinces (except people on Indian reserves or on Canadian Forces bases) in 1996/97.
  This overview illustrates the variety of information available by presenting data on perceived health, chronic conditions, injuries, repetitive strains, depression, smoking, alcohol consumption, physical activity, consultations with medical professionals, use of medications and use of alternative medicine.
  Release date: 1998-07-29
Analysis (2,036) (2,020 to 2,030 of 2,036 results)
- 2,021. Some variance estimators for multistage sampling (Archived)
  Articles and reports: 12-001-X197500254829
  Description: J.N.K. Rao (1975) derived a general formula for estimating the variance in multistage sample designs. This general formula extends the previous results by Des Raj (1966) to the case where the conditional variance from a given primary sampling unit is a random variable. The authors reviewed Rao's paper for its application to Horvitz-Thompson and Yates-Grundy variance estimators as well as the variance estimator for the random group method by Rao, Hartley and Cochran (1962). The authors present an altered version of the Yates-Grundy variance estimators as a result of Rao's paper.
  Release date: 1975-12-15
- 2,022. On the improvement of sample survey estimates (Archived)
  Articles and reports: 12-001-X197500254830
  Description: This paper focuses on the improvement of sample survey estimates in the particular situation where the survey sample, or part of it, is included in a larger sample from which auxiliary information is available. The properties of a method of estimation - sometimes applied in specific circumstances - are investigated and the limitations of its application are found. The application of the method to rotation designs in continuing surveys is more closely studied in the context of composite estimation.
  Release date: 1975-12-15
- 2,023. The telephone experiment in the Canadian Labour Force Survey (Archived)
  Articles and reports: 12-001-X197500254831
  Description: This paper summarizes the results of a telephone experiment conducted in conjunction with the Canadian Labour Force Survey over the period June 1972 to November 1973. Included in the paper is a detailed outline of the purpose and design of the experiment. A discussion of the impact telephone interviewing had on the cost of enumeration, non-response and participation and unemployment rates is given. In addition, interviewer and respondent attitudes toward telephone interviewing are described. Finally, the paper summarizes the experiences gained from this experiment and indicates some areas where further examinations related to telephone interviewing can be carried out.
  Release date: 1975-12-15
- 2,024. On a ratio estimate with post-stratified weighting (Archived)
  Articles and reports: 12-001-X197500254832
  Description: A ratio estimate based on an auxiliary variable is considered for the case when the sample is post-stratified using information on another auxiliary variable. The variance of the ratio estimate is derived by the method of linearization [3,4]. An application to subprovincial estimation in the Canadian Labour Force Survey is discussed.
  Release date: 1975-12-15
- 2,025. Analytic studies of sample survey data (Archived)
  Articles and reports: 12-001-X197500300001
  Description: Most sample surveys in the past have been "descriptive" in the sense that the main objective is the computation of means or totals of a number of characters of interest along with their standard errors. However, in recent years data produced from "descriptive" surveys are also being increasingly used for "analytical" purposes, i.e., for investigating relationships among variables. Also some sample surveys might have primary "analytical goals" in which case the "optimal" designing of such "analytical surveys" becomes important. These lecture notes present an account of some recent developments in the analytical studies of sample survey data. Many challenging problems remain to be solved and I hope these notes will provide stimulation for further research in this important area.
  Release date: 1975-12-15
- 2,026. Measurement of response errors in Censuses and sample surveys (Archived)
  Articles and reports: 12-001-X197500254824
  Description: Madow [1968] has proposed a two-phase sampling scheme under which response bias can be eliminated from sample surveys by obtaining “true” values for a subsample of the original sample. Often in cases of Censuses or ongoing surveys, the subsample data are not used to correct the main survey estimates but to assess their reliability. The main purpose of this paper is to present methods by which reliability estimates can be obtained when true values can be determined for a subsample of units.
  Release date: 1975-12-15
- 2,027. Controlled random rounding (Archived)
  Articles and reports: 12-001-X197500254825
  Description: Random rounding is a technique to ensure confidentiality of aggregate statistics. By randomly rounding all the components of a total, independently, together with the random rounding of the total itself, substantial discrepancies may arise when aggregating the published data. This paper presents a procedure which avoids substantial discrepancies while still protecting the concept of confidentiality.
  Release date: 1975-12-15
- 2,028. The development of an automated estimation system (Archived)
  Articles and reports: 12-001-X197500100001
  Description: Although a survey is designed to satisfy a specific set of survey constraints, some steps involved in designing a survey, such as stratification, sample allocation and sample selection are common to all surveys. The steps involved in the creation of survey design systems are to identify, develop and implement common methods and procedures for such stages which, when taken together, constitute a survey design. The paper describes some methodological considerations in the development of an automated system for three methods of ratio estimation.
  Release date: 1975-06-16
- Articles and reports: 12-001-X197500100002
  Description: In 1962, Hartley and Rao derived an asymptotic formula for the joint probability of selection for samples selected with unequal probability sampling. In 1966, Connor derived an exact formula for this joint probability; however, his formulae were very involved. In the present paper, the authors, using a modification of Connor's formula, derive the exact joint probabilities using a specially designed computer algorithm.
  Release date: 1975-06-16
- 2,030. Sample design of the Family Expenditure Survey (1974) (Archived)
  Articles and reports: 12-001-X197500100003
  Description: In order to monitor changes in expenditure patterns and, if necessary, provide information for a reweighting of the Consumer Price Index, family expenditure surveys have been carried out at approximately two-year intervals since 1953. While all of the Family Expenditure Surveys have utilized the Canadian Labour Force Survey [1] frame, the particular survey in 1974 was designed somewhat differently from earlier surveys in that segments or city blocks were specially selected for the survey and there was strict control on the sample size not adhered to in earlier surveys. The sample design, from the considerations based on the broad requirements of the survey to the details of the sampling procedures, is described in this article.
  Release date: 1975-06-16
Reference (380) (10 to 20 of 380 results)
- Surveys and statistical programs – Documentation: 89-657-X2024009
  Description: The Survey on the Official Language Minority Population (SOLMP) user guide contains a description of the survey, along with survey concepts and definitions and an overview of the content development. The target and survey populations, the sample design and sample size are described in the Methodology section. Finally, in the Data Collection module, the collection period and instrument, modes of collection, collection and communications strategies and response rates are provided.
  Release date: 2024-12-16
- Surveys and statistical programs – Documentation: 11-633-X2024004
  Description: The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 40 years.
  Release date: 2024-12-09
- Surveys and statistical programs – Documentation: 11-633-X2024005
  Description: The Analytical Studies and Modelling Branch (ASMB) is the research, modelling, training and access hub of Statistics Canada. It focuses on leveraging the agency’s vast data holdings to generate in-depth insights that support evidence-based policy making and to enable others to do so through analytical training and data access. The ASMB, like other program areas in the agency, works to support Statistics Canada’s overall mission of delivering insights through data for a better Canada.
  Release date: 2024-12-06
- Surveys and statistical programs – Documentation: 98-303-X
  Description: The Coverage Technical Report will present the errors included in census data that result from persons who are either missed (not enumerated) or enumerated more than once. The population coverage error is one of the most important types of errors because it affects the accuracy of not only population counts, but also all the census data results that describe the characteristics of the population universe.
  Release date: 2024-10-23
- Surveys and statistical programs – Documentation: 89-653-X2024002
  Description: This guide is intended to provide a detailed review of both the 2022 IPS and IPS–NIS with respect to subject matter and methodological approaches. It is designed to help data users by serving as a guide to the concepts and measures of the survey as well as the technical details of the survey’s design, field work and data processing. This guide is meant to provide users with helpful information on how to use and interpret survey results. The discussion on data quality also allows users to review the strengths and limitations of the data for their particular needs.
  Chapter 1 of this guide provides an overview of the 2022 IPS and IPS–NIS by introducing the survey background and objectives. Chapter 2 outlines the survey’s themes and explains the key concepts and definitions used for the survey. Chapters 3 to 6 cover important aspects of the survey methodology, sampling design, data collection and processing. Chapters 7 and 8 review issues of data quality and caution users about comparing 2022 IPS or IPS–NIS data with data from other sources. Chapter 9 outlines the survey products available to the public, including data tables, analytical articles and reference material. The appendices provide a comprehensive list of survey indicators, extra coding categories and standard classifications used on both the IPS and the IPS–NIS. Lastly, a glossary of survey terms and information on confidence intervals are also provided.
  Release date: 2024-08-14
- Surveys and statistical programs – Documentation: 75-514-G
  Description: The Guide to the Job Vacancy and Wage Survey contains a dictionary of concepts and definitions, and covers topics such as survey methodology, data collection, processing, and data quality. The guide covers both components of the survey: the job vacancy component, which is quarterly, and the wage component, which is annual.
  Release date: 2024-06-18
- Surveys and statistical programs – Documentation: 32-26-0007
  Description: Census of Agriculture data provide statistical information on farms and farm operators at fine geographic levels and for small subpopulations. Quality evaluation activities are essential to ensure that census data are reliable and that they meet user needs. This report provides data quality information pertaining to the Census of Agriculture, such as sources of error, error detection, disclosure control methods, data quality indicators, response rates and collection rates.
  Release date: 2024-02-06
- Surveys and statistical programs – Documentation: 11-633-X2024001
  Description: The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 35 years.
  Release date: 2024-01-22
- 19. Labour Force Survey Response Rates, September 2023 (Archived)
  Surveys and statistical programs – Documentation: 75-005-M2023001
  Description: This document provides information on the evolution of response rates for the Labour Force Survey (LFS) and a discussion of the evaluation of two aspects of data quality that ensure the LFS estimates continue providing an accurate portrait of the Canadian labour market.
  Release date: 2023-10-30
- Surveys and statistical programs – Documentation: 98-306-X
  Description: This report describes sampling, weighting and estimation procedures used in the Census of Population. It provides operational and theoretical justifications for them, and presents the results of the evaluations of these procedures.
  Release date: 2023-10-04