Statistical methods

Skip to filters. View results.

Key indicators

Changing any selection will automatically update the page content.

Selected geographical area:Canada

Selected geographical area:Newfoundland and Labrador

Selected geographical area:Prince Edward Island

Selected geographical area:Nova Scotia

Selected geographical area:New Brunswick

Selected geographical area:Quebec

Selected geographical area:Ontario

Selected geographical area:Manitoba

Selected geographical area:Saskatchewan

Selected geographical area:Alberta

Selected geographical area:British Columbia

Selected geographical area:Yukon

Selected geographical area:Northwest Territories

Selected geographical area:Nunavut

Sort Help
entries

Results

All (2,478)

All (2,478) (2,450 to 2,460 of 2,478 results)

  • Articles and reports: 12-001-X197600100002
    Description: A special class of missing data problems is discussed, namely that of typical survey data whereby zeros dominate the multivariate response space. Here, techniques which impute means (whether conditional or unconditional) distort rather than improve the quality of the data. A probabilistic model is described which provides reasonable estimates, but also upholds the integrity of the data base. Results are given from a comparative study of the proposed methodology with other estimation/imputation models.
    Release date: 1976-06-14

  • 2,452. Raking ratio estimators Archived
    Articles and reports: 12-001-X197600100003
    Description: This paper presents large sample results for the bias and variance of raking-ratio estimators for up to four iterations. Estimators of the bias and variance are also presented. An expression for the asymptotic covariance matrix of the maximum likelihood estimators of the cell proportions in a two-way table with known marginals is also given.
    Release date: 1976-06-14

  • Articles and reports: 12-001-X197600100004
    Description: With the recent review of the Labour Force Survey, several periphexal projects have been redesigned. This is the case with the LFS re-interview program which will for the coming years be oriented toward the measurement of response errors. This paper describes the new design of the program and discusses how data will be analysed to achieve the objectives.
    Release date: 1976-06-14

  • Articles and reports: 12-001-X197600100005
    Description: This paper presents the Behrens-Fisher problem and gives an overview of the major solutions brought forward to this date. The aim of the paper is to use the most appropriate approach to the problem for testing sets of six month Labour Force Survey data against those of a pilot study. This is done since in many cases (such as Methods Test Panel studies) studies are conducted for six consecutive months and comparisons are required on the basis of those sets of six month data. Empirical results are also given by testing Methods Test Panel Phase III data against corresponding Labour Force Survey data.
    Release date: 1976-06-14

  • Articles and reports: 12-001-X197600100006
    Description: Multi-stage statistical surveys as a means of obtaining socioeconomic characteristics for the population have been in use for many years. Each survey requires an extensive and precise sample design which is governed by the cost structure for obtaining the data and the variance of the characteristic data between units at various stages of sampling. The authors analyzed variance components derived from one month's data of the Canadian Labour Force Survey and examined the variance that would have resulted under different allocation strategies in Table 6 and for different average sizes of units in Table 7. The percentage components of variance, the design effects by stage of sampling and population variances between units of the various stages, as well as measures of homogeneity for households within stages, are derived and shown in Tables 2 to 5.

    The analysis was carried out for the Canadian Labour Force Survey, but the methodology of component of variance estimation (Gray [4]) and the methods used to analyze the results of a particular survey are readily applied to any multi-stage statistical sample survey, where Horvitz-Thompsen estimators and ratio estimation are applied.
    Release date: 1976-06-14

  • Articles and reports: 12-001-X197500254826
    Description: Exact formulae for bias and mean square error of an estimator of process average in single sampling with rectification for finite lots are obtained. Efficiency of the estimator as compared to an unbiased estimator based on the first sample is obtained for a number of values of lot size, sample size, acceptance number and process average used in sampling plans in quality control of data processing.
    Release date: 1975-12-15

  • Articles and reports: 12-001-X197500254827
    Description: In the Methods Test Panel Phase II it was required to do analysis of variance on proportions. Since such analysis gives only approximate results, two models were used in order to be able to draw safe conclusions. Analysis of variance was performed with the proportions as variable and also with the arc sine of the square root of the proportions. The two models are outlined in the present paper and empirical comparisons are made using the MTP Phase II data.
    Release date: 1975-12-15

  • Articles and reports: 12-001-X197500254828
    Description: The Canadian Travel Survey, 1971 was the largest survey on travel of Canadian residents. This paper describes some important aspects of the methodology. Particular emphasis is given to the development of definitions in relation to the methodology, the sampling technique and interview strategy.
    Release date: 1975-12-15

  • Articles and reports: 12-001-X197500254829
    Description: J.N.K. Rao (1975) derived a general formula for estimating the variance in multistage sample designs. This general formula extends the previous results by Des Raj (1966) to the case where the conditional variance from a given primary sampling unit is a random variable. The authors reviewed Rao's paper for its application to Horvitz-Thompson and Yates-Grundy variance estimators as well as the variance estimator for the random group method by Rao, Hartley and Cochran (1962). The authors present an altered version of the Yates-Grundy variance estimators as a result of Rao's paper.
    Release date: 1975-12-15

  • Articles and reports: 12-001-X197500254830
    Description: This paper focuses on the improvement of sample survey estimates in the particular situation where the survey sample, or part of it, is included in a larger sample from which auxiliary information is available. The properties of a method of estimation - sometimes applied in specific circumstances - are investigated and the limitations of its application are found. The application of the method to rotation designs in continuing surveys is more closely studied in the context of composite estimation.
    Release date: 1975-12-15
Data (10)

Data (10) ((10 results))

  • Public use microdata: 89F0002X
    Description: The SPSD/M is a static microsimulation model designed to analyse financial interactions between governments and individuals in Canada. It can compute taxes paid to and cash transfers received from government. It is comprised of a database, a series of tax/transfer algorithms and models, analytical software and user documentation.
    Release date: 2026-02-12

  • Profile of a community or region: 46-26-0002
    Description: The National Address Register (NAR) is a list of commercial and residential addresses in Canada that are extracted from Statistics Canada's Building Register and deemed non-confidential.
    Release date: 2025-12-19

  • Table: 89-26-0006
    Description: PASSAGES is an open-source dynamic microsimulation model aimed at supporting policy analysis and research relating to Canadian retirement income system outcomes at the individual and family level. The publicly available version includes a synthetic starting database, a model, and documentation. A confidential starting database is also available.
    Release date: 2025-03-12

  • Data Visualization: 71-607-X2020010
    Description: The Canadian Statistical Geospatial Explorer empowers users to discover geo enabled data holdings of Statistics Canada at various levels of geography including at the neighbourhood level. Users are able to visualize, thematically map, spatially explore and analyze, export and consume data in various formats. Users can also view the data superimposed on satellite imagery, topographic and street layers.
    Release date: 2024-08-21

  • Table: 11-10-0074-01
    Geography: Census tract
    Frequency: Occasional
    Description:

    The divergence index (D-index) describes the degree that families with different income levels are mixing together in neighbourhoods. It compares neighbourhood (census tract, CT) discrete income distributions to a base distribution, which is the income quintiles of the neighbourhood’s census metropolitan area (CMA).

    Release date: 2020-06-22

  • Data Visualization: 71-607-X2019010
    Description: The Housing Data Viewer is a visualization tool that allows users to explore Statistics Canada data on a map. Users can use the tool to navigate, compare and export data.
    Release date: 2019-10-30

  • Table: 53-500-X
    Description:

    This report presents the results of a pilot survey conducted by Statistics Canada to measure the fuel consumption of on-road motor vehicles registered in Canada. This study was carried out in connection with the Canadian Vehicle Survey (CVS) which collects information on road activity such as distance traveled, number of passengers and trip purpose.

    Release date: 2004-10-21

  • Table: 13-220-X
    Description: In the 1997 edition, new and revised benchmarks were introduced for 1992 and 1988. The indicators are used to monitor supply, demand and employment for tourism in Canada on a timely basis. The annual tables are derived using the National Income and Expenditure Accounts (NIEA) and various industry and travel surveys. Tables providing actual data and percentage changes, for seasonally adjusted current and constant price estimates are included. In addition, an analytical section provides graphs, and time series of first differences, percentage changes, and seasonal factors for selected indicators. Data are published from 1987 and the publication will be available on the day of release. New data are included in the demand tables for non-tourism commodities produced by non-tourism industries and in the employment tables covering direct tourism employment generated by non-tourism industries. This product was commissioned by the Canadian Tourism Commission to provide annual updates for the Tourism Satellite Account.
    Release date: 2003-01-08

  • Table: 11-516-X
    Description:

    The second edition of Historical statistics of Canada was jointly produced by the Social Science Federation of Canada and Statistics Canada in 1983. This volume contains about 1,088 statistical tables on the social, economic and institutional conditions of Canada from the start of Confederation in 1867 to the mid-1970s. The tables are arranged in sections with an introduction explaining the content of each section, the principal sources of data for each table, and general explanatory notes regarding the statistics. In most cases, there is sufficient description of the individual series to enable the reader to use them without consulting the numerous basic sources referenced in the publication.

    The electronic version of this historical publication is accessible on the Internet site of Statistics Canada as a free downloadable document: text as HTML pages and all tables as individual spreadsheets in a comma delimited format (CSV) (which allows online viewing or downloading).

    Release date: 1999-07-29

  • Table: 82-567-X
    Description:

    The National Population Health Survey (NPHS) is designed to enhance the understanding of the processes affecting health. The survey collects cross-sectional as well as longitudinal data. In 1994/95 the survey interviewed a panel of 17,276 individuals, then returned to interview them a second time in 1996/97. The response rate for these individuals was 96% in 1996/97. Data collection from the panel will continue for up to two decades. For cross-sectional purposes, data were collected for a total of 81,000 household residents in all provinces (except people on Indian reserves or on Canadian Forces bases) in 1996/97.

    This overview illustrates the variety of information available by presenting data on perceived health, chronic conditions, injuries, repetitive strains, depression, smoking, alcohol consumption, physical activity, consultations with medical professionals, use of medications and use of alternative medicine.

    Release date: 1998-07-29
Analysis (2,036)

Analysis (2,036) (30 to 40 of 2,036 results)

  • Articles and reports: 11-522-X202500100008
    Description: In 2020, Statistics Canada started to use probabilistic web panels as an alternate method of collecting official statistics. In a web panel, respondents to another survey are asked for contact information to participate in future short surveys. This paper will highlight Statistics Canada's experience with panels after 4 years, including what has been learned about the recruitment of panel participants and how to subsequently collect data using panel surveys. The ways in which recruitment questions are presented can result in very different rates of participation. Moreover, the wealth of auxiliary information available on the recruitment survey can be used to actively manage panel collection operations, by predicting the probability of response and using this information to target follow-up efforts.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100009
    Description: Three series of web panels were implemented at Statistics Canada from 2020 to 2024. Participants for these web panel series were recruited from respondents of large probabilistic social surveys (recruitment surveys), and subsequently were invited to complete a series of short online surveys. Estimates of recruitment survey variables were calculated using both recruitment survey weights and web panel weights, and these were compared; differences signal the possibility of residual bias that was not corrected by the web panel weighting process. This investigation found more significant differences than would be expected if the web panel estimator fully corrected for the bias resulting from the web panel response process. Questions related to certain topics such as politics and voting, sense of belonging, and media consumption were found to have the most significant differences between web panel estimates and recruitment survey estimates.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100010
    Description: Statistics Canada's Labour Force Survey (LFS) plays an essential role in the estimation of labour market conditions in Canada. Periodically, LFS revises its data to the most recent industry and occupational classification versions. Differences in versions can be extensive, including high-level and unit-group structural changes, creations, deletions, split-offs and combination of classification units (classes). Historically, to reconcile split-off classes - where one class splits into multiple classes - a sample of LFS split-off records would be manually recoded to the new classification version. Based on the split-off proportion observed in the recoded sample, a random allocation method would be applied on all data to reflect the changing Canadian labour market over time. This article proposes using machine learning (fastText), constrained to split-off proportions using linear programming, to revise industry and occupation classifications in LFS. The hybrid framework benefits from a text-based revision mechanism while adhering to traditional proportions driven estimates, thus ensuring a minimal impact on the comparability of published labour market indicators.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100011
    Description: The use of modern "data"-driven imputation methods to treat non-response in the context of surveys processed in the Integrated Business Statistics Program at Statistics Canada has previously been explored. It was observed that these methods can lead to high quality imputation and further have the potential to result in broad efficiencies when setting up a particular survey's edit and imputation strategy. However, estimation of the associated total variance, more specifically the component due to imputation, remains a challenge. In this article, two methods for estimation of total variance are proposed and show preliminary results that have motivated us to pursue further research in this area.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100012
    Description: In 2022, the Institut de la statistique du Québec conducted a survey of high school students in Nunavik, a unique, remote region of Quebec. The survey aimed to develop a portrait of the state of the students' physical and mental health, their lifestyle habits and their environment. This article describes the challenges encountered during the survey and the solutions put in place to overcome them.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100013
    Description: As part of answering the call to action for the United Nations' (UN) 17 Sustainable Development Goals, as well as addressing social, economic, and equity challenges within Canada, Statistics Canada's five-year development phase for the Disaggregated Data Action Plan (DDAP) was funded in 2021 to support data driven decision around these challenges. In turn, the document "Guiding Principles: Leveraging the 2021 Census of Populations Data for DDAP Groups of Interest" were created. The guiding principles document explains the organizational framework of the DDAP in the Agency, describes existing data sources, addresses ethical and privacy concerns, and centralizes sampling methods tailored for DDAP initiatives while accounting for characteristics which can complicate sampling and data collection procedures.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100014
    Description: Artificial intelligence (AI) with its subfield machine learning (ML) has found its way into administration in general and also into official statistics in Germany in particular. This paper highlights the ethical issues that may arise when using AI/ML in official statistics and examines whether a separate ethical framework is needed to deal with these issues appropriately, as is proposed by institutions of other countries and intergovernmental institutions related to official statistics. The results of the study are presented to show that the implementation of the requirements of the existing and mostly non-AI/ML-specific frames of reference such as law and quality is already sufficient to adequately address the ethical issues based on risk scenarios.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100015
    Description: Currently, Statistics Canada has no official guidance on confidentiality rules for releasing small area estimate. In recent years, there has been increasing demand from Research Data Centre (RDC) researchers for comprehensive confidentiality guidelines such that they can publish small area estimates in their research. This confidentiality analysis applies to area-level small area estimation.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100016
    Description: The adoption of synthetic data generation as a confidentiality measure is increasing in statistical agencies worldwide, including at Statistics Canada. This approach provides an alternative to the traditional dissemination of anonymized public microdata files, offering both privacy protection and data utility. However, the creation of synthetic data presents challenges in assessing and mitigating disclosure risks. This paper reviews the different types of disclosure risks, that being attribute, membership and identity disclosure, and presents some of the associated methods for measuring risk. The paper presents prominent risk assessment metrics and discusses practical methods for disclosure control in data synthesis. Methods for assessing disclosure risks usually produce a metric that can be used to gauge the risk, but there is little consensus on threshold values for these metrics. It is also important to focus on importance of balancing utility and confidentiality, which needs further discussion in context of these methods. The paper concludes by offering insights and recommendations about managing disclosure risk while creating synthetic data as well as providing some ideas on future directions for research and practical implications for managing disclosure risks in synthetic data.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100017
    Description: Utilities hold crucial information about energy usage and building characteristics which can be utilized by government agencies to improve their corresponding analytics. However, this data is associated with private customer records and thus the building data and energy usage may be too sensitive to share. Often, high-level aggregated versions of this data are shared through robust contracts, limiting the statistics that can be derived. With the advancement of generative machine learning techniques, Statistics Canada and Natural Resources Canada have explored the feasibility of using these models to produce synthetic versions of utility data which may be shared in full to requesting organizations. These synthetic datasets can be created by a utility company through a locally run program and the outputs can be approved before being sent. This work has identified that certain generative models can feasibly be used by utilities to generate new versions of a dataset and has identified the issues which must be addressed prior to implementing this in practice. Both tabular and time-series models have been tested for different data sharing scenarios, where the TimeGAN model successfully captured the general energy peaks and valleys over a given day with reasonable computational requirements. Although this process takes days for annual energy amounts over thousands of customer records, this can enable new data sharing initiatives between utilities and National Statistical Offices while managing privacy risks. As work progresses in future phases with real utility partners, trust can be built for these approaches, and they can begin being tested on real data by actual data holders.
    Release date: 2025-09-08
Reference (380)

Reference (380) (0 to 10 of 380 results)

  • Surveys and statistical programs – Documentation: 19-20-00012026001
    Description: This reference document provides nontechnical answers on selected topics related to the use and interpretation of seasonally adjusted data. It is designed to complement more technical discussions of seasonal adjustment found in Statistics Canada publications and reference manuals.
    Release date: 2026-05-11

  • Surveys and statistical programs – Documentation: 19-20-0001
    Description: Documents in this series provide insight into the statistical methods used by Statistics Canada to produce official statistics. They include introductory material, in-depth descriptions of techniques and methods, best practices, and guidelines. All documents have undergone review to ensure that they conform to Statistics Canada's mandate and adhere to generally accepted methodological standards and practices.
    Release date: 2026-05-11

  • Notices and consultations: 13-605-X
    Description: This product contains articles related to the latest methodological, conceptual developments in the Canadian System of Macroeconomic Accounts as well as the analysis of the Canadian economy. It includes articles detailing new methods, concepts and statistical techniques used to compile the Canadian System of Macroeconomic Accounts. It also includes information related to new or expanded data products, provides updates and supplements to information found in various guides and analytical articles touching upon a broad range of topics related to the Canadian economy.
    Release date: 2026-05-04

  • Surveys and statistical programs – Documentation: 11-633-X2026002
    Description: Recent changes in Canada’s immigration levels have heightened interest in understanding how immigration affects housing demand. This article develops a methodological framework for projecting housing use associated with permanent residents (PRs) and non-permanent residents (NPRs) under alternative immigration scenarios. The framework applies observed per capita housing use rates from the Census of Population to estimate incremental housing use by tenure over time.
    Release date: 2026-04-24

  • Surveys and statistical programs – Documentation: 11-633-X2026001
    Description: This report defines key concepts related to area-level analysis and introduces area-level measures developed and utilized at Statistics Canada for health analysis. It also provides a decision-making framework and practical recommendations to help researchers select appropriate methods. The goal is to guide readers on when area-level analysis is appropriate and what type of area-level measure is suitable to achieve research objectives.
    Release date: 2026-03-05

  • Surveys and statistical programs – Documentation: 91-528-X
    Description: The Technical Guide on Demographic Estimates at Statistics Canada provides detailed descriptions of the most current data sources and methods used by the Centre for demography at Statistics Canada to produce demographic estimates as part of the Demographic estimates program. They comprise postcensal and intercensal population estimates; base population; births and deaths; immigrants; emigrants; returning emigrants; non-permanent residents; interprovincial migration; subprovincial estimates of population and intraprovincial migration; population estimates by age and gender; and census family estimates. A glossary of commonly used terms is available at the end of the guide.
    Release date: 2025-12-17

  • Surveys and statistical programs – Documentation: 11-633-X2025004
    Description: The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 40 years.
    Release date: 2025-12-08

  • Surveys and statistical programs – Documentation: 89-657-X2025002
    Description: The Survey on the Official Language Minority Population (SOLMP) user guide contains a description of the survey, along with survey concepts and definitions and an overview of the content development. The target and survey populations, the sample design and sample size are described in the Methodology section, while the Data Collection module provides the collection period and instrument, modes of collection, collection and communications strategies and response rates.

    Updates to the guide include descriptions of the survey data processing, survey error and weighting, and guidelines for tabulations and analysis. Appendices will provide a listing of questions and variables which changed between the current and previous occasions of the survey, as well as various primers on the survey methodology.
    Release date: 2025-11-14

  • Surveys and statistical programs – Documentation: 32-26-0008
    Description: This report describes the main changes, additions or deletions to the Census of Agriculture questionnaire by topic and in the order they appear on the questionnaire.
    Release date: 2025-07-04

  • Surveys and statistical programs – Documentation: 98-20-00052026004
    Description: This report provides detailed insight into the design and methodology of the content test component of the 2024 Census Test. This test evaluated changes to the wording and flow of some questions, as well as the potential addition of new questions, to help determine the content of the 2026 Census of Population.
    Release date: 2025-07-04