Statistical methods

Skip to filters. View results.

Key indicators

Changing any selection will automatically update the page content.

Selected geographical area:Canada

Selected geographical area:Newfoundland and Labrador

Selected geographical area:Prince Edward Island

Selected geographical area:Nova Scotia

Selected geographical area:New Brunswick

Selected geographical area:Quebec

Selected geographical area:Ontario

Selected geographical area:Manitoba

Selected geographical area:Saskatchewan

Selected geographical area:Alberta

Selected geographical area:British Columbia

Selected geographical area:Yukon

Selected geographical area:Northwest Territories

Selected geographical area:Nunavut

Sort Help
entries

Results

All (2,478)

All (2,478) (0 to 10 of 2,478 results)

  • Surveys and statistical programs – Documentation: 19-20-0001
    Description: Documents in this series provide insight into the statistical methods used by Statistics Canada to produce official statistics. They include introductory material, in-depth descriptions of techniques and methods, best practices, and guidelines. All documents have undergone review to ensure that they conform to Statistics Canada's mandate and adhere to generally accepted methodological standards and practices.
    Release date: 2026-05-11

  • Surveys and statistical programs – Documentation: 19-20-00012026001
    Description: This reference document provides nontechnical answers on selected topics related to the use and interpretation of seasonally adjusted data. It is designed to complement more technical discussions of seasonal adjustment found in Statistics Canada publications and reference manuals.
    Release date: 2026-05-11

  • Notices and consultations: 13-605-X
    Description: This product contains articles related to the latest methodological, conceptual developments in the Canadian System of Macroeconomic Accounts as well as the analysis of the Canadian economy. It includes articles detailing new methods, concepts and statistical techniques used to compile the Canadian System of Macroeconomic Accounts. It also includes information related to new or expanded data products, provides updates and supplements to information found in various guides and analytical articles touching upon a broad range of topics related to the Canadian economy.
    Release date: 2026-05-04

  • Journals and periodicals: 11-633-X
    Description: Papers in this series provide background discussions of the methods used to develop data for economic, health, and social analytical studies at Statistics Canada. They are intended to provide readers with information on the statistical methods, standards and definitions used to develop databases for research purposes. All papers in this series have undergone peer and institutional review to ensure that they conform to Statistics Canada's mandate and adhere to generally accepted standards of good professional practice.
    Release date: 2026-04-24

  • Surveys and statistical programs – Documentation: 11-633-X2026002
    Description: Recent changes in Canada’s immigration levels have heightened interest in understanding how immigration affects housing demand. This article develops a methodological framework for projecting housing use associated with permanent residents (PRs) and non-permanent residents (NPRs) under alternative immigration scenarios. The framework applies observed per capita housing use rates from the Census of Population to estimate incremental housing use by tenure over time.
    Release date: 2026-04-24

  • Surveys and statistical programs – Documentation: 11-633-X2026001
    Description: This report defines key concepts related to area-level analysis and introduces area-level measures developed and utilized at Statistics Canada for health analysis. It also provides a decision-making framework and practical recommendations to help researchers select appropriate methods. The goal is to guide readers on when area-level analysis is appropriate and what type of area-level measure is suitable to achieve research objectives.
    Release date: 2026-03-05

  • Public use microdata: 89F0002X
    Description: The SPSD/M is a static microsimulation model designed to analyse financial interactions between governments and individuals in Canada. It can compute taxes paid to and cash transfers received from government. It is comprised of a database, a series of tax/transfer algorithms and models, analytical software and user documentation.
    Release date: 2026-02-12

  • Articles and reports: 13-604-M2026001
    Description: This documentation outlines the methodology used to develop the Distributions of household economic accounts published in January 2026 for the reference years 2010 to 2025. It describes the framework and the steps implemented to produce distributional information aligned with the National Balance Sheet Accounts and other national accounts concepts. It also includes a report on the quality of the estimated distributions.
    Release date: 2026-01-29

  • Articles and reports: 12-001-X202500200001
    Description: Nested error regression models are commonly used to incorporate unit specific auxiliary variables to improve small area estimates. When the mean structure of the model is misspecified, the design-based mean squared prediction error (MSPE) of Empirical Best Linear Unbiased Predictors (EBLUP) generally increases. The Observed Best Prediction (OBP) method has been proposed with the intent to improve on the design-based MSPE over EBLUP. In this paper, we conduct a Monte Carlo simulation experiments to understand the effect of misspsecification of mean structures on different small area estimators. Our findings suggest that the OBP using unit-level auxiliary variables does not outperform the EBLUP in terms of design-based MSPE, unless the number of small areas m is extremely large. Conversely, the performance of OBP significantly improves when area-level auxiliary variables are employed. This paper includes both analytical and numerical evidence to demonstrate these observations, providing practical insights for addressing model misspecification in small area estimation (SAE).
    Release date: 2025-12-23

  • Articles and reports: 12-001-X202500200002
    Description: This study examines interviewer effects on household nonresponse in three waves of the Household Finance and Consumption Survey (HFCS) in Austria using a multilevel model. Addressing nonresponse at its source is crucial for maintaining survey data quality and representativeness. Our findings indicate that the variation in response behavior explained by interviewer effects decreased from about one-third in the first wave to 7% in the third wave. Effective interviewers tend to have a university degree, be married, homeowners, and have a larger workload. Additionally, higher mean wages in the household’s municipality negatively affect survey participation. These insights suggest targeted interviewer selection and training strategies to improve response rates.
    Release date: 2025-12-23
Data (10)

Data (10) ((10 results))

  • Public use microdata: 89F0002X
    Description: The SPSD/M is a static microsimulation model designed to analyse financial interactions between governments and individuals in Canada. It can compute taxes paid to and cash transfers received from government. It is comprised of a database, a series of tax/transfer algorithms and models, analytical software and user documentation.
    Release date: 2026-02-12

  • Profile of a community or region: 46-26-0002
    Description: The National Address Register (NAR) is a list of commercial and residential addresses in Canada that are extracted from Statistics Canada's Building Register and deemed non-confidential.
    Release date: 2025-12-19

  • Table: 89-26-0006
    Description: PASSAGES is an open-source dynamic microsimulation model aimed at supporting policy analysis and research relating to Canadian retirement income system outcomes at the individual and family level. The publicly available version includes a synthetic starting database, a model, and documentation. A confidential starting database is also available.
    Release date: 2025-03-12

  • Data Visualization: 71-607-X2020010
    Description: The Canadian Statistical Geospatial Explorer empowers users to discover geo enabled data holdings of Statistics Canada at various levels of geography including at the neighbourhood level. Users are able to visualize, thematically map, spatially explore and analyze, export and consume data in various formats. Users can also view the data superimposed on satellite imagery, topographic and street layers.
    Release date: 2024-08-21

  • Table: 11-10-0074-01
    Geography: Census tract
    Frequency: Occasional
    Description:

    The divergence index (D-index) describes the degree that families with different income levels are mixing together in neighbourhoods. It compares neighbourhood (census tract, CT) discrete income distributions to a base distribution, which is the income quintiles of the neighbourhood’s census metropolitan area (CMA).

    Release date: 2020-06-22

  • Data Visualization: 71-607-X2019010
    Description: The Housing Data Viewer is a visualization tool that allows users to explore Statistics Canada data on a map. Users can use the tool to navigate, compare and export data.
    Release date: 2019-10-30

  • Table: 53-500-X
    Description:

    This report presents the results of a pilot survey conducted by Statistics Canada to measure the fuel consumption of on-road motor vehicles registered in Canada. This study was carried out in connection with the Canadian Vehicle Survey (CVS) which collects information on road activity such as distance traveled, number of passengers and trip purpose.

    Release date: 2004-10-21

  • Table: 13-220-X
    Description: In the 1997 edition, new and revised benchmarks were introduced for 1992 and 1988. The indicators are used to monitor supply, demand and employment for tourism in Canada on a timely basis. The annual tables are derived using the National Income and Expenditure Accounts (NIEA) and various industry and travel surveys. Tables providing actual data and percentage changes, for seasonally adjusted current and constant price estimates are included. In addition, an analytical section provides graphs, and time series of first differences, percentage changes, and seasonal factors for selected indicators. Data are published from 1987 and the publication will be available on the day of release. New data are included in the demand tables for non-tourism commodities produced by non-tourism industries and in the employment tables covering direct tourism employment generated by non-tourism industries. This product was commissioned by the Canadian Tourism Commission to provide annual updates for the Tourism Satellite Account.
    Release date: 2003-01-08

  • Table: 11-516-X
    Description:

    The second edition of Historical statistics of Canada was jointly produced by the Social Science Federation of Canada and Statistics Canada in 1983. This volume contains about 1,088 statistical tables on the social, economic and institutional conditions of Canada from the start of Confederation in 1867 to the mid-1970s. The tables are arranged in sections with an introduction explaining the content of each section, the principal sources of data for each table, and general explanatory notes regarding the statistics. In most cases, there is sufficient description of the individual series to enable the reader to use them without consulting the numerous basic sources referenced in the publication.

    The electronic version of this historical publication is accessible on the Internet site of Statistics Canada as a free downloadable document: text as HTML pages and all tables as individual spreadsheets in a comma delimited format (CSV) (which allows online viewing or downloading).

    Release date: 1999-07-29

  • Table: 82-567-X
    Description:

    The National Population Health Survey (NPHS) is designed to enhance the understanding of the processes affecting health. The survey collects cross-sectional as well as longitudinal data. In 1994/95 the survey interviewed a panel of 17,276 individuals, then returned to interview them a second time in 1996/97. The response rate for these individuals was 96% in 1996/97. Data collection from the panel will continue for up to two decades. For cross-sectional purposes, data were collected for a total of 81,000 household residents in all provinces (except people on Indian reserves or on Canadian Forces bases) in 1996/97.

    This overview illustrates the variety of information available by presenting data on perceived health, chronic conditions, injuries, repetitive strains, depression, smoking, alcohol consumption, physical activity, consultations with medical professionals, use of medications and use of alternative medicine.

    Release date: 1998-07-29
Analysis (2,036)

Analysis (2,036) (20 to 30 of 2,036 results)

  • Articles and reports: 75-005-M2025001
    Description: Since 2010, engaging Canadians to participate in the LFS has become more challenging due to a variety of social and technological changes. The decline in the LFS response rate accelerated in 2020, exacerbated by public health measures during the COVID-19 pandemic. This technical paper presents preliminary results of two collection initiatives implemented using an online first strategy to improve the LFS response rates by confirming respondent contact information and expanding the availability of online response. Through these and other planned initiatives, Statistics Canada is working to ensure that the LFS estimates continue to provide an accurate and representative portrait of the Canadian labour market.
    Release date: 2025-10-21

  • Articles and reports: 18-001-X2025001
    Description: This paper brings the analysis of business cluster to a more granular geographic scale by developing a methodology for identifying business clusters at the neighborhood level. The proposed method identifies clusters of businesses at the DB level, which is one of the most granular spatial units of analysis defined by Statistics Canada. The method is developed with an application to four census metropolitan areas (CMAs) of different sizes and for different industry cluster specifications, including simple 2-digit North American Industry Classification System (NAICS) groups as well as industry clusters resulting from groupings of NAICS codes, as defined by Delgado et al. (2014).
    Release date: 2025-10-10

  • Journals and periodicals: 12-206-X
    Description: This report summarizes the annual achievements of the Methodology Research and Development Program (MRDP) sponsored by the Modern Statistical Methods and Data Science Branch at Statistics Canada. This program covers research and development activities in statistical methods with potentially broad application in the agency’s statistical programs; these activities would otherwise be less likely to be carried out during the provision of regular methodology services to those programs. The MRDP also includes activities that provide support in the application of past successful developments in order to promote the use of the results of research and development work. Selected prospective research activities are also presented.
    Release date: 2025-10-10

  • Articles and reports: 11-522-X202500100001
    Description: Synthetic data generation (SDG) is increasingly applied across sectors for privacy-preserving data sharing, de-biasing and augmentation. Each use case requires a distinct set of evaluation metrics that must account for the stochasticity of the SDG process: membership and attribute disclosure vulnerability are critical for privacy; fidelity and downstream task utility apply more broadly; and fairness and diversity are relevant for de-biasing and augmentation, respectively. Presenting accumulated evidence and through exemplar case studies, it is shown that SDG can perform well across many of these use cases and our key learnings from our experiences with synthetic health data are shared.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100002
    Description: Under the consumer-merchant bipartite network, we apply the indirect sampling approach to estimate merchant payment acceptance through a consumer payment diary. The records of in-person transactions in the consumer diary provide both the merchant sample via consumer-merchant linkages, and the merchant acceptance via consumers' responses on methods of payments used and accepted. Among merchants receiving multiple transactions during the period of the diary, we show that the derived payment acceptance from the consumer reporting is high quality in terms of very few conflicts between usage and perception, and within perceptions. Therefore, consumers are leveraged to be both sampling and reporting units in our indirect sampling application to eliminate merchant response burden. Furthermore, the necessity to proceed to weight adjustment to account for the non-recorded-merchant bias due to the relatively shorter duration of the diary (i.e., 3 days) is shown. Finally, these indirect sampling estimates are compared to the ones from a direct sampling survey, and it is found that the results are aligning well.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100003
    Description: In-person data collection is critical for the success of many large government-sponsored surveys. Despite response rate declines and increasing costs, the mode remains the gold standard for meeting the most rigorous survey requirements for federal survey programs, particularly as part of a multimode data collection strategy (Schober, 2018). However, over the last ten years critical labor market and workforce changes, exacerbated by the pandemic, have made in-person data collection efforts prohibitive for all but the largest survey organizations. Shifting ideas about job flexibility and job satisfaction alongside the increasingly technical role and demanding nature of the job have impacted recruitment and retention for survey organizations across the U.S. and Europe (Charman et al., 2024). The trends in U.S. field data collector employment are summarized and it is outlined that there are promising practices in recruiting and retaining high quality field data collectors. Additionally, broader ways to structure the field data collector labor force for continued success are considered, including supplementing field data collection with multimode alternatives such as video interviewing and updating value propositions for respondents.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100004
    Description: The Survey of Household Spending (SHS) conducted by Statistics Canada collects paper diaries and shopping receipts as a source of household expenditure data. An auto-capturing algorithm was created for SHS 2023 to reduce statistical clerks' manual work of extracting important information from scanned receipts of common store brands. The algorithm used Tesseract optical character recognition (OCR) to extract text characters from images of receipts, and it identified store and product entities using regular expressions, also known as regex. The goal of this study was to enhance the current auto-capture algorithm by experimenting with more advanced OCR and machine learning methods. As a result, PaddleOCR, an open-source OCR toolkit, was selected as the new default OCR engine due to its overall performance in recognizing texts, especially digits, accurately across receipts of various qualities. Additionally, entity classifiers based on support vector machines were trained on historical SHS records and existing regex patterns. By using classifiers to categorize different elements present on receipts instead of relying solely on regex patterns, product and store recognition improved. It is expected that this new algorithm will be used for SHS 2025 to improve the auto-capture quality and reduce the manual burden associated with capturing receipt variables.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100005
    Description: The Physical Flow Account for Plastic Material (PFAPM) aims to enhance environmental-economic analysis by tracking plastic material flows within the Canadian economy. To help streamline this complex process, the project leveraged advanced natural language processing (NLP) such as large language models (LLM) techniques to automate sector classification and summarize the impact of COVID-19 from company reports. By integrating machine learning models and retrieval-augmented generation (RAG) methods, the manual workload was significantly reduced, improving data analysis efficiency, and leading to higher quality insights.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100006
    Description: Small area estimation is frequently used to produce estimates at a disaggregated level where direct survey estimation does not have sufficient sample to produce precise estimates. Often this is done using the area-level Fay-Herriot model, by assuming the direct estimates are independent under the design and have a known variance, and applying a smoothing process to the variance estimates of the direct estimates to better meet that last assumption. It is not rare that small area estimates are benchmarked/raked to aggregated level direct estimates. This article shows that wrongly assuming independence can have a big impact on the MSE of the raked estimates. Values of the covariances between direct estimates are thus required for good point and MSE estimates. Getting good estimates of those covariances is difficult given the small sample sizes in some areas. An original way of deriving values for those covariances, by reverse-engineering a hypothetical raking process, is presented.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100007
    Description: This paper employs the Pseudo Maximum Likelihood (PML) estimator to the non-probability two-phase sampling when relevant auxiliary information is available from both probability survey sample and non-probability survey sample. To accommodate various weight adjustments and estimates variance beyond totals and means such as medians and quantiles, a simplified pseudo-population bootstrap procedure is proposed to approximately estimate the second-phase variance. Specifically, the simplification ignores the second phase sampling variability (i.e., treated as fixed, while in fact it is random), if the first-phase sampling fraction of the non-probability sample is negligible. Using the Bank of Canada 2020 Cash Alternative Survey Wave 2, the performance of the proposed method is compared to alternative methods, which either do not explicitly model the selection probability (i.e., raking) or ignore the valuable information from Phase 1 (i.e., Phase-2-Only). The results show that the PML-based approach performs better than raking and Phase-2-Only estimates in terms of reducing the selection bias for both phases' payment-related variables, especially for the low-response youth group. Estimated variances of the PML-based estimates are stable.

    Release date: 2025-09-08
Reference (380)

Reference (380) (370 to 380 of 380 results)

  • Surveys and statistical programs – Documentation: 13-604-M1995032
    Description:

    The International System of National Accounts 1993 (1993 SNA) was prepared and published under the auspices of the Inter-secretariat Working Group on National Accounts. This working group consists of the Statistical Office of the European Communities, the International Monetary Fund, the Organisation of Economic Co-operation and Development, the Statistical Division and regional commissions of the United Nations Secretariat, and the World Bank. The adoption of this document for universal implementation was unanimously recommended to the United Nations Economic and Social Council by its Statistical Commission at the 27th session, held in New York from February 22 to March 3, 1993. The plan for implementing the 1993 SNA system, however, does not seem to be as well organized as its production was.

    Very detailed comments have been made on this document in two papers entitled 'The 1993 International System of National Accounts vis-à-vis The Canadian System of National Accounts,' and 'The 1993 International System of National Accounts and the Canadian Input-Output Tables.' In a summary fashion, the present paper highlights certain important areas where the Canadian System of National Accounts (CSNA) will need to revise its practices to conform to the 1993 SNA. The reader is encouraged to refer to these two papers for further details.

    Release date: 1995-11-30

  • Surveys and statistical programs – Documentation: 13-604-M1995034
    Description:

    One of the most significant financial market trends is the increased use of derivative instruments. Across the entire investment spectrum, from private investors to major banks and large institutional fund managers, the use of derivative products is becoming encompassing. Derivatives can be broadly defined as secondary assets, the value of which changes in concert with price movements of a related or underlying primary asset. These instruments may be divided into four broad categories: futures, forwards, options and swaps. Trading on established exchanges, and very active in over-the-counter markets, derivative contracts have become fundamental tools in both domestic and international finance.

    Release date: 1995-11-30

  • Surveys and statistical programs – Documentation: 13-604-M1993023
    Description:

    This paper reports the results of a survey of national Income and Expenditure Accounts (IEA) release date practices as reported by national statistical bureaus. This international survey was conducted by the author between January and March 1993 by means of a questionnaire mailed to statisticians of several countries.

    Respondents to the survey were asked on what date their preliminary IEA estimates for each of the four quarters of the 1991 calendar year were officially released. They were also asked to indicate the dates on which each of the subsequent four revised sets of estimates were released. To avoid the possibility of unwarranted generalizations from a single year's experience, respondents were asked whether 1991 was a typical year or if there were special circumstances that affected the release dates in this particular period. Finally, general information was sought on each country's official revision policy.

    Release date: 1993-07-01

  • Surveys and statistical programs – Documentation: 13-604-M1991014
    Description:

    Currently, one measure of real gross domestic product (GDP) at market prices is published by Statistics Canada. It is a fixed weighted index, and the weights are from the base year, 1986. In the first quarter of 1990, alternate formulations of real GDP were reviewed in an article released in this publication. One of the alternatives discussed in the article was the Chain Volume Indexes.

    The purpose of this article was to introduce a new set of indexes into the Income and Expenditure Accounts. The indexes include quarterly re-weighted Chain Volume Indexes and annually re-weighted Chain Volume Indexes of GDP, excluding the value of physical change in inventories.

    Release date: 1991-08-31

  • Surveys and statistical programs – Documentation: 13-604-M1991011
    Description:

    The Canadian System of National Accounts (CSNA) has evolved considerably over the past four decades. This article presents a brief account of the relationship between this system, as it stands today, and the international standard for national accounting, which has been established by the United Nations. The major similarities and differences between the two systems are highlighted. The paper then goes on to briefly summarize the present state of discussions concerning revisions to the international SNA standard.

    Release date: 1990-11-30

  • Surveys and statistical programs – Documentation: 13-604-M1990006
    Description:

    Gross domestic product (GDP) is a key measure in the System of National Accounts, as well as an indispensable tool for economic analysis. This variable is available in current dollars or, in other words, expressed in the prices of the period to which each estimate applies. Two distinct parts exist within this current dollar measure: a volume component and a price component. This article focusses on the measure of GDP which expresses the volume of transactions in the economy (i.e., GDP expressed in real terms).

    Release date: 1990-06-20

  • Surveys and statistical programs – Documentation: 5190
    Description: The Data Inventory Project is a government-wide stock-taking of federal data holdings within departments that are part of the Policy Research Data Group to determine the broad range of data holdings that could address the medium to longer-term priorities. The inventory is comprised of the metadata on datasets held within the various departments and will be linked, when possible, to specific key policy issues.

  • Surveys and statistical programs – Documentation: 5192
    Description: The purpose of this pilot is to provide Statistics Canada with information on key aspects of E-questionnaire data collection as well as measuring the impact of Internet collection on estimates.

  • Surveys and statistical programs – Documentation: 5241
    Description: The SRGD is conducting a Global Positioning System (GPS) and digital mapping test to improve Statistic Canada's rural dwelling inventory by collecting dwelling identifiers to be used by field collection staff. In rural areas dwelling identification can be difficult where there is an absence of civic style addresses. The test is evaluating alternative methods for dwelling identification including the collection of GPS coordinates and digital photos using a mapping application and a digital tablet

  • Surveys and statistical programs – Documentation: 8014
    Description: This study will be used to determine which method would be the most effective to select households in Canada for any given survey that is conducted by Statistics Canada.