Statistical methods

Skip to filters. View results.

Key indicators

Changing any selection will automatically update the page content.

Selected geographical area:Canada

Selected geographical area:Newfoundland and Labrador

Selected geographical area:Prince Edward Island

Selected geographical area:Nova Scotia

Selected geographical area:New Brunswick

Selected geographical area:Quebec

Selected geographical area:Ontario

Selected geographical area:Manitoba

Selected geographical area:Saskatchewan

Selected geographical area:Alberta

Selected geographical area:British Columbia

Selected geographical area:Yukon

Selected geographical area:Northwest Territories

Selected geographical area:Nunavut

Sort Help
entries

Results

All (2,478)

All (2,478) (2,470 to 2,480 of 2,478 results)

  • Articles and reports: 12-001-X197500100006
    Description: This paper summarizes the results of a project conducted to study non-interviews in the Canadian Labour Force Survey. Temporarily absent (32.7%), no-one-home (31.4%), and refusal (25.5%) are the major components of non-response. The impact of these components to the total non-response in Surveys from July 1972 to June 1973 is discussed in detail.

    A detailed analysis of refusal households showed that existing field follow-up procedures were not quite successful in reducing the refusal component. As expected, non-response was found to be related to the length of tenure of households in the sample. Non-response among households enumerated for the first time was generally higher than those households already in the sample.
    Release date: 1975-06-16

  • Articles and reports: 12-001-X197500100007
    Description: There are several multi-stage sample designs in various countries, such as the Current Population Survey in U.S.A., Labour Survey in Sweden, and the General Household Survey in United Kingdom. From each survey, estimated totals of Employed, Unemployed, and other characteristics may be obtained.

    The Canadian Labour Force Survey is a monthly household survey in which the dwelling is the ultimate unit of sampling requiring two to four stages of selection. Each province is split up into strata and sampling units at various stages so that the sampling variance contains up to four components of variance whose actual formulae and estimation formulae are derived, utilizing those formerly derived by Yates and Grundy [12]. Ratio estimation is employed and the formulas are modified accordingly. To analyze the components of variance, it is necessary to express them in terms of components of sampling ratios and the sizes of sampling units at the various stages at provincial and national levels and approximate variance functions are thus derived.
    Release date: 1975-06-16

  • Articles and reports: 12-001-X197500100008
    Description: The need for regular up-dating of the selection probabilities in continuous surveys is emphasized in this paper. A simple strategy (selection method for the initial sample with the revision procedure) is presented and its application to the Canadian Labour Force Survey is discussed.
    Release date: 1975-06-16

  • Articles and reports: 12-001-X197500100009
    Description: This paper discusses several reinterview techniques and their use in relation to Response Variance, Response Bias, Interviewer Training, and the monitoring of various elements of the interview process. Using the Canadian Labour Force Survey as a case study the article describes how reinterview techniques were developed as the survey evolved and briefly describes the strategy being followed in the present reinterview program.
    Release date: 1975-06-16

  • Surveys and statistical programs – Documentation: 5190
    Description: The Data Inventory Project is a government-wide stock-taking of federal data holdings within departments that are part of the Policy Research Data Group to determine the broad range of data holdings that could address the medium to longer-term priorities. The inventory is comprised of the metadata on datasets held within the various departments and will be linked, when possible, to specific key policy issues.

  • Surveys and statistical programs – Documentation: 5192
    Description: The purpose of this pilot is to provide Statistics Canada with information on key aspects of E-questionnaire data collection as well as measuring the impact of Internet collection on estimates.

  • Surveys and statistical programs – Documentation: 5241
    Description: The SRGD is conducting a Global Positioning System (GPS) and digital mapping test to improve Statistic Canada's rural dwelling inventory by collecting dwelling identifiers to be used by field collection staff. In rural areas dwelling identification can be difficult where there is an absence of civic style addresses. The test is evaluating alternative methods for dwelling identification including the collection of GPS coordinates and digital photos using a mapping application and a digital tablet

  • Surveys and statistical programs – Documentation: 8014
    Description: This study will be used to determine which method would be the most effective to select households in Canada for any given survey that is conducted by Statistics Canada.
Data (10)

Data (10) ((10 results))

  • Public use microdata: 89F0002X
    Description: The SPSD/M is a static microsimulation model designed to analyse financial interactions between governments and individuals in Canada. It can compute taxes paid to and cash transfers received from government. It is comprised of a database, a series of tax/transfer algorithms and models, analytical software and user documentation.
    Release date: 2026-02-12

  • Profile of a community or region: 46-26-0002
    Description: The National Address Register (NAR) is a list of commercial and residential addresses in Canada that are extracted from Statistics Canada's Building Register and deemed non-confidential.
    Release date: 2025-12-19

  • Table: 89-26-0006
    Description: PASSAGES is an open-source dynamic microsimulation model aimed at supporting policy analysis and research relating to Canadian retirement income system outcomes at the individual and family level. The publicly available version includes a synthetic starting database, a model, and documentation. A confidential starting database is also available.
    Release date: 2025-03-12

  • Data Visualization: 71-607-X2020010
    Description: The Canadian Statistical Geospatial Explorer empowers users to discover geo enabled data holdings of Statistics Canada at various levels of geography including at the neighbourhood level. Users are able to visualize, thematically map, spatially explore and analyze, export and consume data in various formats. Users can also view the data superimposed on satellite imagery, topographic and street layers.
    Release date: 2024-08-21

  • Table: 11-10-0074-01
    Geography: Census tract
    Frequency: Occasional
    Description:

    The divergence index (D-index) describes the degree that families with different income levels are mixing together in neighbourhoods. It compares neighbourhood (census tract, CT) discrete income distributions to a base distribution, which is the income quintiles of the neighbourhood’s census metropolitan area (CMA).

    Release date: 2020-06-22

  • Data Visualization: 71-607-X2019010
    Description: The Housing Data Viewer is a visualization tool that allows users to explore Statistics Canada data on a map. Users can use the tool to navigate, compare and export data.
    Release date: 2019-10-30

  • Table: 53-500-X
    Description:

    This report presents the results of a pilot survey conducted by Statistics Canada to measure the fuel consumption of on-road motor vehicles registered in Canada. This study was carried out in connection with the Canadian Vehicle Survey (CVS) which collects information on road activity such as distance traveled, number of passengers and trip purpose.

    Release date: 2004-10-21

  • Table: 13-220-X
    Description: In the 1997 edition, new and revised benchmarks were introduced for 1992 and 1988. The indicators are used to monitor supply, demand and employment for tourism in Canada on a timely basis. The annual tables are derived using the National Income and Expenditure Accounts (NIEA) and various industry and travel surveys. Tables providing actual data and percentage changes, for seasonally adjusted current and constant price estimates are included. In addition, an analytical section provides graphs, and time series of first differences, percentage changes, and seasonal factors for selected indicators. Data are published from 1987 and the publication will be available on the day of release. New data are included in the demand tables for non-tourism commodities produced by non-tourism industries and in the employment tables covering direct tourism employment generated by non-tourism industries. This product was commissioned by the Canadian Tourism Commission to provide annual updates for the Tourism Satellite Account.
    Release date: 2003-01-08

  • Table: 11-516-X
    Description:

    The second edition of Historical statistics of Canada was jointly produced by the Social Science Federation of Canada and Statistics Canada in 1983. This volume contains about 1,088 statistical tables on the social, economic and institutional conditions of Canada from the start of Confederation in 1867 to the mid-1970s. The tables are arranged in sections with an introduction explaining the content of each section, the principal sources of data for each table, and general explanatory notes regarding the statistics. In most cases, there is sufficient description of the individual series to enable the reader to use them without consulting the numerous basic sources referenced in the publication.

    The electronic version of this historical publication is accessible on the Internet site of Statistics Canada as a free downloadable document: text as HTML pages and all tables as individual spreadsheets in a comma delimited format (CSV) (which allows online viewing or downloading).

    Release date: 1999-07-29

  • Table: 82-567-X
    Description:

    The National Population Health Survey (NPHS) is designed to enhance the understanding of the processes affecting health. The survey collects cross-sectional as well as longitudinal data. In 1994/95 the survey interviewed a panel of 17,276 individuals, then returned to interview them a second time in 1996/97. The response rate for these individuals was 96% in 1996/97. Data collection from the panel will continue for up to two decades. For cross-sectional purposes, data were collected for a total of 81,000 household residents in all provinces (except people on Indian reserves or on Canadian Forces bases) in 1996/97.

    This overview illustrates the variety of information available by presenting data on perceived health, chronic conditions, injuries, repetitive strains, depression, smoking, alcohol consumption, physical activity, consultations with medical professionals, use of medications and use of alternative medicine.

    Release date: 1998-07-29
Analysis (2,036)

Analysis (2,036) (40 to 50 of 2,036 results)

  • Articles and reports: 11-522-X202500100018
    Description: The Child Poverty Reduction Act (2018) outlines a need for the New Zealand Government to set three- and ten-yearly persistent child poverty reduction targets come end of 2024. In the absence of longitudinal survey data, a survey-administrative data hybrid method that will facilitate the production of these reduction targets and official estimates of persistent child poverty once reporting is required for the 2025/2026 financial year onwards is outlined. This hybrid approach leverages off the cross-sectional Household Economic Survey (HES), administrative-based beneficiary's family data, and recent advances developed for the construction of households within the Administrative Population Census (APC) at Statistics New Zealand. With increasing data collection challenges due to rising non-response and costs, this survey-admin hybrid method represents an alternative to longitudinal survey data collection, ensuring ongoing sustainable and quality statistics to produce persistent child poverty estimates.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100019
    Description: Accurate and efficient record linkage is crucial for maintaining a comprehensive and current Statistical Business Register (SBR) at Statistics Canada. Linking external business lists to the SBR by name presents computational and methodological challenges, especially as data volumes grow. This paper describes a scalable methodology that employs blocking techniques to constrain the computational search space and integrates multiple similarity measures—from edit distances and n-gram overlaps to embedding-based methods using Sentence-BERT (SBERT)—to identify likely matches. By combining simple character-level comparisons with more advanced semantic embedding methods, the approach can adapt to various naming conventions and complexities. While it does not guarantee superior accuracy in all circumstances, it offers a pragmatic balance between computational feasibility and linkage quality.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100020
    Description: At Statistics Canada, many data sets are linked with quasi-identifiers such as the first name, last name, or address. In such cases, linkage errors are a potential concern and must be measured. In that regard, previous studies have shown that the evaluation may be based on modeling the number of links from a given record while accounting for all the interactions among the linkage variables and dispensing with clerical reviews, so long as the decision to link two records does not involve other records. In this communication, the methodology is adapted for a class of practical strategies, which violate this constraint by linking the records in consecutive waves, where a given wave links a subset of the records that are not linked in previous waves. In particular, the linkage may be based on a deterministic wave followed by a probabilistic one.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100021
    Description: Optimal threshold selection is a critical challenge in probabilistic linkage, with significant implications for the accuracy and reliability of linked datasets. This paper analyzes the performance of the neighbour model, a recently proposed error model which models linkage errors by the number of links from each record. Three threshold selection algorithms utilizing the neighbour model were assessed, highlighting the strengths and limitations of each. Their performance was assessed through simulation studies, which demonstrated that methods using the neighbour model achieved lower relative bias compared to two established methods for threshold selection. Additionally, the practical utility was validated through goodness-of-fit tests conducted on four agricultural datasets, showing the potential of the model for use in real-world applications.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100022
    Description: In Canada, T1 Tax forms are used to report personal income, whether earned as an employee or through self-employment. Income from self-employment, or "T1 Business Income" is reported by sole proprietorships or partnerships. A T1 partnership involves two or more legal entities jointly filing for a shared business. T1 business data is received as individual filings, meaning partnerships are received separately for each partner. Internal record linkage within the T1 business database is performed to identify partnerships and prevent overcoverage within the final population of T1 businesses. This new T1 partnership identification process takes advantage of newer algorithms, such as DBSCAN numerical clustering fuzzy matching, to identify internal linkages. Graph theory is used to construct the list of partnerships from the row-pairs identified in the linkage process.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100023
    Description: The latest Canadian Census Health and Environment Cohort (CanCHEC) continues a series of population-based microdata linkages focused on population health research by demographic, social and economic characteristics. The 2021 CanCHEC consists of 95.5% of the 2021 Census long-form sample survey records. The records of survey respondents that could not be linked to the Derived Record Depository and those presumed to be duplicates account for the remaining 4.5%. Linkage-adjusted main and replicate weights allow researchers to estimate and evaluate the variance of summary measures about population health in the presence of missed linked pairs to better understand the experiences of diverse population groups.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100024
    Description: This paper explores a vision for the future of National Statistics Offices (NSOs). It analyses the history and role of NSOs before exploring current and future challenges and opportunities for NSOs, before finally outlining a future where NSOs become more agile, open, and collaborative while maintaining their high level of trust in the community, thereby allowing them to fulfil their new role as data stewards in a rapidly evolving data landscape.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100025
    Description: National statistical offices have increasingly adopted machine learning (ML) for its potential to improve survey estimates. ML techniques offer significant advantages, notably the ability to manage high-dimensional data and to capture complex, nonlinear relationships, thereby enhancing the overall quality of survey statistics. In this article, following the approach of Chernozhukov et al. (2018), we describe a double debiased machine learning framework that enables valid statistical inference when imputed estimators are derived from ML procedures. Simulation results suggest that the proposed framework performs well in a wide range of scenarios.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100026
    Description: In 2022, Canada became the first country to release statistical information about its transgender and non-binary populations based on census data. Moreover, following a 2018 government-wide policy direction, Statistics Canada's surveys have been collecting and disseminating information about gender by default rather than sex at birth. Due to the small size of the transgender and non-binary populations, disseminating safe statistical information about them at detailed geographical levels poses a challenge.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100027
    Description: Several challenges encountered when constructing U.S. administrative record-based (AR-based) population estimates for 2020 are identified. They include locational accuracy, person coverage and its consistency over time, filtering out non-residents and people not alive on the reference date, uncovering missing links across person and address records, and predicting demographic characteristics. Several ways to address these issues are discussed. Regression results illustrate how the challenges and solutions affect the AR-based county population estimates.
    Release date: 2025-09-08
Reference (380)

Reference (380) (310 to 320 of 380 results)

  • Notices and consultations: 13F0026M1999004
    Description:

    During September and October 1997, the Questionnaire Design Resource Centre (QDRC) completed 10 focus groups and 4 in-depth interviews with respondents and 6 debriefing sessions with interviewers in a test of the proposed questionnaires and data collection methodology for the 1998 Asset and Debt Survey (now called the Survey of Financial Security, to be done in 1999).

    The main goals of the testing were: to evaluate the data collection methodology and survey instruments (including the introductory materials [guide] and questionnaires [Part 1: background information about family members, Part 2: questions on assets and debts]); to identify problem areas; to make recommendations to ensure that the final survey instruments are respondent-friendly and interview-friendly, that the questionnaires can be easily understood and accurately completed; and finally, to investigate how respondents recall information.

    This report summarizes the highlights of the study, including the recommendations based on the findings of the focus groups, in-depth interviews and debriefing sessions, as well as those from the experience of the QDRC in carrying out similar studies for other household surveys.

    Release date: 1999-03-23

  • Geographic files and documentation: 92F0138M1993001
    Geography: Canada
    Description:

    The Geography Divisions of Statistics Canada and the U.S. Bureau of the Census have commenced a cooperative research program in order to foster an improved and expanded perspective on geographic areas and their relevance. One of the major objectives is to determine a common geographic area to form a geostatistical basis for cross-border research, analysis and mapping.

    This report, which represents the first stage of the research, provides a list of comparable pairs of Canadian and U.S. standard geographic areas based on current definitions. Statistics Canada and the U.S. Bureau of the Census have two basic types of standard geographic entities: legislative/administrative areas (called "legal" entities in the U.S.) and statistical areas.

    The preliminary pairing of geographic areas are based on face-value definitions only. The definitions are based on the June 4, 1991 Census of Population and Housing for Canada and the April 1, 1990 Census of Population and Housing for the U.S.A. The important aspect is the overall conceptual comparability, not the precise numerical thresholds used for delineating the areas.

    Data users should use this report as a general guide to compare the census geographic areas of Canada and the United States, and should be aware that differences in settlement patterns and population levels preclude a precise one-to-one relationship between conceptually similar areas. The geographic areas compared in this report provide a framework for further empirical research and analysis.

    Release date: 1999-03-05

  • Surveys and statistical programs – Documentation: 71F0023X1999001
    Description:

    This paper is an overview of the activities undertaken by Statistics Canada over the past several decades in the field of measuring and valuing unpaid work in all of its many forms. It was first prepared in the early 1990s when the Agency's accomplishments in the field of unpaid work were not as widely known as Statistics Canada would have liked. With each significant new achievement of the Agency, this note has been updated and further updates will be produced in step with the Agency's continuing outputs in this important area.

    Release date: 1999-01-28

  • Surveys and statistical programs – Documentation: 75F0002M1998002
    Description:

    This document presents the questions, responses and interview flow for the Contact and Demographic portions of the Survey of Labour and Income Dynamics (SLID) interviews.

    Release date: 1998-12-30

  • Surveys and statistical programs – Documentation: 75F0002M1998003
    Description:

    This paper provides a written approximation of the 1998 Survey of Labour and Income Dynamics (SLID) labour interview questionnaire.

    Release date: 1998-12-30

  • Surveys and statistical programs – Documentation: 75F0002M1998004
    Description:

    This paper presents the questions, possible responses and question flows for the 1998 Survey of Labour and Income Dynamics (SLID) preliminary questionnaire.

    Release date: 1998-12-30

  • Surveys and statistical programs – Documentation: 75F0002M1998005
    Description:

    This article gives an overview of the main goals of the Survey of Labour and Income Dynamics (SLID) and the methodology used.

    Release date: 1998-12-30

  • Surveys and statistical programs – Documentation: 75F0002M1998006
    Description:

    This paper describes the collection method and content of the 1999 Survey of Labour and Income Dynamics (SLID) income interview.

    Release date: 1998-12-30

  • Surveys and statistical programs – Documentation: 75F0002M1998012
    Description:

    This paper looks at the work of the task force responsible for reviewing Statistics Canada's household and family income statistics programs, and at one of associated program changes, namely, the integration of two major sources of annual income data in Canada, the Survey of Consumer Finances (SCF) and the Survey of Labour and Income Dynamics (SLID).

    Release date: 1998-12-30

  • Surveys and statistical programs – Documentation: 61F0041M1998003
    Description:

    This on-line product describes the personalization of the long-form questionnaires of Canada's Annual Survey of Manufactures (ASM). Personalization was motivated by the desire to reduce respondent burden. Prior to personalization, long-form questionnaires were the same for all the establishments of a given 4-digit SIC industry. Each questionnaire contained a list comprising almost all the commodities likely to be used as inputs or produced as outputs by that industry. For the typical establishment, only a small subset of the commodities listed was applicable. Personalization involved tailoring those lists to each individual establishment, based on the previous reporting of that same establishment.

    After first defining terms and then providing some quantification of the need for personalization, the paper details a number of the prerequisites - an algorithm for commodity selection, a set of stand-alone commodity descriptions, and an automated questionnaire production system. The paper next details a number of the impacts of personalization - and does so in terms of response burden, loss of information, and automation. The paper concludes with a summary and some recommendations.

    Release date: 1998-04-03