Statistical methods

Skip to filters. View results.

Key indicators

Changing any selection will automatically update the page content.

Selected geographical area:Canada

Selected geographical area:Newfoundland and Labrador

Selected geographical area:Prince Edward Island

Selected geographical area:Nova Scotia

Selected geographical area:New Brunswick

Selected geographical area:Quebec

Selected geographical area:Ontario

Selected geographical area:Manitoba

Selected geographical area:Saskatchewan

Selected geographical area:Alberta

Selected geographical area:British Columbia

Selected geographical area:Yukon

Selected geographical area:Northwest Territories

Selected geographical area:Nunavut

Sort Help
entries

Results

All (2,478)

All (2,478) (2,450 to 2,460 of 2,478 results)

  • Articles and reports: 12-001-X197600100002
    Description: A special class of missing data problems is discussed, namely that of typical survey data whereby zeros dominate the multivariate response space. Here, techniques which impute means (whether conditional or unconditional) distort rather than improve the quality of the data. A probabilistic model is described which provides reasonable estimates, but also upholds the integrity of the data base. Results are given from a comparative study of the proposed methodology with other estimation/imputation models.
    Release date: 1976-06-14

  • 2,452. Raking ratio estimators Archived
    Articles and reports: 12-001-X197600100003
    Description: This paper presents large sample results for the bias and variance of raking-ratio estimators for up to four iterations. Estimators of the bias and variance are also presented. An expression for the asymptotic covariance matrix of the maximum likelihood estimators of the cell proportions in a two-way table with known marginals is also given.
    Release date: 1976-06-14

  • Articles and reports: 12-001-X197600100004
    Description: With the recent review of the Labour Force Survey, several periphexal projects have been redesigned. This is the case with the LFS re-interview program which will for the coming years be oriented toward the measurement of response errors. This paper describes the new design of the program and discusses how data will be analysed to achieve the objectives.
    Release date: 1976-06-14

  • Articles and reports: 12-001-X197600100005
    Description: This paper presents the Behrens-Fisher problem and gives an overview of the major solutions brought forward to this date. The aim of the paper is to use the most appropriate approach to the problem for testing sets of six month Labour Force Survey data against those of a pilot study. This is done since in many cases (such as Methods Test Panel studies) studies are conducted for six consecutive months and comparisons are required on the basis of those sets of six month data. Empirical results are also given by testing Methods Test Panel Phase III data against corresponding Labour Force Survey data.
    Release date: 1976-06-14

  • Articles and reports: 12-001-X197600100006
    Description: Multi-stage statistical surveys as a means of obtaining socioeconomic characteristics for the population have been in use for many years. Each survey requires an extensive and precise sample design which is governed by the cost structure for obtaining the data and the variance of the characteristic data between units at various stages of sampling. The authors analyzed variance components derived from one month's data of the Canadian Labour Force Survey and examined the variance that would have resulted under different allocation strategies in Table 6 and for different average sizes of units in Table 7. The percentage components of variance, the design effects by stage of sampling and population variances between units of the various stages, as well as measures of homogeneity for households within stages, are derived and shown in Tables 2 to 5.

    The analysis was carried out for the Canadian Labour Force Survey, but the methodology of component of variance estimation (Gray [4]) and the methods used to analyze the results of a particular survey are readily applied to any multi-stage statistical sample survey, where Horvitz-Thompsen estimators and ratio estimation are applied.
    Release date: 1976-06-14

  • Articles and reports: 12-001-X197500254826
    Description: Exact formulae for bias and mean square error of an estimator of process average in single sampling with rectification for finite lots are obtained. Efficiency of the estimator as compared to an unbiased estimator based on the first sample is obtained for a number of values of lot size, sample size, acceptance number and process average used in sampling plans in quality control of data processing.
    Release date: 1975-12-15

  • Articles and reports: 12-001-X197500254827
    Description: In the Methods Test Panel Phase II it was required to do analysis of variance on proportions. Since such analysis gives only approximate results, two models were used in order to be able to draw safe conclusions. Analysis of variance was performed with the proportions as variable and also with the arc sine of the square root of the proportions. The two models are outlined in the present paper and empirical comparisons are made using the MTP Phase II data.
    Release date: 1975-12-15

  • Articles and reports: 12-001-X197500254828
    Description: The Canadian Travel Survey, 1971 was the largest survey on travel of Canadian residents. This paper describes some important aspects of the methodology. Particular emphasis is given to the development of definitions in relation to the methodology, the sampling technique and interview strategy.
    Release date: 1975-12-15

  • Articles and reports: 12-001-X197500254829
    Description: J.N.K. Rao (1975) derived a general formula for estimating the variance in multistage sample designs. This general formula extends the previous results by Des Raj (1966) to the case where the conditional variance from a given primary sampling unit is a random variable. The authors reviewed Rao's paper for its application to Horvitz-Thompson and Yates-Grundy variance estimators as well as the variance estimator for the random group method by Rao, Hartley and Cochran (1962). The authors present an altered version of the Yates-Grundy variance estimators as a result of Rao's paper.
    Release date: 1975-12-15

  • Articles and reports: 12-001-X197500254830
    Description: This paper focuses on the improvement of sample survey estimates in the particular situation where the survey sample, or part of it, is included in a larger sample from which auxiliary information is available. The properties of a method of estimation - sometimes applied in specific circumstances - are investigated and the limitations of its application are found. The application of the method to rotation designs in continuing surveys is more closely studied in the context of composite estimation.
    Release date: 1975-12-15
Data (10)

Data (10) ((10 results))

  • Public use microdata: 89F0002X
    Description: The SPSD/M is a static microsimulation model designed to analyse financial interactions between governments and individuals in Canada. It can compute taxes paid to and cash transfers received from government. It is comprised of a database, a series of tax/transfer algorithms and models, analytical software and user documentation.
    Release date: 2026-02-12

  • Profile of a community or region: 46-26-0002
    Description: The National Address Register (NAR) is a list of commercial and residential addresses in Canada that are extracted from Statistics Canada's Building Register and deemed non-confidential.
    Release date: 2025-12-19

  • Table: 89-26-0006
    Description: PASSAGES is an open-source dynamic microsimulation model aimed at supporting policy analysis and research relating to Canadian retirement income system outcomes at the individual and family level. The publicly available version includes a synthetic starting database, a model, and documentation. A confidential starting database is also available.
    Release date: 2025-03-12

  • Data Visualization: 71-607-X2020010
    Description: The Canadian Statistical Geospatial Explorer empowers users to discover geo enabled data holdings of Statistics Canada at various levels of geography including at the neighbourhood level. Users are able to visualize, thematically map, spatially explore and analyze, export and consume data in various formats. Users can also view the data superimposed on satellite imagery, topographic and street layers.
    Release date: 2024-08-21

  • Table: 11-10-0074-01
    Geography: Census tract
    Frequency: Occasional
    Description:

    The divergence index (D-index) describes the degree that families with different income levels are mixing together in neighbourhoods. It compares neighbourhood (census tract, CT) discrete income distributions to a base distribution, which is the income quintiles of the neighbourhood’s census metropolitan area (CMA).

    Release date: 2020-06-22

  • Data Visualization: 71-607-X2019010
    Description: The Housing Data Viewer is a visualization tool that allows users to explore Statistics Canada data on a map. Users can use the tool to navigate, compare and export data.
    Release date: 2019-10-30

  • Table: 53-500-X
    Description:

    This report presents the results of a pilot survey conducted by Statistics Canada to measure the fuel consumption of on-road motor vehicles registered in Canada. This study was carried out in connection with the Canadian Vehicle Survey (CVS) which collects information on road activity such as distance traveled, number of passengers and trip purpose.

    Release date: 2004-10-21

  • Table: 13-220-X
    Description: In the 1997 edition, new and revised benchmarks were introduced for 1992 and 1988. The indicators are used to monitor supply, demand and employment for tourism in Canada on a timely basis. The annual tables are derived using the National Income and Expenditure Accounts (NIEA) and various industry and travel surveys. Tables providing actual data and percentage changes, for seasonally adjusted current and constant price estimates are included. In addition, an analytical section provides graphs, and time series of first differences, percentage changes, and seasonal factors for selected indicators. Data are published from 1987 and the publication will be available on the day of release. New data are included in the demand tables for non-tourism commodities produced by non-tourism industries and in the employment tables covering direct tourism employment generated by non-tourism industries. This product was commissioned by the Canadian Tourism Commission to provide annual updates for the Tourism Satellite Account.
    Release date: 2003-01-08

  • Table: 11-516-X
    Description:

    The second edition of Historical statistics of Canada was jointly produced by the Social Science Federation of Canada and Statistics Canada in 1983. This volume contains about 1,088 statistical tables on the social, economic and institutional conditions of Canada from the start of Confederation in 1867 to the mid-1970s. The tables are arranged in sections with an introduction explaining the content of each section, the principal sources of data for each table, and general explanatory notes regarding the statistics. In most cases, there is sufficient description of the individual series to enable the reader to use them without consulting the numerous basic sources referenced in the publication.

    The electronic version of this historical publication is accessible on the Internet site of Statistics Canada as a free downloadable document: text as HTML pages and all tables as individual spreadsheets in a comma delimited format (CSV) (which allows online viewing or downloading).

    Release date: 1999-07-29

  • Table: 82-567-X
    Description:

    The National Population Health Survey (NPHS) is designed to enhance the understanding of the processes affecting health. The survey collects cross-sectional as well as longitudinal data. In 1994/95 the survey interviewed a panel of 17,276 individuals, then returned to interview them a second time in 1996/97. The response rate for these individuals was 96% in 1996/97. Data collection from the panel will continue for up to two decades. For cross-sectional purposes, data were collected for a total of 81,000 household residents in all provinces (except people on Indian reserves or on Canadian Forces bases) in 1996/97.

    This overview illustrates the variety of information available by presenting data on perceived health, chronic conditions, injuries, repetitive strains, depression, smoking, alcohol consumption, physical activity, consultations with medical professionals, use of medications and use of alternative medicine.

    Release date: 1998-07-29
Analysis (2,036)

Analysis (2,036) (40 to 50 of 2,036 results)

  • Articles and reports: 11-522-X202500100018
    Description: The Child Poverty Reduction Act (2018) outlines a need for the New Zealand Government to set three- and ten-yearly persistent child poverty reduction targets come end of 2024. In the absence of longitudinal survey data, a survey-administrative data hybrid method that will facilitate the production of these reduction targets and official estimates of persistent child poverty once reporting is required for the 2025/2026 financial year onwards is outlined. This hybrid approach leverages off the cross-sectional Household Economic Survey (HES), administrative-based beneficiary's family data, and recent advances developed for the construction of households within the Administrative Population Census (APC) at Statistics New Zealand. With increasing data collection challenges due to rising non-response and costs, this survey-admin hybrid method represents an alternative to longitudinal survey data collection, ensuring ongoing sustainable and quality statistics to produce persistent child poverty estimates.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100019
    Description: Accurate and efficient record linkage is crucial for maintaining a comprehensive and current Statistical Business Register (SBR) at Statistics Canada. Linking external business lists to the SBR by name presents computational and methodological challenges, especially as data volumes grow. This paper describes a scalable methodology that employs blocking techniques to constrain the computational search space and integrates multiple similarity measures—from edit distances and n-gram overlaps to embedding-based methods using Sentence-BERT (SBERT)—to identify likely matches. By combining simple character-level comparisons with more advanced semantic embedding methods, the approach can adapt to various naming conventions and complexities. While it does not guarantee superior accuracy in all circumstances, it offers a pragmatic balance between computational feasibility and linkage quality.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100020
    Description: At Statistics Canada, many data sets are linked with quasi-identifiers such as the first name, last name, or address. In such cases, linkage errors are a potential concern and must be measured. In that regard, previous studies have shown that the evaluation may be based on modeling the number of links from a given record while accounting for all the interactions among the linkage variables and dispensing with clerical reviews, so long as the decision to link two records does not involve other records. In this communication, the methodology is adapted for a class of practical strategies, which violate this constraint by linking the records in consecutive waves, where a given wave links a subset of the records that are not linked in previous waves. In particular, the linkage may be based on a deterministic wave followed by a probabilistic one.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100021
    Description: Optimal threshold selection is a critical challenge in probabilistic linkage, with significant implications for the accuracy and reliability of linked datasets. This paper analyzes the performance of the neighbour model, a recently proposed error model which models linkage errors by the number of links from each record. Three threshold selection algorithms utilizing the neighbour model were assessed, highlighting the strengths and limitations of each. Their performance was assessed through simulation studies, which demonstrated that methods using the neighbour model achieved lower relative bias compared to two established methods for threshold selection. Additionally, the practical utility was validated through goodness-of-fit tests conducted on four agricultural datasets, showing the potential of the model for use in real-world applications.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100022
    Description: In Canada, T1 Tax forms are used to report personal income, whether earned as an employee or through self-employment. Income from self-employment, or "T1 Business Income" is reported by sole proprietorships or partnerships. A T1 partnership involves two or more legal entities jointly filing for a shared business. T1 business data is received as individual filings, meaning partnerships are received separately for each partner. Internal record linkage within the T1 business database is performed to identify partnerships and prevent overcoverage within the final population of T1 businesses. This new T1 partnership identification process takes advantage of newer algorithms, such as DBSCAN numerical clustering fuzzy matching, to identify internal linkages. Graph theory is used to construct the list of partnerships from the row-pairs identified in the linkage process.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100023
    Description: The latest Canadian Census Health and Environment Cohort (CanCHEC) continues a series of population-based microdata linkages focused on population health research by demographic, social and economic characteristics. The 2021 CanCHEC consists of 95.5% of the 2021 Census long-form sample survey records. The records of survey respondents that could not be linked to the Derived Record Depository and those presumed to be duplicates account for the remaining 4.5%. Linkage-adjusted main and replicate weights allow researchers to estimate and evaluate the variance of summary measures about population health in the presence of missed linked pairs to better understand the experiences of diverse population groups.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100024
    Description: This paper explores a vision for the future of National Statistics Offices (NSOs). It analyses the history and role of NSOs before exploring current and future challenges and opportunities for NSOs, before finally outlining a future where NSOs become more agile, open, and collaborative while maintaining their high level of trust in the community, thereby allowing them to fulfil their new role as data stewards in a rapidly evolving data landscape.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100025
    Description: National statistical offices have increasingly adopted machine learning (ML) for its potential to improve survey estimates. ML techniques offer significant advantages, notably the ability to manage high-dimensional data and to capture complex, nonlinear relationships, thereby enhancing the overall quality of survey statistics. In this article, following the approach of Chernozhukov et al. (2018), we describe a double debiased machine learning framework that enables valid statistical inference when imputed estimators are derived from ML procedures. Simulation results suggest that the proposed framework performs well in a wide range of scenarios.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100026
    Description: In 2022, Canada became the first country to release statistical information about its transgender and non-binary populations based on census data. Moreover, following a 2018 government-wide policy direction, Statistics Canada's surveys have been collecting and disseminating information about gender by default rather than sex at birth. Due to the small size of the transgender and non-binary populations, disseminating safe statistical information about them at detailed geographical levels poses a challenge.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100027
    Description: Several challenges encountered when constructing U.S. administrative record-based (AR-based) population estimates for 2020 are identified. They include locational accuracy, person coverage and its consistency over time, filtering out non-residents and people not alive on the reference date, uncovering missing links across person and address records, and predicting demographic characteristics. Several ways to address these issues are discussed. Regression results illustrate how the challenges and solutions affect the AR-based county population estimates.
    Release date: 2025-09-08
Reference (380)

Reference (380) (300 to 310 of 380 results)

  • Surveys and statistical programs – Documentation: 11-522-X19980015037
    Description:

    For longitudinal data, mixed models are often used, since they allow analysts to take account of the correlation between different observations from the same individual. The finite mixture model may be considered as a special case of a mixed model. In this document, attention will be given to the maximum likelihood method. The maximization of the likelihood function for a finite mixture of distributions is generally more difficult than in the usual case of a single distribution and can require considerable time. The objective of this project was therefore primarily to identify the one or more algorithms that best meet the criteria of run time and of efficiency in finding the solution. To achieve this objective, a simulation study was carried out. Only the situation in which the dependent variable is dichotomous was considered. This situation is very useful in practice, since among other things it can be used to model discrete durations, such as the length of time in "low income" status.

    Release date: 1999-10-22

  • Surveys and statistical programs – Documentation: 13F0026M1999006
    Description:

    Although income and expenditure data provide an indication of current consumption and ability to purchase goods and services, they provide little information on the long-term ability of families to sustain themselves. The results of this survey will provide information on the net worth (wealth) of Canadian families, that is, the value of their assets less their debts.

    This paper examines the objectives of the survey, how the survey has changed since 1984, the types of questions being asked and information that will be provided, as well as other survey background. An accompanying table outlines the content of the questionnaire. The intent of this paper is to describe the work done to date and the next steps for this important subject.

    Release date: 1999-09-27

  • Surveys and statistical programs – Documentation: 75F0002M1999003
    Description:

    This document presents the questions, responses and interview flow for the Contact and Demographic portions of the Survey of Labour and Income Dynamics (SLID) interviews.

    Release date: 1999-09-27

  • Surveys and statistical programs – Documentation: 75F0002M1999004
    Description:

    This paper presents the questions, possible responses and question flows for the 1999 Survey of Labour and Income Dynamics (SLID) preliminary questionnaire.

    Release date: 1999-09-27

  • Surveys and statistical programs – Documentation: 75F0002M1999005
    Description:

    This paper outlines the structure of the January 1999 Survey of Labour and Income Dynamics (SLID) labour interview questionnaire, including question wording, possible responses, and flows of questions.

    Release date: 1999-09-27

  • Surveys and statistical programs – Documentation: 68F0015X
    Description:

    The purpose of this paper is to provide some general background and describe the methodology of the pilot year Unified Enterprise Survey (UES). It also illustrates the role of the Unified Enterprise Survey Program (UESP) within The Project to Improve Provincial Economic Statistics (PIPES) program. This information package is targeted toward external clients, for example the Provincial Focal Points, enabling them to assess future data releases planned by industry sector. The scope of this information package will be expanded as subsequent data releases over the next six months or so provide more industry specific details for the seven new pilot industries included in the 1997 UES. This document is approximately twenty-two pages in length and is to be offered at no charge to callers requesting information on the UES.

    Release date: 1999-09-01

  • Surveys and statistical programs – Documentation: 92-353-X
    Description:

    This report deals with age, sex, marital status and common-law status. It is aimed at informing users about the complexity of the data and any difficulties that could affect their use. It explains the theoretical framework and definitions used to gather the data, and describes unusual circumstances that could affect data quality. Moreover, the report touches upon data capture, edit and imputation, and deals with the historical comparability of the data.

    Release date: 1999-04-16

  • Notices and consultations: 13F0026M1999001
    Description:

    The main objectives of a new Canadian survey measuring asset and debt holding of families and individuals will be to update wealth information that is over one decade old; to improve the reliability of the wealth estimates; and, to provide a primary tool for analysing many important policy issues related to the distribution of assets and debts, future consumption possibilities, and savings behaviour that is of interest to governments, business and communities.

    This paper is the document that launched the development of the new asset and debt survey, subsequently renamed the Survey of Financial Security. It looks at the conceptual framework for the survey, including the appropriate unit of measurement (family, household or person) and discusses measurement issues such as establishing an accounting framework for assets and debts. The variables proposed for inclusion are also identified. The paper poses several questions to readers and asks for comments and feedback.

    Release date: 1999-03-23

  • Notices and consultations: 13F0026M1999002
    Description:

    This document summarizes the comments and feedback received on an earlier document: Towards a new Canadian asset and debt survey - A content discussion paper. The new asset and debt survey (now called the Survey of Financial Security) is to update the wealth information on Canadian families and unattached individuals. Since the last data collection was conducted in 1984, it was essential to include a consultative process in the development of the survey in order to obtain feedback on issues of concern and to define the conceptual framework for the survey.

    Comments on the content discussion paper are summarized by major theme and sections indicate how the suggestions are being incorporated into the survey or why they could not be incorporated. This paper also mentions the main objectives of the survey and provides an overview of the survey content, revised according to the feedback from the discussion paper.

    Release date: 1999-03-23

  • Surveys and statistical programs – Documentation: 13F0026M1999003
    Description:

    This paper presents a proposal for conducting a Canadian asset and debt survey. The first step in preparing this proposal was the release, in February 1997, of a document entitled Towards a new Canadian asset and debt survey whose intent was to elicit feedback on the initial thinking regarding the content of the survey.

    This paper reviews the conceptual framework for a new asset and debt survey, data requirements, survey design, collection methodology and testing. It provides also an overview of the anticipated data processing system, describes the analysis and dissemination plan (analytical products and microdata files), and identifies the survey costs and major milestones. Finally, it presents the management/coordination approach used.

    Release date: 1999-03-23