Statistical methods

Skip to filters. View results.

Key indicators

Changing any selection will automatically update the page content.

Selected geographical area:Canada

Selected geographical area:Newfoundland and Labrador

Selected geographical area:Prince Edward Island

Selected geographical area:Nova Scotia

Selected geographical area:New Brunswick

Selected geographical area:Quebec

Selected geographical area:Ontario

Selected geographical area:Manitoba

Selected geographical area:Saskatchewan

Selected geographical area:Alberta

Selected geographical area:British Columbia

Selected geographical area:Yukon

Selected geographical area:Northwest Territories

Selected geographical area:Nunavut

Sort Help
entries

Results

All (2,481)

All (2,481) (40 to 50 of 2,481 results)

  • Articles and reports: 11-522-X202500100006
    Description: Small area estimation is frequently used to produce estimates at a disaggregated level where direct survey estimation does not have sufficient sample to produce precise estimates. Often this is done using the area-level Fay-Herriot model, by assuming the direct estimates are independent under the design and have a known variance, and applying a smoothing process to the variance estimates of the direct estimates to better meet that last assumption. It is not rare that small area estimates are benchmarked/raked to aggregated level direct estimates. This article shows that wrongly assuming independence can have a big impact on the MSE of the raked estimates. Values of the covariances between direct estimates are thus required for good point and MSE estimates. Getting good estimates of those covariances is difficult given the small sample sizes in some areas. An original way of deriving values for those covariances, by reverse-engineering a hypothetical raking process, is presented.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100007
    Description: This paper employs the Pseudo Maximum Likelihood (PML) estimator to the non-probability two-phase sampling when relevant auxiliary information is available from both probability survey sample and non-probability survey sample. To accommodate various weight adjustments and estimates variance beyond totals and means such as medians and quantiles, a simplified pseudo-population bootstrap procedure is proposed to approximately estimate the second-phase variance. Specifically, the simplification ignores the second phase sampling variability (i.e., treated as fixed, while in fact it is random), if the first-phase sampling fraction of the non-probability sample is negligible. Using the Bank of Canada 2020 Cash Alternative Survey Wave 2, the performance of the proposed method is compared to alternative methods, which either do not explicitly model the selection probability (i.e., raking) or ignore the valuable information from Phase 1 (i.e., Phase-2-Only). The results show that the PML-based approach performs better than raking and Phase-2-Only estimates in terms of reducing the selection bias for both phases' payment-related variables, especially for the low-response youth group. Estimated variances of the PML-based estimates are stable.

    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100008
    Description: In 2020, Statistics Canada started to use probabilistic web panels as an alternate method of collecting official statistics. In a web panel, respondents to another survey are asked for contact information to participate in future short surveys. This paper will highlight Statistics Canada's experience with panels after 4 years, including what has been learned about the recruitment of panel participants and how to subsequently collect data using panel surveys. The ways in which recruitment questions are presented can result in very different rates of participation. Moreover, the wealth of auxiliary information available on the recruitment survey can be used to actively manage panel collection operations, by predicting the probability of response and using this information to target follow-up efforts.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100009
    Description: Three series of web panels were implemented at Statistics Canada from 2020 to 2024. Participants for these web panel series were recruited from respondents of large probabilistic social surveys (recruitment surveys), and subsequently were invited to complete a series of short online surveys. Estimates of recruitment survey variables were calculated using both recruitment survey weights and web panel weights, and these were compared; differences signal the possibility of residual bias that was not corrected by the web panel weighting process. This investigation found more significant differences than would be expected if the web panel estimator fully corrected for the bias resulting from the web panel response process. Questions related to certain topics such as politics and voting, sense of belonging, and media consumption were found to have the most significant differences between web panel estimates and recruitment survey estimates.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100010
    Description: Statistics Canada's Labour Force Survey (LFS) plays an essential role in the estimation of labour market conditions in Canada. Periodically, LFS revises its data to the most recent industry and occupational classification versions. Differences in versions can be extensive, including high-level and unit-group structural changes, creations, deletions, split-offs and combination of classification units (classes). Historically, to reconcile split-off classes - where one class splits into multiple classes - a sample of LFS split-off records would be manually recoded to the new classification version. Based on the split-off proportion observed in the recoded sample, a random allocation method would be applied on all data to reflect the changing Canadian labour market over time. This article proposes using machine learning (fastText), constrained to split-off proportions using linear programming, to revise industry and occupation classifications in LFS. The hybrid framework benefits from a text-based revision mechanism while adhering to traditional proportions driven estimates, thus ensuring a minimal impact on the comparability of published labour market indicators.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100011
    Description: The use of modern "data"-driven imputation methods to treat non-response in the context of surveys processed in the Integrated Business Statistics Program at Statistics Canada has previously been explored. It was observed that these methods can lead to high quality imputation and further have the potential to result in broad efficiencies when setting up a particular survey's edit and imputation strategy. However, estimation of the associated total variance, more specifically the component due to imputation, remains a challenge. In this article, two methods for estimation of total variance are proposed and show preliminary results that have motivated us to pursue further research in this area.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100012
    Description: In 2022, the Institut de la statistique du Québec conducted a survey of high school students in Nunavik, a unique, remote region of Quebec. The survey aimed to develop a portrait of the state of the students' physical and mental health, their lifestyle habits and their environment. This article describes the challenges encountered during the survey and the solutions put in place to overcome them.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100013
    Description: As part of answering the call to action for the United Nations' (UN) 17 Sustainable Development Goals, as well as addressing social, economic, and equity challenges within Canada, Statistics Canada's five-year development phase for the Disaggregated Data Action Plan (DDAP) was funded in 2021 to support data driven decision around these challenges. In turn, the document "Guiding Principles: Leveraging the 2021 Census of Populations Data for DDAP Groups of Interest" were created. The guiding principles document explains the organizational framework of the DDAP in the Agency, describes existing data sources, addresses ethical and privacy concerns, and centralizes sampling methods tailored for DDAP initiatives while accounting for characteristics which can complicate sampling and data collection procedures.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100014
    Description: Artificial intelligence (AI) with its subfield machine learning (ML) has found its way into administration in general and also into official statistics in Germany in particular. This paper highlights the ethical issues that may arise when using AI/ML in official statistics and examines whether a separate ethical framework is needed to deal with these issues appropriately, as is proposed by institutions of other countries and intergovernmental institutions related to official statistics. The results of the study are presented to show that the implementation of the requirements of the existing and mostly non-AI/ML-specific frames of reference such as law and quality is already sufficient to adequately address the ethical issues based on risk scenarios.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100015
    Description: Currently, Statistics Canada has no official guidance on confidentiality rules for releasing small area estimate. In recent years, there has been increasing demand from Research Data Centre (RDC) researchers for comprehensive confidentiality guidelines such that they can publish small area estimates in their research. This confidentiality analysis applies to area-level small area estimation.
    Release date: 2025-09-08
Data (10)

Data (10) ((10 results))

  • Public use microdata: 89F0002X
    Description: The SPSD/M is a static microsimulation model designed to analyse financial interactions between governments and individuals in Canada. It can compute taxes paid to and cash transfers received from government. It is comprised of a database, a series of tax/transfer algorithms and models, analytical software and user documentation.
    Release date: 2026-02-12

  • Profile of a community or region: 46-26-0002
    Description: The National Address Register (NAR) is a list of commercial and residential addresses in Canada that are extracted from Statistics Canada's Building Register and deemed non-confidential.
    Release date: 2025-12-19

  • Table: 89-26-0006
    Description: PASSAGES is an open-source dynamic microsimulation model aimed at supporting policy analysis and research relating to Canadian retirement income system outcomes at the individual and family level. The publicly available version includes a synthetic starting database, a model, and documentation. A confidential starting database is also available.
    Release date: 2025-03-12

  • Data Visualization: 71-607-X2020010
    Description: The Canadian Statistical Geospatial Explorer empowers users to discover geo enabled data holdings of Statistics Canada at various levels of geography including at the neighbourhood level. Users are able to visualize, thematically map, spatially explore and analyze, export and consume data in various formats. Users can also view the data superimposed on satellite imagery, topographic and street layers.
    Release date: 2024-08-21

  • Table: 11-10-0074-01
    Geography: Census tract
    Frequency: Occasional
    Description:

    The divergence index (D-index) describes the degree that families with different income levels are mixing together in neighbourhoods. It compares neighbourhood (census tract, CT) discrete income distributions to a base distribution, which is the income quintiles of the neighbourhood’s census metropolitan area (CMA).

    Release date: 2020-06-22

  • Data Visualization: 71-607-X2019010
    Description: The Housing Data Viewer is a visualization tool that allows users to explore Statistics Canada data on a map. Users can use the tool to navigate, compare and export data.
    Release date: 2019-10-30

  • Table: 53-500-X
    Description:

    This report presents the results of a pilot survey conducted by Statistics Canada to measure the fuel consumption of on-road motor vehicles registered in Canada. This study was carried out in connection with the Canadian Vehicle Survey (CVS) which collects information on road activity such as distance traveled, number of passengers and trip purpose.

    Release date: 2004-10-21

  • Table: 13-220-X
    Description: In the 1997 edition, new and revised benchmarks were introduced for 1992 and 1988. The indicators are used to monitor supply, demand and employment for tourism in Canada on a timely basis. The annual tables are derived using the National Income and Expenditure Accounts (NIEA) and various industry and travel surveys. Tables providing actual data and percentage changes, for seasonally adjusted current and constant price estimates are included. In addition, an analytical section provides graphs, and time series of first differences, percentage changes, and seasonal factors for selected indicators. Data are published from 1987 and the publication will be available on the day of release. New data are included in the demand tables for non-tourism commodities produced by non-tourism industries and in the employment tables covering direct tourism employment generated by non-tourism industries. This product was commissioned by the Canadian Tourism Commission to provide annual updates for the Tourism Satellite Account.
    Release date: 2003-01-08

  • Table: 11-516-X
    Description:

    The second edition of Historical statistics of Canada was jointly produced by the Social Science Federation of Canada and Statistics Canada in 1983. This volume contains about 1,088 statistical tables on the social, economic and institutional conditions of Canada from the start of Confederation in 1867 to the mid-1970s. The tables are arranged in sections with an introduction explaining the content of each section, the principal sources of data for each table, and general explanatory notes regarding the statistics. In most cases, there is sufficient description of the individual series to enable the reader to use them without consulting the numerous basic sources referenced in the publication.

    The electronic version of this historical publication is accessible on the Internet site of Statistics Canada as a free downloadable document: text as HTML pages and all tables as individual spreadsheets in a comma delimited format (CSV) (which allows online viewing or downloading).

    Release date: 1999-07-29

  • Table: 82-567-X
    Description:

    The National Population Health Survey (NPHS) is designed to enhance the understanding of the processes affecting health. The survey collects cross-sectional as well as longitudinal data. In 1994/95 the survey interviewed a panel of 17,276 individuals, then returned to interview them a second time in 1996/97. The response rate for these individuals was 96% in 1996/97. Data collection from the panel will continue for up to two decades. For cross-sectional purposes, data were collected for a total of 81,000 household residents in all provinces (except people on Indian reserves or on Canadian Forces bases) in 1996/97.

    This overview illustrates the variety of information available by presenting data on perceived health, chronic conditions, injuries, repetitive strains, depression, smoking, alcohol consumption, physical activity, consultations with medical professionals, use of medications and use of alternative medicine.

    Release date: 1998-07-29
Analysis (2,037)

Analysis (2,037) (2,000 to 2,010 of 2,037 results)

  • Articles and reports: 12-001-X197700100003
    Description: This paper describes the methodology of the Response Incentives Experiment which was carried out in the Canadian Labour Force Survey in order to determine the effectiveness of a response incentive on improving respondent relations and interviewer performance. Included in the paper are various results relating to non-response rates and refusal rates as well as results of an evaluation questionnaire which was completed by all interviewers at the conclusion of the experiment.
    Release date: 1977-06-20

  • Articles and reports: 12-001-X197700100004
    Description: The 1971 and 1976 Censuses of Population and Housing have utilized the raking ratio estimation procedure to obtain estimates for variables collected only on a sample basis. This paper derives large sample approximations for the bias and variance of such estimates and examines their performance in an empirical study.
    Release date: 1977-06-20

  • Articles and reports: 12-001-X197700100005
    Description: Objective yield surveys have been conducted annually in the Niagara Peninsula since 1964. The aim of each of these annual surveys is to provide a forecast of the marketable production change in the region from the previous year. These estimates are determined far enough in advance of the harvest to enable them to serve as important factors in price negotiations between growers and processors, as well as indicators of particular crop situations which could necessitate immediate changes in strategy by the marketing agencies. In 1973 an extensive redesign project was initiated. This report provides a summary of the sample design, data collection procedures and estimation procedures which were incorporated in the redesign of the sour cherry, peach and grape objective yield surveys.
    Release date: 1977-06-20

  • Articles and reports: 12-001-X197700100006
    Description: The problem considered is the estimation of population total of some characteristic from a simple random sample containing a few large or extreme observations. The effect of these large units in the sample is to distort the estimate of the population total. It is therefore important to correct the weights for such units or deflate their values at the estimation stage once they have been sampled and identified as unusually large units. In this paper, three estimators which alter the usual sampling weights have been considered. The efficiencies of these estimators have been worked out in terms of the ratio of the variance of the usual estimator of the population total to the mean square error of these estimators. An empirical study of these estimators is also discussed.
    Release date: 1977-06-20

  • Articles and reports: 12-001-X197700100007
    Description: The paper attempts to examine some of the procedures used for compensation for non-response. Using the concept of response probabilities, a simple response - non-response error model is developed and the components of response and non-response errors are identified under various imputation procedures. A graph is also given in order to provide an idea of the magnitude of the non­response bias in a particular situation. Two examples of the practical application of imputation are discussed.
    Release date: 1977-06-20

  • Articles and reports: 12-001-X197600200001
    Description: This paper presents results on rotation group biases in the Canadian Labour Force Survey (LFS). The biases are studied in detail by decomposition into components responsible for the biases. Also, a comparison between the old and the new LFS is done on the basis of 1975 parallel run and differences are analyzed. Some conclusions are drawn and recommendations for other studies presented.
    Release date: 1976-12-13

  • Articles and reports: 12-001-X197600200002
    Description: To obtain estimates of means or totals for a universe, a sample of units is often drawn to represent the universe and these units are then surveyed. One of the most important procedures used in the selection of the units is that of stratification, whereby the universe is split up into strata and independent samples of units are drawn from each stratum. A stratification index is developed to indicate the approximate fractional reduction in the sampling variance from that which would result if no stratification were undertaken. Also the methodology is extended to examine the effect of stratification on the sampling variance at different levels of stratification through the concept of a summary index. The stratification index is also extended to the case of ratio estimates using independent source data to re-weight the sample data. The index has been applied to the Canadian Labour Force Survey (LFS), a typical multi-stage stratified sample where ratio estimation, using projected age-sex population estimates is applied and empirical data are presented and analyzed.
    Release date: 1976-12-13

  • Articles and reports: 12-001-X197600200003
    Description: The 1974 Survey of Housing Units was carried out by Statistics Canada on behalf of the Central Mortgage and Housing Corporation during the autumn of 1974. Statistics Canada's responsibilities on this project included the design and implementation of all phases of the survey up to and including the production of "clean" micro data tapes. The sponsoring department was in turn responsible for the specification of objectives and data requirements and for the analysis of the resulting data.

    This report, which is a modification of the summary report produced by the project team at the conclusion of the project, provides a general description of the survey and the work done by Statistics Canada on the survey.
    Release date: 1976-12-13

  • Articles and reports: 12-001-X197600200004
    Description: Published reports for the 1976 Census will include estimates of Total Variance as indicators of the reliability of the figures in these reports. In order to obtain these estimates of Total Variance, an Interpenetrating Design Experiment was incorporated into the collection methods for a sample of enumeration areas. In this paper we derive the formula for Total Variance in terms of variances due to sampling, correlated response and simple response. We then show how the Total Variance, and its components, can be estimated from the design and we give the estimators that will be used for the 1976 Census. The estimates of sampling and correlated response variance are unbiased but the simple response variance estimate is not.
    Release date: 1976-12-13

  • Articles and reports: 12-001-X197600200005
    Description: The 1971 Reverse Record Check is one of the most important studies that were carried out as part of the 1971 Census Evaluation Programme. Its main purpose was to investigate the incidence of under-enumeration in the 1971 Census. To do this, a frame containing all persons who should be enumerated in the 1971 Census was built up from the 1966 Census returns, plus birth and immigrant registrations. A random sample was selected from the frame and each selected person was traced to his current Census address. Current Census returns were then checked to see whether or not the selected person was enumerated. Sample figures were weighted up to the population level to obtain estimates of undercoverage. This paper gives a general description of the methodology of this study, and indicates some of the resulting improvements incorporated for 1976.
    Release date: 1976-12-13
Reference (382)

Reference (382) (40 to 50 of 382 results)

  • Surveys and statistical programs – Documentation: 99-011-X
    Description:

    This topic presents data on the Aboriginal peoples of Canada and their demographic characteristics. Depending on the application, estimates using any of the following concepts may be appropriate for the Aboriginal population: (1) Aboriginal identity, (2) Aboriginal ancestry, (3) Registered or Treaty Indian status and (4) Membership in a First Nation or Indian band. Data from the 2011 National Household Survey are available for the geographical locations where these populations reside, including 'on reserve' census subdivisions and Inuit communities of Inuit Nunangat as well as other geographic areas such as the national (Canada), provincial and territorial levels.

    Analytical products

    The analytical document provides analysis on the key findings and trends in the data, and is complimented with the short articles found in NHS in Brief and the NHS Focus on Geography Series.

    Data products

    The NHS Profile is one data product that provides a statistical overview of user selected geographic areas based on several detailed variables and/or groups of variables. Other data products include data tables which represent a series of cross tabulations ranging in complexity and are available for various levels of geography.

    Release date: 2019-10-29

  • Surveys and statistical programs – Documentation: 11-621-M2018105
    Description:

    Statistics Canada needs to respond to the legalization of cannabis for non-medical use by measuring various aspects of the introduction of cannabis in the Canadian economy and society. An important part of measuring the economy and society is using statistical classifications. It is common practice with classifications that they are updated and revised as new industries, products, occupations and educational programs are introduced into the Canadian economy and society. This paper describes the changes to the various statistical classifications used by Statistics Canada in order to measure the introduction of legal non-medical cannabis.

    Release date: 2019-07-24

  • Surveys and statistical programs – Documentation: 11-633-X2019001
    Description:

    The mandate of the Analytical Studies Branch (ASB) is to provide high-quality, relevant and timely information on economic, health and social issues that are important to Canadians. The branch strategically makes use of expert knowledge and a large range of statistical sources to describe, draw inferences from, and make objective and scientifically supported deductions about the evolving nature of the Canadian economy and society. Research questions are addressed by applying leading-edge methods, including microsimulation and predictive analytics using a range of linked and integrated administrative and survey data. In supporting greater access to data, ASB linked data are made available to external researchers and policy makers to support evidence-based decision making. Research results are disseminated by the branch using a range of mediums (i.e., research papers, studies, infographics, videos, and blogs) to meet user needs. The branch also provides analytical support and training, feedback, and quality assurance to the wide range of programs within and outside Statistics Canada.

    Release date: 2019-05-29

  • Notices and consultations: 75F0002M2019006
    Description:

    In 2018, Statistics Canada released two new data tables with estimates of effective tax and transfer rates for individual tax filers and census families. These estimates are derived from the Longitudinal Administrative Databank. This publication provides a detailed description of the methods used to derive the estimates of effective tax and transfer rates.

    Release date: 2019-04-16

  • Surveys and statistical programs – Documentation: 75-005-M2019001
    Description:

    The production of statistics from the Labour Force Survey (LFS) involves many activities, one of which is data processing. This step involves the verification and correction of survey data when required in order to produce microdata files. Beginning in January 2019, LFS processing will be transitioned to a new system, the Social Survey Processing Environment. This document describes the development and testing that preceded the implementation of the new system, and demonstrates that the transition is expected to have minimal impact on LFS estimates and be transparent to users of LFS data.

    Release date: 2019-02-08

  • Surveys and statistical programs – Documentation: 11-633-X2018019
    Description:

    The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 30 years. The IMDB combines administrative files on immigrant admissions and non-permanent resident permits from Immigration, Refugees and Citizenship Canada (IRCC) with tax files from the Canadian Revenue Agency (CRA). Information is available for immigrant taxfilers admitted since 1980. Tax records for 1982 and subsequent years are available for immigrant taxfilers. This report will discuss the IMDB data sources, concepts and variables, record linkage, data processing, dissemination, data evaluation and quality indicators, comparability with other immigration datasets, and the analyses possible with the IMDB.

    Release date: 2018-12-10

  • Surveys and statistical programs – Documentation: 11-633-X2018011
    Description:

    The Longitudinal Immigration Database (IMDB) is a comprehensive source of data that plays a key role in the understanding of the economic behaviour of immigrants. It is the only annual Canadian dataset that allows users to study the characteristics of immigrants to Canada at the time of admission and their economic outcomes and regional (inter-provincial) mobility over a time span of more than 30 years. The IMDB combines administrative files on immigrant admissions and non-permanent resident permits from Immigration, Refugees and Citizenship Canada (IRCC) with tax files from the Canadian Revenue Agency (CRA). Information is available for immigrant taxfilers admitted since 1980. Tax records for 1982 and subsequent years are available for immigrant taxfilers.

    This report will discuss the IMDB data sources, concepts and variables, record linkage, data processing, dissemination, data evaluation and quality indicators, comparability with other immigration datasets, and the analyses possible with the IMDB.

    Release date: 2018-01-08

  • Surveys and statistical programs – Documentation: 71-526-X
    Description:

    The Canadian Labour Force Survey (LFS) is the official source of monthly estimates of total employment and unemployment. Following the 2011 census, the LFS underwent a sample redesign to account for the evolution of the population and labour market characteristics, to adjust to changes in the information needs and to update the geographical information used to carry out the survey. The redesign program following the 2011 census culminated with the introduction of a new sample at the beginning of 2015. This report is a reference on the methodological aspects of the LFS, covering stratification, sampling, collection, processing, weighting, estimation, variance estimation and data quality.

    Release date: 2017-12-21

  • Surveys and statistical programs – Documentation: 12-606-X
    Description: This is a toolkit intended to aid data producers and data users external to Statistics Canada.
    Release date: 2017-09-27

  • Surveys and statistical programs – Documentation: 91F0015M2017013
    Description:

    Using records linkage, this article compares the place of residence in the 2011 Census to that of the 2010 T1 Family File (T1FF). The main result is that although the overall level of consistency in the place of residence is relatively high, it decreases, sometimes substantially, for some segments of the population.

    Release date: 2017-09-26