Statistical techniques

Filter results by

Search Help
Currently selected filters that can be removed

Keyword(s)

Geography

3 facets displayed. 0 facets selected.

Survey or statistical program

48 facets displayed. 0 facets selected.

Content

1 facets displayed. 0 facets selected.
Sort Help
entries

Results

All (188)

All (188) (0 to 10 of 188 results)

  • Articles and reports: 75-005-M2024004
    Description: This article provides information about population totals in the Labour Force Survey (LFS), including details on who is included in the survey target population, and a description of the methodology used to produce monthly population totals in the LFS. The note also provides guidance on how to interpret population statistics in the LFS, and discusses the extent to which the LFS can be used to examine disaggregated labour market indicators for new immigrants and non-permanent residents.
    Release date: 2024-09-20

  • Journals and periodicals: 11-633-X
    Description: Papers in this series provide background discussions of the methods used to develop data for economic, health, and social analytical studies at Statistics Canada. They are intended to provide readers with information on the statistical methods, standards and definitions used to develop databases for research purposes. All papers in this series have undergone peer and institutional review to ensure that they conform to Statistics Canada's mandate and adhere to generally accepted standards of good professional practice.
    Release date: 2024-09-11

  • Articles and reports: 11-522-X202200100008
    Description: The publication of more disaggregated data can increase transparency and provide important information on underrepresented groups. Developing more readily available access options increases the amount of information available to and produced by researchers. Increasing the breadth and depth of the information released allows for a better representation of the Canadian population, but also puts a greater responsibility on Statistics Canada to do this in a way that preserves confidentiality, and thus it is helpful to develop tools which allow Statistics Canada to quantify the risk from the additional data granularity. In an effort to evaluate the risk of a database reconstruction attack on Statistics Canada’s published Census data, this investigation follows the strategy of the US Census Bureau, who outlined a method to use a Boolean satisfiability (SAT) solver to reconstruct individual attributes of residents of a hypothetical US Census block, based just on a table of summary statistics. The technique is expanded to attempt to reconstruct a small fraction of Statistics Canada’s Census microdata. This paper will discuss the findings of the investigation, the challenges involved in mounting a reconstruction attack, and the effect of an existing confidentiality measure in mitigating these attacks. Furthermore, the existing strategy is compared to other potential methods used to protect data – in particular, releasing tabular data perturbed by some random mechanism, such as those suggested by differential privacy.
    Release date: 2024-03-25

  • Articles and reports: 11-522-X202200100014
    Description: Ethnic minorities are often underrepresented in survey research, due to the challenges many researchers face in including these populations. While some studies discuss several methods in comparison, few have directly compared these methods empirically, leaving researchers seeking to include ethnic minorities in their studies unsure of their best options. In this article, I briefly review the methodological and ethical reasons for increasing ethnic minority representation in social science research, as well as challenges of doing so. I then present findings from ten studies which empirically compare methods of sampling and/or recruiting ethnic minority individuals. Finally, I discuss some implications for future research.
    Release date: 2024-03-25

  • Articles and reports: 12-001-X202300200005
    Description: Population undercoverage is one of the main hurdles faced by statistical analysis with non-probability survey samples. We discuss two typical scenarios of undercoverage, namely, stochastic undercoverage and deterministic undercoverage. We argue that existing estimation methods under the positivity assumption on the propensity scores (i.e., the participation probabilities) can be directly applied to handle the scenario of stochastic undercoverage. We explore strategies for mitigating biases in estimating the mean of the target population under deterministic undercoverage. In particular, we examine a split population approach based on a convex hull formulation, and construct estimators with reduced biases. A doubly robust estimator can be constructed if a followup subsample of the reference probability survey with measurements on the study variable becomes feasible. Performances of six competing estimators are investigated through a simulation study and issues which require further investigation are briefly discussed.
    Release date: 2024-01-03

  • Articles and reports: 11-633-X2023003
    Description: This paper spans the academic work and estimation strategies used in national statistics offices. It addresses the issue of producing fine, grid-level geography estimates for Canada by exploring the measurement of subprovincial and subterritorial gross domestic product using Yukon as a test case.
    Release date: 2023-12-15

  • Surveys and statistical programs – Documentation: 84-538-X
    Geography: Canada
    Description: This electronic publication presents the methodology underlying the production of the life tables for Canada, provinces and territories.
    Release date: 2023-08-28

  • Articles and reports: 12-001-X202300100001
    Description: Recent work in survey domain estimation allows for estimation of population domain means under a priori assumptions expressed in terms of linear inequality constraints. For example, it might be known that the population means are non-decreasing along ordered domains. Imposing the constraints has been shown to provide estimators with smaller variance and tighter confidence intervals. In this paper we consider a formal test of the null hypothesis that all the constraints are binding, versus the alternative that at least one constraint is non-binding. The test of constant versus increasing domain means is a special case. The power of the test is substantially better than the test with the same null hypothesis and an unconstrained alternative. The new test is used with data from the National Survey of College Graduates, to show that salaries are positively related to the subject’s father’s educational level, across fields of study and over several years of cohorts.
    Release date: 2023-06-30

  • Articles and reports: 12-001-X202300100002
    Description: We consider regression analysis in the context of data integration. To combine partial information from external sources, we employ the idea of model calibration which introduces a “working” reduced model based on the observed covariates. The working reduced model is not necessarily correctly specified but can be a useful device to incorporate the partial information from the external data. The actual implementation is based on a novel application of the information projection and model calibration weighting. The proposed method is particularly attractive for combining information from several sources with different missing patterns. The proposed method is applied to a real data example combining survey data from Korean National Health and Nutrition Examination Survey and big data from National Health Insurance Sharing Service in Korea.
    Release date: 2023-06-30

  • Articles and reports: 11-637-X202200100007
    Description:

    As the seventh goal outlined in the 2030 Agenda for Sustainable Development, Canada and other UN member states have committed to ensure access to affordable, reliable, sustainable and modern energy for all by 2030. This 2022 infographic provides an overview of indicators underlying the seventh Sustainable Development Goal in support of affordable and clean energy, and the statistics and data sources used to monitor and report on this goal in Canada.

    Release date: 2022-12-13
Data (1)

Data (1) ((1 result))

  • Table: 11-10-0074-01
    Geography: Census tract
    Frequency: Occasional
    Description:

    The divergence index (D-index) describes the degree that families with different income levels are mixing together in neighbourhoods. It compares neighbourhood (census tract, CT) discrete income distributions to a base distribution, which is the income quintiles of the neighbourhood’s census metropolitan area (CMA).

    Release date: 2020-06-22
Analysis (180)

Analysis (180) (40 to 50 of 180 results)

  • Stats in brief: 89-20-00062021003
    Description:

    In this video, viewers will learn the differences between three types of measure: proportions, ratios, and rates. In addition, viewers by the end of this video will be able to determine how each measure is calculated and when it is best to use one measure rather than the other.

    Release date: 2021-05-03

  • Stats in brief: 89-20-00062021004
    Description:

    One important distinction we will make in this video is the differences between Data Science, Artificial Intelligence and Machine Learning. You'll learn what machine learning can be used for, how it works, and some different methods for doing it. And you'll also learn how to build and use machine learning processes responsibly.

    This video is recommended for those who already have some familiarity with the concepts and techniques associated with computer programming and using algorithms to analyze data.

    Release date: 2021-05-03

  • Stats in brief: 89-20-00062021005
    Description:

    By the end of this video, you should have a deeper understanding of the fundamentals of using data to tell a story. We will go over some the principle components of storytelling including the data, the narrative and visualization, and discuss how they can be used to construct concise, informative and interesting messages your audience can trust. And then, you will learn the importance of a well planned data story, which includes learning who your audience will be, what they should know and how to best deliver that information.

    Release date: 2021-05-03

  • Stats in brief: 89-20-00062021006
    Description:

    In this video, you'll learn what we can do to data itself, to make it easier to work with. That's the role of data standards. And you'll learn what extra information we can provide to make data easier to use. That's the role of metadata.

    Release date: 2021-05-03

  • Articles and reports: 12-001-X202000200005
    Description:

    In surveys, text answers from open-ended questions are important because they allow respondents to provide more information without constraints. When classifying open-ended questions automatically using supervised learning, often the accuracy is not high enough. Alternatively, a semi-automated classification strategy can be considered: answers in the easy-to-classify group are classified automatically, answers in the hard-to-classify group are classified manually. This paper presents a semi-automated classification method for multi-label open-ended questions where text answers may be associated with multiple classes simultaneously. The proposed method effectively combines multiple probabilistic classifier chains while avoiding prohibitive computational costs. The performance evaluation on three different data sets demonstrates the effectiveness of the proposed method.

    Release date: 2020-12-15

  • Articles and reports: 11-637-X202000100001
    Description: As the first goal outlined in the 2030 Agenda for Sustainable Development, Canada and other UN member states have committed to end poverty in all its forms everywhere by 2030. This 2020 infographic provides an overview of indicators underlying the first Sustainable Development Goal in support of eradicating poverty, and the statistics and data sources used to monitor and report on this goal in Canada.
    Release date: 2020-10-20

  • Articles and reports: 11-637-X202000100002
    Description: As the second goal outlined in the 2030 Agenda for Sustainable Development, Canada and other UN member states have committed to end hunger, achieve food security and improved nutrition, and promote sustainable agriculture by 2030. This 2020 infographic provides an overview of indicators underlying the second Sustainable Development Goal in support of ending hunger, and the statistics and data sources used to monitor and report on this goal in Canada.
    Release date: 2020-10-20

  • Articles and reports: 11-637-X202000100003
    Description: As the third goal outlined in the 2030 Agenda for Sustainable Development, Canada and other UN member states have committed to ensure healthy lives and promote well-being for all at all ages by 2030. This 2020 infographic provides an overview of indicators underlying the third Sustainable Development Goal in support of Good Health and Well-being, and the statistics and data sources used to monitor and report on this goal in Canada.
    Release date: 2020-10-20

  • Articles and reports: 11-637-X202000100004
    Description: As the fourth goal outlined in the 2030 Agenda for Sustainable Development, Canada and other UN member states have committed to ensure inclusive and equitable quality education and promote lifelong learning opportunities for all by 2030. This 2020 infographic provides an overview of indicators underlying the fourth Sustainable Development Goal in support of Quality Education, and the statistics and data sources used to monitor and report on this goal in Canada.
    Release date: 2020-10-20

  • Articles and reports: 11-637-X202000100005
    Description: As the fifth goal outlined in the 2030 Agenda for Sustainable Development, Canada and other UN member states have committed to achieve gender equality and empower all women and girls by 2030. This 2020 infographic provides an overview of indicators underlying the fifth Sustainable Development Goal in support of Gender Equality, and the statistics and data sources used to monitor and report on this goal in Canada.
    Release date: 2020-10-20
Reference (7)

Reference (7) ((7 results))

  • Surveys and statistical programs – Documentation: 84-538-X
    Geography: Canada
    Description: This electronic publication presents the methodology underlying the production of the life tables for Canada, provinces and territories.
    Release date: 2023-08-28

  • Surveys and statistical programs – Documentation: 82-225-X200701010508
    Description:

    The Record Linkage Overview describes the process used in annual internal record linkage of the Canadian Cancer Registry. The steps include: preparation; pre-processing; record linkage; post-processing; analysis and resolution; resolution entry; and, resolution processing.

    Release date: 2008-01-18

  • Surveys and statistical programs – Documentation: 11-522-X20050019476
    Description:

    The paper will show how, using data published by Statistics Canada and available from member libraries of the CREPUQ, a linkage approach using postal codes makes it possible to link the data from the outcomes file to a set of contextual variables. These variables could then contribute to producing, on an exploratory basis, a better index to explain the varied outcomes of students from schools. In terms of the impact, the proposed index could show more effectively the limitations of ranking students and schools when this information is not given sufficient weight.

    Release date: 2007-03-02

  • Surveys and statistical programs – Documentation: 68-514-X
    Description:

    Statistics Canada's approach to gathering and disseminating economic data has developed over several decades into a highly integrated system for collection and estimation that feeds the framework of the Canadian System of National Accounts.

    The key to this approach was creation of the Unified Enterprise Survey, the goal of which was to improve the consistency, coherence, breadth and depth of business survey data.

    The UES did so by bringing many of Statistics Canada's individual annual business surveys under a common framework. This framework included a single survey frame, a sample design framework, conceptual harmonization of survey content, means of using relevant administrative data, common data collection, processing and analysis tools, and a common data warehouse.

    Release date: 2006-11-20

  • Surveys and statistical programs – Documentation: 89-612-X
    Description:

    This paper describes the structure and linkage of two databases: the Longitudinal Administrative Databank (LAD), and the Longitudinal Immigration Database (IMDB). The combined data associate landed immigrant taxfilers on the LAD with their key characteristics upon immigration. The paper highlights how the combined information, referred to here as the LAD_IMDB, enhances and complements the existing separate databases. The paper compares the full IMDB file with the sample of immigrants to assess the representativeness of the sample file.

    Release date: 2004-01-05

  • Surveys and statistical programs – Documentation: 81-595-M2003005
    Geography: Canada
    Description:

    This paper develops technical procedures that may enable ministries of education to link provincial tests with national and international tests in order to compare standards and report results on a common scale.

    Release date: 2003-05-29

  • Surveys and statistical programs – Documentation: 85-602-X
    Description:

    The purpose of this report is to provide an overview of existing methods and techniques making use of personal identifiers to support record linkage. Record linkage can be loosely defined as a methodology for manipulating and / or transforming personal identifiers from individual data records from one or more operational databases and subsequently attempting to match these personal identifiers to create a composite record about an individual. Record linkage is not intended to uniquely identify individuals for operational purposes; however, it does provide probabilistic matches of varying degrees of reliability for use in statistical reporting. Techniques employed in record linkage may also be of use for investigative purposes to help narrow the field of search against existing databases when some form of personal identification information exists.

    Release date: 2000-12-05
Date modified: