Other content related to Statistical methods

Filter results by

Search Help
Currently selected filters that can be removed

Keyword(s)

Geography

2 facets displayed. 0 facets selected.

Content

1 facets displayed. 0 facets selected.
Sort Help
entries

Results

All (163)

All (163) (0 to 10 of 163 results)

  • Articles and reports: 11-522-X202200100002
    Description: The authors used the Splink probabilistic linkage package developed by the UK Ministry of Justice, to link census data from England and Wales to itself to find duplicate census responses. A large gold standard of confirmed census duplicates was available meaning that the results of the Splink implementation could be quality assured. This paper describes the implementation and features of Splink, gives details of the settings and parameters that we used to tune Splink for our particular project, and gives the results that we obtained.
    Release date: 2024-03-25

  • Articles and reports: 11-522-X202200100017
    Description: In this paper, we look for presence of heterogeneity in conducting impact evaluations of the Skills Development intervention delivered under the Labour Market Development Agreements. We use linked longitudinal administrative data covering a sample of Skills Development participants from 2010 to 2017. We apply a causal machine-learning estimator as in Lechner (2019) to estimate the individualized program impacts at the finest aggregation level. These granular impacts reveal the distribution of net impacts facilitating further investigation as to what works for whom. The findings suggest statistically significant improvements in labour market outcomes for participants overall and for subgroups of policy interest.
    Release date: 2024-03-25

  • Articles and reports: 11-522-X202200100020
    Description: The reconciliation of 2021 census dwellings with the new Statistical Building Register (SBgR) presented linkage challenges. The Census of Population collected information from various dwelling types. For a large proportion of the population, mailing addresses were at the centre: they were used for reaching out to people and collected as contact info. In parallel, the register environment has been evolving. The agency is transitioning from the Address Register (AR) to the SBgR holding both mailing and location addresses, while also covering non-residential buildings. The reconciliation was conducted using a combination of systems, notably the new Register Matching Engine (RME) for difficult cases. The RME holds an interesting range of sophisticated string comparators. A deterministic linkage approach was used, while incorporating some data knowledge like the entropy. Through metadata, the matching expert could also reduce the amounts of false positives and false negatives.
    Release date: 2024-03-25

  • Journals and periodicals: 11-522-X
    Description: Since 1984, an annual international symposium on methodological issues has been sponsored by Statistics Canada. Proceedings have been available since 1987.
    Release date: 2024-03-25

  • Journals and periodicals: 12-001-X
    Geography: Canada
    Description: The journal publishes articles dealing with various aspects of statistical development relevant to a statistical agency, such as design issues in the context of practical constraints, use of different data sources and collection techniques, total survey error, survey evaluation, research in survey methodology, time series analysis, seasonal adjustment, demographic studies, data integration, estimation and data analysis methods, and general survey systems development. The emphasis is placed on the development and evaluation of specific methodologies as applied to data collection or the data themselves.
    Release date: 2024-01-03

  • Articles and reports: 82-003-X202301200002
    Description: The validity of survival estimates from cancer registry data depends, in part, on the identification of the deaths of deceased cancer patients. People whose deaths are missed seemingly live on forever and are informally referred to as “immortals”, and their presence in registry data can result in inflated survival estimates. This study assesses the issue of immortals in the Canadian Cancer Registry (CCR) using a recently proposed method that compares the survival of long-term survivors of cancers for which “statistical” cure has been reported with that of similar people from the general population.
    Release date: 2023-12-20

  • Journals and periodicals: 12-206-X
    Description: This report summarizes the annual achievements of the Methodology Research and Development Program (MRDP) sponsored by the Modern Statistical Methods and Data Science Branch at Statistics Canada. This program covers research and development activities in statistical methods with potentially broad application in the agency’s statistical programs; these activities would otherwise be less likely to be carried out during the provision of regular methodology services to those programs. The MRDP also includes activities that provide support in the application of past successful developments in order to promote the use of the results of research and development work. Selected prospective research activities are also presented.
    Release date: 2023-10-11

  • 19-22-0011
    Description: An introduction to the role geography plays in Statistics Canada data. Viewers will learn about the different geographic levels Statistics Canada uses and how they are related, as well as two products - GeoSuite and GeoSearch - that the public can use to find detailed information for any place in Canada. Two case studies will be shown to demonstrate applications of these two products.

    https://www.statcan.gc.ca/en/services/webinars/19220011 
    Release date: 2023-09-12

  • Articles and reports: 75F0002M2022003
    Description: This discussion paper describes the proposed methodology for a Northern Market Basket Measure (MBM-N) for Nunavut, as well as identifies research which could be conducted in preparation for the 2023 review. The paper presents initial MBM-N thresholds and provides preliminary poverty estimates for reference years 2018 to 2021. A review period will follow the release of this paper, during which time Statistics Canada and Employment and Social Development Canada will welcome feedback from interested parties and work with experts, stakeholders, indigenous organizations, federal, provincial and territorial officials to validate the results.
    Release date: 2023-06-21

  • Surveys and statistical programs – Documentation: 75-514-G
    Description: The Guide to the Job Vacancy and Wage Survey contains a dictionary of concepts and definitions, and covers topics such as survey methodology, data collection, processing, and data quality. The guide covers both components of the survey: the job vacancy component, which is quarterly, and the wage component, which is annual.
    Release date: 2023-05-25
Data (1)

Data (1) ((1 result))

  • Table: 82-567-X
    Description:

    The National Population Health Survey (NPHS) is designed to enhance the understanding of the processes affecting health. The survey collects cross-sectional as well as longitudinal data. In 1994/95 the survey interviewed a panel of 17,276 individuals, then returned to interview them a second time in 1996/97. The response rate for these individuals was 96% in 1996/97. Data collection from the panel will continue for up to two decades. For cross-sectional purposes, data were collected for a total of 81,000 household residents in all provinces (except people on Indian reserves or on Canadian Forces bases) in 1996/97.

    This overview illustrates the variety of information available by presenting data on perceived health, chronic conditions, injuries, repetitive strains, depression, smoking, alcohol consumption, physical activity, consultations with medical professionals, use of medications and use of alternative medicine.

    Release date: 1998-07-29
Analysis (103)

Analysis (103) (20 to 30 of 103 results)

  • Articles and reports: 11-522-X202100100002
    Description:

    A framework for the responsible use of machine learning processes has been developed at Statistics Canada. The framework includes guidelines for the responsible use of machine learning and a checklist, which are organized into four themes: respect for people, respect for data, sound methods, and sound application. All four themes work together to ensure the ethical use of both the algorithms and results of machine learning. The framework is anchored in a vision that seeks to create a modern workplace and provide direction and support to those who use machine learning techniques. It applies to all statistical programs and projects conducted by Statistics Canada that use machine learning algorithms. This includes supervised and unsupervised learning algorithms. The framework and associated guidelines will be presented first. The process of reviewing projects that use machine learning, i.e., how the framework is applied to Statistics Canada projects, will then be explained. Finally, future work to improve the framework will be described.

    Keywords: Responsible machine learning, explainability, ethics

    Release date: 2021-10-15

  • Articles and reports: 11-522-X202100100003
    Description:

    The increasing size and richness of digital data allow for modeling more complex relationships and interactions, which is the strongpoint of machine learning. Here we applied gradient boosting to the Dutch system of social statistical datasets to estimate transition probabilities into and out of poverty. Individual estimates are reasonable, but the main advantages of the approach in combination with SHAP and global surrogate models are the simultaneous ranking of hundreds of features by their importance, detailed insight into their relationship with the transition probabilities, and the data-driven identification of subpopulations with relatively high and low transition probabilities. In addition, we decompose the difference in feature importance between general and subpopulation into a frequency and a feature effect. We caution for misinterpretation and discuss future directions.

    Key Words: Classification; Explainability; Gradient boosting; Life event; Risk factors; SHAP decomposition.

    Release date: 2021-10-15

  • Articles and reports: 11-522-X202100100019
    Description: Official statistical agencies must continually seek new methods and techniques that can increase both program efficiency and product relevance. The U.S. Census Bureau’s measurement of construction activity is currently a resource-intensive endeavor, relying heavily on monthly survey response via questionnaires and extensive field data collection. While our data users continually require more timely and granular data products, the traditional survey approach and associated collection cost and respondent burden limits our ability to meet that need. In 2019, we began research on whether the application of machine learning techniques to satellite imagery could accurately estimate housing starts and completions while meeting existing monthly indicator timelines at a cost equal to or less than existing methods. Using historical Census construction survey data in combination with targeted satellite imagery, the team trained, tested, and validated convolutional neural networks capable of classifying images by their stage of construction demonstrating the viability of a data science-based approach to producing official measures of construction activity.

    Key Words: Official Statistics; Housing Starts, Machine Learning, Satellite Imagery

    Release date: 2021-10-15

  • Stats in brief: 89-20-00062020002
    Description:

    This video is intended to teach viewers the differences between three fundamental statistical concepts. First, the mean, then the median and finally, the mode.

    Release date: 2021-05-03

  • Stats in brief: 89-20-00062020003
    Description:

    In this module, we will explore the concept of dispersion, also called variability. This concept includes: the range, the interquartile range, the standard deviation and the normal distribution.

    Release date: 2021-05-03

  • Stats in brief: 11-001-X202104628783
    Description: Release published in The Daily – Statistics Canada’s official release bulletin
    Release date: 2021-02-15

  • Articles and reports: 18-001-X2020001
    Description:

    This paper presents the methodology used to generate the first nationwide database of proximity measures and the results obtained with a first set of ten measures. The computational methods are presented as a generalizable model due to the fact that it is now possible to apply similar methods to a multitude of other services or amenities, in a variety of alternative specifications.

    Release date: 2021-02-15

  • Stats in brief: 11-627-M2020072
    Description:

    This infographic provides an overview of the Canadian Research and Development Classification (CRDC), a national standard jointly developed by the Canada Foundation for Innovation (CFI), the Canadian Institutes of Health Research (CIHR), the Natural Sciences and Engineering Research Council of Canada (NSERC), the Social Sciences and Humanities Research Council of Canada (SSHRC), and Statistics Canada.

    Release date: 2020-10-05

  • Stats in brief: 89-20-00062020006
    Description:

    The data terminology and concepts covered in this video are datasets, databases, data protection, data variables, micro and macro data, and statistical information.

    Release date: 2020-09-23

  • Stats in brief: 89-20-00062020007
    Description:

    In this video you will learn about the steps and activities in the data journey, as well as the foundation supporting it.

    Release date: 2020-09-23
Reference (54)

Reference (54) (30 to 40 of 54 results)

  • Surveys and statistical programs – Documentation: 62F0026M2009002
    Geography: Province or territory
    Description:

    This guide presents information of interest to users of data from the Survey of Household Spending, which gathers information on the spending habits, dwelling characteristics and household equipment of Canadian households. The survey covers private households in the 10 provinces. (The territories are surveyed every second year, starting in 1999.)

    This guide includes definitions of survey terms and variables, as well as descriptions of survey methodology and data quality. One section describes the various statistics that can be created using expenditure data (e.g., budget share, market share, aggregates and medians).

    Release date: 2009-12-18

  • Surveys and statistical programs – Documentation: 16-001-M2009007
    Description:

    In this paper, we present the methodology developed by Statistics Canada to calculate the average annual water yield for Canada. Water yield, for the purposes of this paper, is defined as the amount of freshwater derived from unregulated flow (m3 s-1) measurements for a given geographic area over a defined period of time. The methodology is applied to the 1971 to 2000 time period.

    This research was conducted to fill data gaps in Statistics Canada's water statistics program. These gaps exist because estimates of freshwater flow for Canada have not been calculated regularly and have been produced using a variety of methods that do not necessarily generate comparable results. The methodology developed in this study produced results that are coherent through space and time. These results will be used in the future to investigate changes in water yield on a more disaggregated basis.

    To achieve the water yield estimate a database of natural streamflow observations from 1971 to 2000 was compiled. The streamflow values were then converted to a runoff depth and interpolated using ordinary kriging to produce spatial estimates of runoff. The spatial estimates were then scaled to create a National estimate of water yield as a thirty-year average. The methodology and results were then validated using a stability analysis and several techniques involving uncertainty. The result of the methodology indicates that the thirty-year average water yield for Canada is 3435 km3.

    Release date: 2009-06-01

  • Surveys and statistical programs – Documentation: 92-569-X2006001
    Description:

    The 2006 Census Technical Report on Aboriginal Peoples deals with: (i) Aboriginal ancestry, (ii) Aboriginal identity, (iii) registered Indian status, and (iv) First Nation or Band membership. The report aims to inform users about the complexity of the data and any difficulties that could affect their use. It explains the conceptual framework and definitions used to gather the data, and it discusses factors that could affect data quality. The historical comparability of the data is also discussed.

    Release date: 2009-05-12

  • Surveys and statistical programs – Documentation: 91F0015M2008010
    Geography: Canada
    Description:

    The objective of this study is to examine the feasibility of using provincial and territorial health care files of new registrants as an independent measure of preliminary inter-provincial and inter-territorial migration. The study aims at measuring the conceptual and quantifiable differences between this data source and our present source of the Canada Revenue Agency's Canadian Child Tax Benefit.

    Criteria were established to assess the quality and appropriateness of these provincial/territorial health care records as a proxy for our migration estimates: coverage, consistency, timeliness, reliability, level of detail, uniformity and accuracy.

    Based on the present analysis, the paper finds that these data do not ameliorate the estimates and would not be suitable at this time as a measure of inter-provincial/territorial migration. These Medicare data though are an important independent data source that can be used for quality evaluation.

    Release date: 2009-01-13

  • Surveys and statistical programs – Documentation: 16-001-M2007003
    Description:

    The objective of the present study is to understand and explain how the Canadian Council of Ministers of the Environment (CCME) Water Quality Index (WQI) behaves, and at the same time determine its limitations to make a better use of it in the future. In order to do so, four data sets were made available to us thanks to participation of the following provinces: Newfoundland, Ontario, British Columbia and Quebec.

    Release date: 2007-09-19

  • Surveys and statistical programs – Documentation: 12-592-X
    Geography: Canada
    Description:

    This reference document presents an overview of the different questions used by Statistics Canada to identify Aboriginal peoples. It is divided into three parts. Part one is a brief description of the data sources and their limitations. Part 2 deals with the 2006 census questions used to identify Aboriginal peoples while Part 3 deals with the identification questions used in the Aboriginal Peoples Survey (APS) and the Aboriginal Children's Survey (ACS).

    Release date: 2007-06-07

  • Surveys and statistical programs – Documentation: 62F0026M2006001
    Geography: Province or territory
    Description:

    This guide presents information of interest to users of data from the Survey of Household Spending, which gathers information on the spending habits, dwelling characteristics and household equipment of Canadian households. The survey covers private households in the 10 provinces. (The territories are surveyed every second year, starting in 1999.)

    This guide includes definitions of survey terms and variables, as well as descriptions of survey methodology and data quality. One section describes the various statistics that can be created using expenditure data (e.g., budget share, market share, aggregates and medians).

    Release date: 2006-12-12

  • Notices and consultations: 87-004-X20030039213
    Description:

    The Culture Statistics Program (CSP) has been Statistic Canada's chief source for analysis of the culture sector since the program's inception in 1972 and this role will continue. However, the CSP is making substantial changes to the way it collects culture data and, in effect, the data themselves. This article is intended to inform users of these data, of the scope of these upcoming changes and how the CSP is managing the challenges presented by this transition.

    Release date: 2006-06-12

  • Surveys and statistical programs – Documentation: 62F0026M2005007
    Geography: Province or territory
    Description:

    This guide presents information of interest to users of data from the Survey of Household Spending, which gathers information on the spending habits, dwelling characteristics and household equipment of Canadian households. The survey covers private households in the 10 provinces. (The territories are surveyed every second year, starting in 1999.)

    This guide includes definitions of survey terms and variables, as well as descriptions of survey methodology and data quality. One section describes the various statistics that can be created using expenditure data (e.g., budget share, market share, aggregates and medians).

    Release date: 2005-12-12

  • Surveys and statistical programs – Documentation: 62F0026M2005004
    Description:

    The Food Expenditure Survey (FES) is a periodic survey collecting data from households on food spending habits. Data are collected mainly using weekly diaries of purchases that the respondents must fill in daily during two consecutive weeks.

    This paper presents a detailed description of the methodology of this survey. First, we briefly described the sample design which is mainly based on the plan of the Labour Force Survey. Then we present the methods of collection, data processing, weighting, and variance estimation, as well as the suppression of unreliable data in the tables of estimates.

    Release date: 2005-07-08
Date modified: