Filter results by

Search Help
Currently selected filters that can be removed

Keyword(s)

Year of publication

1 facets displayed. 1 facets selected.

Survey or statistical program

122 facets displayed. 0 facets selected.

Content

1 facets displayed. 0 facets selected.
Sort Help
entries

Results

All (401)

All (401) (50 to 60 of 401 results)

  • Articles and reports: 62F0014M2021016
    Description:

    Using various sources of expenditure data, Statistics Canada, in partnership with the Bank of Canada, has estimated monthly adjusted consumer expenditure weights that reflect shifts in consumption patterns as the COVID-19 pandemic evolves. The Adjusted price index has been updated to incorporate the 2020 basket weights and is now based on a Similarity-linked Fisher price index formula. The expenditure data cover all goods and services in the Consumer Price Index (CPI), and provide snapshot estimates of expenditure weights for June, July, August and September 2021. These estimates can provide insight into the impact of COVID-19 on the headline CPI.

    Release date: 2021-11-10

  • Articles and reports: 11F0019M2021008
    Description:

    A robust Internet typology is essential for monitoring how individuals are responding to the digital transformation and for assessing the divide between digital ‘haves’ and ‘have nots.’ Individuals’ capacities to use the Internet and digital technologies are an important aspect of this digital divide. Using data from the 2018 Canadian Internet Use Survey (CIUS), this study presents an Internet-use typology that is based on the range and complexity of online activities and digital skills that Canadians report they perform. Five Internet-user groups are identified.

    Release date: 2021-11-09

  • Articles and reports: 81-595-M2021005
    Description:

    Using data from Statistics Canada's Education and Labour Market Longitudinal Platform, this study examines populations designated as visible minorities in the skilled trades. The labour market outcomes one year after certification of journeypersons designated as visible minorities, who received certificates in the skilled trades between 2008 and 2017, were compared with the outcomes of journeypersons who are not visible minorities.

    Release date: 2021-11-08

  • Articles and reports: 81-595-M2021006
    Description:

    The COVID-19 pandemic had large impacts on those in the skilled trades, as these jobs often require hands-on and close-proximity interactions. Using data from the Education and Labour Market Longitudinal Platform, this study examines the pandemic's impacts by investigating the proportion of journeypersons who received the CERB among those who certified between 2008 and 2019. By examining the proportions across trades, geography and population groups, this study can provide further insight into how the pandemic affected those in the skilled trades and how the impacts were different across trades and groups.

    Release date: 2021-11-08

  • Articles and reports: 11-522-X202100100010
    Description:

    As part of processing for the 2021 Canadian Census, the write-in responses to 31 census questions must be coded. Up until, and including, 2016, this was a three stage process, including an “interactive (human) coding” step as the second stage. This human coding step is both lengthy and expensive, spanning many months and requiring the hiring and training of a large number of temporary employees. With this in mind, for 2021, this stage was either augmented with or replaced entirely by machine learning models using the "fastText" algorithm. This presentation will discuss the implementation of this algorithm and the challenges and decisions taken along the way.

    Key Words: Natural Language Processing, Machine Learning, fastText, Coding

    Release date: 2021-11-05

  • Articles and reports: 11-522-X202100100011
    Description: The ways in which AI may affect the world of official statistics are manifold and Statistics Netherlands (CBS) is actively exploring how it can use AI within its societal role. The paper describes a number of AI-related areas where CBS is currently active: use of AI for its own statistics production and statistical R&D, the development of a national AI monitor, the support of other government bodies with expertise on fair data and fair algorithms, data sharing under safe and secure conditions, and engaging in AI-related collaborations.

    Key Words: Artificial Intelligence; Official Statistics; Data Sharing; Fair Algorithms; AI monitoring; Collaboration.

    Release date: 2021-11-05

  • Articles and reports: 11-522-X202100100012
    Description: The modernization of price statistics by National Statistical Offices (NSO) such as Statistics Canada focuses on the adoption of alternative data sources that include the near-universe of all products sold in the country, a scale that requires machine learning classification of the data. The process of evaluating classifiers to select appropriate ones for production, as well as monitoring classifiers once in production, needs to be based on robust metrics to measure misclassification. As commonly utilized metrics, such as the Fß-score may not take into account key aspects applicable to prices statistics in all cases, such as unequal importance of categories, a careful consideration of the metric space is necessary to select appropriate methods to evaluate classifiers. This working paper provides insight on the metric space applicable to price statistics and proposes an operational framework to evaluate and monitor classifiers, focusing specifically on the needs of the Canadian Consumer Prices Index and demonstrating discussed metrics using a publicly available dataset.

    Key Words: Consumer price index; supervised classification; evaluation metrics; taxonomy

    Release date: 2021-11-05

  • Articles and reports: 11-522-X202100100013
    Description: Statistics Canada’s Labour Force Survey (LFS) plays a fundamental role in the mandate of Statistics Canada. The labour market information provided by the LFS is among the most timely and important measures of the Canadian economy’s overall performance. An integral part of the LFS monthly data processing is the coding of respondent’s industry according to the North American Industrial Classification System (NAICS), occupation according to the National Occupational Classification System (NOC) and the Primary Class of Workers (PCOW). Each month, up to 20,000 records are coded manually. In 2020, Statistics Canada worked on developing Machine Learning models using fastText to code responses to the LFS questionnaire according to the three classifications mentioned previously. This article will provide an overview on the methodology developed and results obtained from a potential application of the use of fastText into the LFS coding process. 

    Key Words: Machine Learning; Labour Force Survey; Text classification; fastText.

    Release date: 2021-11-05

  • Articles and reports: 11-522-X202100100029
    Description:

    In line with the path taken by the European Statistical System, Istat is investing on innovative methods to harness Big Data sources and to use them for the production of new and enriched Official Statistics products. Big Data sources are not, in general, directly tractable with traditional statistical techniques, just think of specific data types such as images and texts that are examples of the Variety dimension of Big Data. This motivates and justifies the growing interest of National Statistical Institutes in data science techniques. Istat is currently using data science techniques, including machine learning techniques, in innovation projects and for the publication of experimental statistics. This paper will provide an overview of the main current projects by Istat and will focus on two specific Big Data-based production pipelines, related to the processing of respectively text sources and imagery sources. The paper will highlight the main challenges these two pipelines and the solutions put in place to solve them.

    Key Words: Machine Learning; Text Processing; Image Processing; Big Data

    Release date: 2021-11-05

  • Stats in brief: 11-627-M2021081
    Description: The data on natural wealth presented reflects the proven & probable reserves for select minerals and established active reserves for energy (2020 preliminary).
    Release date: 2021-11-01
Stats in brief (145)

Stats in brief (145) (0 to 10 of 145 results)

  • Stats in brief: 45-20-00032021004
    Description:

    The age old practice of working with one’s hands in skill and time honoured endeavors has seen a resurgence during the pandemic as people seek out ways of keeping busy, exploring a fleeting interest or honing their skills on a professional level. But there’s much more to it, according to Meagen Black, director of the Canadian Crafts Federation. We discuss the arts and crafts movement across Canada, its renaissance and its necessity.

    Release date: 2021-12-22

  • Stats in brief: 45-28-0001202100100044
    Description:

    While all businesses in Canada have been impacted by the COVID-19 pandemic, different types of businesses have been impacted in different ways. This is also true for different communities that operate businesses in Canada. This article explores results from the Canadian Survey on Business Conditions by looking at the businesses majority-owned by women, First Nations, Métis or Inuit persons, immigrants to Canada and visible minorities in the fourth quarter of 2021.

    Release date: 2021-12-09

  • Stats in brief: 11-627-M2021092
    Description:

    This infographic provides a high-level overview of Statistics Canada’s Disaggregated Data Action Plan, which will produce detailed statistical information on specific population groups. This plan is essential to highlight the lived experiences of diverse groups of people in Canada, such as women, Indigenous peoples, racialized populations and people living with disabilities.

    Release date: 2021-12-08

  • Stats in brief: 45-20-00032021003
    Description:

    The pandemic presented a long list of threats to the mental and physical well-being of children, parents and educators across the nation. To move towards the end of the pandemic and a return to normalcy, difficult choices such as closing schools had to be made by policy-makers. With our guest Dr. Kelley Zwicker, a pediatric doctor at the Children’s Hospital of Eastern Ontario (CHEO), we discussed the potential short- and long-term effects of the school closures on students and their parents.

    Release date: 2021-12-07

  • Stats in brief: 45-28-0001202100100040
    Description:

    This article analyzes the Canada Emergency Wage Subsidy Regional and Community-level Database from a rural business perspective. This database covers the period from October 25, 2020 to January 16, 2021. It is based on Canada Revenue Agency (CRA) Canada Emergency Wage Subsidy (CEWS) microdata and administrative data sources available within Statistics Canada. Topics include number of CEWS supported employees and subsidy amounts in rural areas, comparison of rural and urban businesses, and analysis by industry and province/territory.

    Release date: 2021-12-06

  • Stats in brief: 45-28-0001202100100042
    Description:

    Non-profit organizations represent a significant portion of the Canadian economy. This article provides insights on the expectations of non-profit organizations as well as the specific realities faced by these organizations. It involves an examination of the data produced by the Canadian Survey on Business Conditions.

    Release date: 2021-12-06

  • Stats in brief: 11-627-M2021072
    Description:

    This infographic features indicators on apprenticeship programs across Canada. It presents the year-over-year changes in new registrations and certifications amongst trade groups and jurisdictions in 2020. This infographic will also highlight some of the impacts of COVID-19 on apprenticeship programs.

    Release date: 2021-12-06

  • Stats in brief: 11-627-M2021091
    Description:

    Beneficiaries and value of support of federal business innovation and growth support programs, by employment size, revenue size, industry and gender of primary owner, 2019.

    Release date: 2021-12-02

  • Stats in brief: 45-28-0001202100100041
    Description:

    This article uses data from the first series of the Canadian Social Survey - COVID-19 and Well-being (CSS-CW) to examine whether persons aged 15 to 49 made changes to their fertility plans because of the COVID-19 pandemic. Changes to fertility intentions are explored, including those related to the timing of childbearing and those impacting the number of desired children. Lastly, we examine to what extent persons having certain sociodemographic characteristics were more or less likely to adjust their fertility plans in response to the pandemic.

    Release date: 2021-12-01

  • Stats in brief: 11-627-M2021083
    Description:

    Using data from the 2017 Canadian Survey on Disability (CSD) and the 2020 Canada Emergency Response Benefit (CERB) program, this infographic provides a profile of Canadian workers with disabilities who received CERB payments during the period from March 15 to September 26, 2020. The focus of the analysis is on workers who had employment or self-employment income of at least $5,000 in 2019.

    Release date: 2021-12-01
Articles and reports (254)

Articles and reports (254) (40 to 50 of 254 results)

  • Articles and reports: 11-522-X202100100009
    Description:

    Use of auxiliary data to improve the efficiency of estimators of totals and means through model-assisted survey regression estimation has received considerable attention in recent years. Generalized regression (GREG) estimators, based on a working linear regression model, are currently used in establishment surveys at Statistics Canada and several other statistical agencies.  GREG estimators use common survey weights for all study variables and calibrate to known population totals of auxiliary variables. Increasingly, many auxiliary variables are available, some of which may be extraneous. This leads to unstable GREG weights when all the available auxiliary variables, including interactions among categorical variables, are used in the working linear regression model. On the other hand, new machine learning methods, such as regression trees and lasso, automatically select significant auxiliary variables and lead to stable nonnegative weights and possible efficiency gains over GREG.  In this paper, a simulation study, based on a real business survey sample data set treated as the target population, is conducted to study the relative performance of GREG, regression trees and lasso in terms of efficiency of the estimators.

    Key Words: Model assisted inference; calibration estimation; model selection; generalized regression estimator.

    Release date: 2021-10-29

  • Articles and reports: 11-522-X202100100018
    Description: Statistics Finland started publishing nowcasts of the trend indicator of output (TIO), the monthly indicator of real economic activity, to answer users´ needs during the Covid-19 pandemic. The indicator was first published in April 2020, at the very beginning of the pandemic in Finland, and had a monthly release schedule until June 2021. The TIO nowcasts are produced using open-source data on truck traffic volumes at about 100 automatic measuring points in the Helsinki/Uusimaa -region and the Economic Sentiment Indicator for Finland. Estimation is done using a machine learning approach and the methodology is based on previous work done by Statistics Finland and ETLA Economic Research.

    Key Words: nowcasting; flash estimates; machine learning; experimental statistics.

    Release date: 2021-10-29

  • Articles and reports: 11-522-X202100100024
    Description: The Economic Directorate of the U.S. Census Bureau is developing coordinated design and sample selection procedures for the Annual Integrated Economic Survey. The unified sample will replace the directorate’s existing practice of independently developing sampling frames and sampling procedures for a suite of separate annual surveys, which optimizes sample design features at the cost of increased response burden. Size attributes of business populations, e.g., revenues and employment, are highly skewed. A high percentage of companies operate in more than one industry. Therefore, many companies are sampled into multiple surveys compounding the response burden, especially for “medium sized” companies.

    This component of response burden is reduced by selecting a single coordinated sample but will not be completely alleviated. Response burden is a function of several factors, including (1) questionnaire length and complexity, (2) accessibility of data, (3) expected number of repeated measures, and (4) frequency of collection. The sample design can have profound effects on the third and fourth factors. To help inform decisions about the integrated sample design, we use regression trees to identify covariates from the sampling frame that are related to response burden. Using historic frame and response data from four independently sampled surveys, we test a variety of algorithms, then grow regression trees that explain relationships between expected levels of response burden (as measured by response rate) and frame covariates common to more than one survey. We validate initial findings by cross-validation, examining results over time. Finally, we make recommendations on how to incorporate our robust findings into the coordinated sample design.
    Release date: 2021-10-29

  • Articles and reports: 11-522-X202100100025
    Description:

    We propose a longitudinal analysis with a point of view connected to the organizational changes that have taken place in the Italian National Institute of Statistics in recent years. In 2016 the Institute introduced a new Directorate, intending to standardize and generalize the business process of Data Collection according to the European standard of the GAMSO model. The paper discusses the pros and cons of this change from the perspective of the survey's participation. The ICT survey response rate analysis demonstrates an increase of around 20% since the beginning of the new organization: the paper tries to focus on the impact of the changes introduced with the new organization. We focused our attention on two specific subsets of respondents - the so-called "wanted" - the ones who have never answered to an ICT survey or to any other Istat survey and - the so-called “lost” - the ones included in two consecutive survey’s samples and that answered in the previous edition but not in the current one. The paper aims to illustrate how an efficient organization of data collection reflects its benefits on survey results and what kind of actions should be taken to catch the attention of the "wanted". Finally, we apply a logistic model measuring the probability that an enterprise responding in 2018 (t-1) also answered in 2019 (t). All the analysis suggests some actions that could be taken to improve respondents' participation, data quality, and respondents' perception of the official statistics.

    Key Words: data collection strategy, response rate, paradata, response burden, ICT Survey.

    Release date: 2021-10-29

  • Articles and reports: 11-522-X202100100026
    Description:

    The Government of Canada’s Directive on Open Government aims to ensure that Canadians have greater access to government data and information. One solution for open data is smart synthetic files, which retain as much analytical value as possible and take into account confidentiality issues that arise from collecting personal information. In recent years, Statistics Canada has acquired a recognized expertise in producing synthetic data files of high analytical value. In a current project, Statistics Canada is tackling a new challenge to synthesize a database and preserve hierarchical structures in the form of families, where records are linked and share common traits that must be maintained. These challenges are also encountered when synthesizing structured data such as business data. This paper presents the challenges and solutions for building synthetic data with such hierarchical structures. Application of this strategy will be illustrated with the development of a synthetic database that supports the development of retirement income policies. This database includes over 20 variables and 8 million records structured into approximately 4 million family units. We will present how family structures have been preserved, discuss the practical and technical challenges inherent in developing such a large and complex database, present the risk and utility of the data, and propose avenues for future research.

    Keywords: synthetic data of high analytical value; family structures; modern data access solution.

    Release date: 2021-10-29

  • Articles and reports: 11-522-X202100100027
    Description:

    Privacy concerns are a barrier to applying remote analytics, including machine learning, on sensitive data via the cloud. In this work, we use a leveled fully Homomorphic Encryption scheme to train an end-to-end supervised machine learning algorithm to classify texts while protecting the privacy of the input data points. We train our single-layer neural network on a large simulated dataset, providing a practical solution to a real-world multi-class text classification task. To improve both accuracy and training time, we train an ensemble of such classifiers in parallel using ciphertext packing.

    Key Words: Privacy Preservation, Machine Learning, Encryption

    Release date: 2021-10-29

  • Articles and reports: 11-522-X202100100028
    Description:

    Many Government of Canada groups are developing codes to process and visualize various kinds data, often duplicating each other’s efforts, with sub-optimal efficiency and limited level of code quality reviewing. This paper informally presents a working-level approach to addressing this technical problem. The idea is to collaboratively build a common repository of code and knowledgebase for use by anyone in the public sector to perform many common data science tasks, and, in doing that, help each other to master both the data science coding skills and the industry standard collaborative practices. The paper explains why R language is used as the language of choice for collaborative data science code development. It summaries R advantages and addresses its limitations, establishes the taxonomy of discussion topics of highest interested to the GC data scientists working with R, provides an overview of used collaborative platforms, and presents the results obtained to date. Even though the code knowledgebase is developed mainly in R, it is meant to be valuable also for data scientists coding in Python and other development environments. Key Words: Collaboration; Data science; Data Engineering; R; Open Government; Open Data; Open Science

    Release date: 2021-10-29

  • Articles and reports: 75-006-X202100100010
    Description:

    Canada's religious landscape has undergone significant changes in recent decades, including a decline in religious affiliation and participation in individual and group religious activities. This study uses data from the General Social Survey to provide a portrait of the diverse relationships that Canadians have with religion. The study also presents key trends that characterize the evolution of religiosity in Canada since 1985.

    Release date: 2021-10-28

  • Articles and reports: 36-28-0001202101000001
    Description:

    Since the beginning of the COVID-19 pandemic, Statistics Canada has produced several studies on work from home. This article synthesizes the key findings of these studies and identifies questions for future research.

    Release date: 2021-10-27

  • Articles and reports: 36-28-0001202101000002
    Description:

    This article provides an integrated analysis of recent changes in output, consumer spending, business investment, international trade and employment. It also draws on new data sources that provide detailed information on the financial conditions facing businesses and households. The analysis is based on data that are publicly available as of October 8, 2021.

    Release date: 2021-10-27
Journals and periodicals (2)

Journals and periodicals (2) ((2 results))

  • Journals and periodicals: 82-625-X
    Geography: Canada
    Description:

    Health fact sheets will include short, focused, single-theme analysis documents. Over the course of the series, analysis will include topics on: Health conditions, lifestyle, well-being, disability, prevention and detection of disease, deaths, pregnancy and birth, health care services and environmental factors.

    Release date: 2021-10-27

  • Journals and periodicals: 91-209-X
    Geography: Canada
    Description:

    The Report on the Demographic Situation in Canada analyses recent demographic patterns at the national, provincial and subprovincial levels. Trends in population growth and the evolution of the various components of Canada's population growth - fertility, mortality and migration (interprovincial and international) - as well as marital status, are examined. The Report on the Demographic Situation in Canada has been published annually or biennially since 1985. Beginning in 2011, the Report is available as a dynamic, internet-only publication in order to provide the most recent data and analyses on Canadian demographics as soon as they are available.

    Release date: 2021-07-14
Date modified: