Data analysis
Filter results by
Search HelpKeyword(s)
Type
Survey or statistical program
- Census of Population (13)
- Canadian Community Health Survey - Annual Component (7)
- Labour Force Survey (7)
- Survey of Household Spending (6)
- Canadian Income Survey (4)
- Survey of Labour and Income Dynamics (3)
- Longitudinal Immigration Database (3)
- Canadian Health Measures Survey (3)
- Gross Domestic Product by Industry - National (Monthly) (2)
- Monthly Oil and Other Liquid Petroleum Products Pipeline Survey (2)
- Uniform Crime Reporting Survey (2)
- Census of Agriculture (2)
- Households and the Environment Survey (2)
- Time Use Survey (2)
- Biennial Drinking Water Plants Survey (2)
- Longitudinal Employment Analysis Program (2)
- Canada's International Transactions in Services (1)
- Waste Management Industry Survey: Government Sector (1)
- National Balance Sheet Accounts (1)
- National Gross Domestic Product by Income and by Expenditure Accounts (1)
- National Tourism Indicators (1)
- Biennial Waste Management Survey (1)
- Monthly Electricity Supply and Disposition Survey (1)
- Annual Electricity Supply and Disposition Survey (1)
- Consumer Price Index (1)
- Monthly New Motor Vehicle Sales Survey (1)
- Survey of Employment, Payrolls and Hours (1)
- Survey of Financial Security (1)
- Monthly Passenger Bus and Urban Transit Survey (1)
- Stock and Consumption of Fixed Non-residential Capital (1)
- Tuition and Living Accommodation Costs (1)
- Vital Statistics - Death Database (1)
- Annual Demographic Estimates: Canada, Provinces and Territories (1)
- Homeowner Repair and Renovation Survey (1)
- Annual Income Estimates for Census Families and Individuals (T1 Family File) (1)
- Annual Survey of Research and Development in Canadian Industry (1)
- Research and Development of Canadian Private Non-Profit Organizations (1)
- General Social Survey - Victimization (1)
- Postsecondary Student Information System (1)
- General Social Survey - Social Identity (1)
- Culture Services Trade (1)
- Canadian Community Health Survey - Nutrition (1)
- Canadian System of Environmental-Economic Accounts - Physical Flow Accounts (1)
- Air Quality Indicators (1)
- Freshwater Quality Indicator (1)
- Longitudinal and International Study of Adults (1)
- Government Finance Statistics (1)
- National Household Survey (1)
- Gross Domestic Expenditures on Research and Development (1)
- Survey of Safety in Public and Private Spaces (1)
- Canadian Housing Statistics Program (1)
- Study on International Money Transfers (1)
- Canadian Housing Survey (1)
- Survey on Early Learning and Child Care Arrangements (SELCCA) (1)
- Canadian Perspectives Survey Series (CPSS) (1)
- Canada Mortgage and Housing Corporation (1)
Results
All (289)
All (289) (40 to 50 of 289 results)
- Stats in brief: 89-20-00082021002Description: This video is part of the confidentiality vetting support series and presents examples of how to use SAS to create proportion output for researchers working with confidential data.Release date: 2022-04-27
- Stats in brief: 89-20-00082021003Description: This video is part of the confidentiality vetting support series and presents examples of how to use Stata to create proportion output for researchers working with confidential data.Release date: 2022-04-27
- 43. Confidentiality Vetting Support: Dominance and homogeneity using the tcensus function (Stata) ArchivedStats in brief: 89-20-00082021004Description: This video is part of the confidentiality vetting support series and presents examples of how to use Stata to perform the dominance and homogeneity test while using the Census.Release date: 2022-04-27
- Stats in brief: 89-20-00082021005Description: This video is part of the confidentiality vetting support series and presents examples of how to use R to create proportion output for researchers working with confidential data.Release date: 2022-04-27
- Stats in brief: 89-20-00082021006Description: This video is part of the confidentiality vetting support series and presents examples of how to use R to perform the dominance and homogeneity test while using the Census.Release date: 2022-04-27
- Stats in brief: 11-627-M2022016Description:
This infographic explains the steps involved in collecting data for all Statistics Canada household and business surveys. The responses are compiled, analyzed and used to make important decisions and are kept strictly confidential.
Release date: 2022-02-28 - Articles and reports: 11-522-X202100100029Description:
In line with the path taken by the European Statistical System, Istat is investing on innovative methods to harness Big Data sources and to use them for the production of new and enriched Official Statistics products. Big Data sources are not, in general, directly tractable with traditional statistical techniques, just think of specific data types such as images and texts that are examples of the Variety dimension of Big Data. This motivates and justifies the growing interest of National Statistical Institutes in data science techniques. Istat is currently using data science techniques, including machine learning techniques, in innovation projects and for the publication of experimental statistics. This paper will provide an overview of the main current projects by Istat and will focus on two specific Big Data-based production pipelines, related to the processing of respectively text sources and imagery sources. The paper will highlight the main challenges these two pipelines and the solutions put in place to solve them.
Key Words: Machine Learning; Text Processing; Image Processing; Big Data
Release date: 2021-11-05 - Articles and reports: 11-522-X202100100008Description:
Non-probability samples are being increasingly explored by National Statistical Offices as a complement to probability samples. We consider the scenario where the variable of interest and auxiliary variables are observed in both a probability and non-probability sample. Our objective is to use data from the non-probability sample to improve the efficiency of survey-weighted estimates obtained from the probability sample. Recently, Sakshaug, Wisniowski, Ruiz and Blom (2019) and Wisniowski, Sakshaug, Ruiz and Blom (2020) proposed a Bayesian approach to integrating data from both samples for the estimation of model parameters. In their approach, non-probability sample data are used to determine the prior distribution of model parameters, and the posterior distribution is obtained under the assumption that the probability sampling design is ignorable (or not informative). We extend this Bayesian approach to the prediction of finite population parameters under non-ignorable (or informative) sampling by conditioning on appropriate survey-weighted statistics. We illustrate the properties of our predictor through a simulation study.
Key Words: Bayesian prediction; Gibbs sampling; Non-ignorable sampling; Statistical data integration.
Release date: 2021-10-29 - Articles and reports: 11-522-X202100100009Description:
Use of auxiliary data to improve the efficiency of estimators of totals and means through model-assisted survey regression estimation has received considerable attention in recent years. Generalized regression (GREG) estimators, based on a working linear regression model, are currently used in establishment surveys at Statistics Canada and several other statistical agencies. GREG estimators use common survey weights for all study variables and calibrate to known population totals of auxiliary variables. Increasingly, many auxiliary variables are available, some of which may be extraneous. This leads to unstable GREG weights when all the available auxiliary variables, including interactions among categorical variables, are used in the working linear regression model. On the other hand, new machine learning methods, such as regression trees and lasso, automatically select significant auxiliary variables and lead to stable nonnegative weights and possible efficiency gains over GREG. In this paper, a simulation study, based on a real business survey sample data set treated as the target population, is conducted to study the relative performance of GREG, regression trees and lasso in terms of efficiency of the estimators.
Key Words: Model assisted inference; calibration estimation; model selection; generalized regression estimator.
Release date: 2021-10-29 - Articles and reports: 11-522-X202100100018Description: Statistics Finland started publishing nowcasts of the trend indicator of output (TIO), the monthly indicator of real economic activity, to answer users´ needs during the Covid-19 pandemic. The indicator was first published in April 2020, at the very beginning of the pandemic in Finland, and had a monthly release schedule until June 2021. The TIO nowcasts are produced using open-source data on truck traffic volumes at about 100 automatic measuring points in the Helsinki/Uusimaa -region and the Economic Sentiment Indicator for Finland. Estimation is done using a machine learning approach and the methodology is based on previous work done by Statistics Finland and ETLA Economic Research.
Key Words: nowcasting; flash estimates; machine learning; experimental statistics.
Release date: 2021-10-29
- Previous Go to previous page of All results
- 1 Go to page 1 of All results
- 2 Go to page 2 of All results
- 3 Go to page 3 of All results
- 4 Go to page 4 of All results
- 5 (current) Go to page 5 of All results
- 6 Go to page 6 of All results
- 7 Go to page 7 of All results
- ...
- 29 Go to page 29 of All results
- Next Go to next page of All results
Data (2)
Data (2) ((2 results))
- 1. Canadian Statistical Geospatial Explorer Hub ArchivedData Visualization: 71-607-X2020010Description: The Canadian Statistical Geospatial Explorer empowers users to discover geo enabled data holdings of Statistics Canada at various levels of geography including at the neighbourhood level. Users are able to visualize, thematically map, spatially explore and analyze, export and consume data in various formats. Users can also view the data superimposed on satellite imagery, topographic and street layers.Release date: 2024-08-21
- 2. Housing Data Viewer ArchivedData Visualization: 71-607-X2019010Description: The Housing Data Viewer is a visualization tool that allows users to explore Statistics Canada data on a map. Users can use the tool to navigate, compare and export data.Release date: 2019-10-30
Analysis (256)
Analysis (256) (250 to 260 of 256 results)
- Articles and reports: 12-001-X198000154834Description:
The paper illustrates several practical problems in the adaptation of statistical theory to survey design in the context of the revision of an employment survey programme.
Release date: 1980-06-16 - Articles and reports: 12-001-X197900254833Description:
This paper looks at the current state of development of social statistics in Canada. Some key concepts related to statistics and social information are defined and discussed. The availability and analysis of administrative data is highlighted, along with the need for social surveys. Suggestions are made about the types of data analysis needed for the development of social decision models to meet policy requirements. Finally, an outline of priorities for future work toward the effective use of social statistics is given.
Release date: 1979-12-14 - Articles and reports: 12-001-X197600200001Description: This paper presents results on rotation group biases in the Canadian Labour Force Survey (LFS). The biases are studied in detail by decomposition into components responsible for the biases. Also, a comparison between the old and the new LFS is done on the basis of 1975 parallel run and differences are analyzed. Some conclusions are drawn and recommendations for other studies presented.Release date: 1976-12-13
- 254. Method Test Panel Phase II ‒ Data analysis ArchivedArticles and reports: 12-001-X197500254827Description: In the Methods Test Panel Phase II it was required to do analysis of variance on proportions. Since such analysis gives only approximate results, two models were used in order to be able to draw safe conclusions. Analysis of variance was performed with the proportions as variable and also with the arc sine of the square root of the proportions. The two models are outlined in the present paper and empirical comparisons are made using the MTP Phase II data.Release date: 1975-12-15
- 255. Analytic studies of sample survey data ArchivedArticles and reports: 12-001-X197500300001Description: Most sample surveys in the past have been "descriptive" in the sense that the main objective is the computation of means or totals of a number of characters of interest along with their standard errors. However, in recent years data produced from "descriptive" surveys are also being increasingly used for "analytical" purposes, i.e., for investigating relationships among variables. Also some sample surveys might have primary "analytical goals" in which case the "optimal" designing of such "analytical surveys" becomes important. These lecture notes present an account of some recent developments in the analytical studies of sample survey data. Many challenging problems remain to be solved and I hope these notes will provide stimulation for further research in this important area.Release date: 1975-12-15
- 256. Controlled random rounding ArchivedArticles and reports: 12-001-X197500254825Description:
Random rounding is a technique to ensure confidentiality of aggregate statistics. By randomly rounding all the components of a total, independently, together with the random rounding of the total itself, substantial discrepancies may arise when aggregating the published data. This paper presents a procedure which avoids substantial discrepancies while still protecting the concept of confidentiality.
Release date: 1975-12-15
- Previous Go to previous page of Analysis results
- 1 Go to page 1 of Analysis results
- ...
- 20 Go to page 20 of Analysis results
- 21 Go to page 21 of Analysis results
- 22 Go to page 22 of Analysis results
- 23 Go to page 23 of Analysis results
- 24 Go to page 24 of Analysis results
- 25 Go to page 25 of Analysis results
- 26 (current) Go to page 26 of Analysis results
- Next Go to next page of Analysis results
Reference (26)
Reference (26) (20 to 30 of 26 results)
- Surveys and statistical programs – Documentation: 11F0019M2003207Geography: CanadaDescription:
The estimation of intergenerational earnings mobility is rife with measurement problems since the research does not observe permanent, lifetime earnings. Nearly all studies make corrections for mean variation in earnings because of the age differences among respondents. Recent works employ average earnings or instrumental variable methods to address the effects of measurement error as a result of transitory earnings shocks and mis-reporting. However, empirical studies of intergenerational mobility have paid no attention to the changes in earnings variance across the life cycle suggested by economic models of human capital investment.
Using information from the Intergenerational Income Data from Canada and the National Longitudinal Survey and Panel Study of Income Dynamics from the United States, this study finds a strong association between age at observation and estimated earnings persistence. Part of this age-dependence is related to a general increase in transitory earnings variance during the collection of data. An independent effect of life cycle investment is also identified. These findings are then applied to the variation among intergenerational earnings persistence studies. Among studies with similar methodologies, one-third of the variance in published estimates of earnings persistence is attributable to cross-study differences in the age of responding fathers. Finally, these results call into question tests for the importance of credit constraints based on measures of earnings at different points in the life cycle.
Release date: 2003-08-05 - Surveys and statistical programs – Documentation: 12-584-GDescription:
This book introduces technical aspects of the Statistics Canada Total Work Accounts System (TWAS). The TWAS is designed to facilitate the analysis of issues that require simultaneous consideration of both paid work and unpaid productive work. Its key contribution is to allocate the deemed output of each episode of unpaid work activity to a specific beneficiary or group of beneficiaries (called "destinations"). The guide presents the criteria used to decide the allocation of each work episode to one of the destinations, as well as the pseudo code for DESTIN, the key variable of the System. This pseudo code allows programmers to quickly create the actual programming code needed to derive the DESTIN variable in their own microdata files of diary-based time-use records. The guide also discusses illustrative applications of the System, as well as its key limitations.
Release date: 2002-02-12 - Notices and consultations: 87-003-X19970012882Geography: CanadaDescription:
The purpose of this article is to inform Travel-log readers of the availability of a new analytical tool - the National Tourism Indicators. These estimates, which measure trends in tourism in Canada, are placed in perspective here, taking into account the concepts and definitions used in developing them.
Release date: 1997-01-08 - Surveys and statistical programs – Documentation: 11F0019M1995083Geography: CanadaDescription:
This paper examines the robustness of a measure of the average complete duration of unemployment in Canada to a host of assumptions used in its derivation. In contrast to the average incomplete duration of unemployment, which is a lagging cyclical indicator, this statistic is a coincident indicator of the business cycle. The impact of using a steady state as opposed to a non steady state assumption, as well as the impact of various corrections for response bias are explored. It is concluded that a non steady state estimator would be a valuable compliment to the statistics on unemployment duration that are currently released by many statistical agencies, and particularly Statistics Canada.
Release date: 1995-12-30 - 25. Labour Force Classification in the Survey of Labour and Income Dynamics (SLID): Evaluation of Test 3A Results ArchivedSurveys and statistical programs – Documentation: 75F0002M1993014Description:
This paper presents the results from test 3A of the Survey of Labour and Income Dynamics (SLID), conducted in January 1993, with a view to identify any necessary changes to the questions or to the algorithm used to derive labour force status.
Release date: 1995-12-30 - Surveys and statistical programs – Documentation: 75F0002M1994018Description:
This document describes the demographic, cultural and geographic derived variables for the Survey of Labour and Income Dynamics (SLID).
Release date: 1995-12-30