Keyword search

Filter results by

Search Help
Currently selected filters that can be removed

Keyword(s)

Survey or statistical program

702 facets displayed. 0 facets selected.

Content

1 facets displayed. 0 facets selected.
Sort Help
entries

Results

All (24,373)

All (24,373) (0 to 10 of 24,373 results)

  • Articles and reports: 12-001-X202400100001
    Description: Inspired by the two excellent discussions of our paper, we offer some new insights and developments into the problem of estimating participation probabilities for non-probability samples. First, we propose an improvement of the method of Chen, Li and Wu (2020), based on best linear unbiased estimation theory, that more efficiently leverages the available probability and non-probability sample data. We also develop a sample likelihood approach, similar in spirit to the method of Elliott (2009), that properly accounts for the overlap between both samples when it can be identified in at least one of the samples. We use best linear unbiased prediction theory to handle the scenario where the overlap is unknown. Interestingly, our two proposed approaches coincide in the case of unknown overlap. Then, we show that many existing methods can be obtained as a special case of a general unbiased estimating function. Finally, we conclude with some comments on nonparametric estimation of participation probabilities.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100002
    Description: We provide comparisons among three parametric methods for the estimation of participation probabilities and some brief comments on homogeneous groups and post-stratification.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100003
    Description: Beaumont, Bosa, Brennan, Charlebois and Chu (2024) propose innovative model selection approaches for estimation of participation probabilities for non-probability sample units. We focus our discussion on the choice of a likelihood and parameterization of the model, which are key for the effectiveness of the techniques developed in the paper. We consider alternative likelihood and pseudo-likelihood based methods for estimation of participation probabilities and present simulations implementing and comparing the AIC based variable selection. We demonstrate that, under important practical scenarios, the approach based on a likelihood formulated over the observed pooled non-probability and probability samples performed better than the pseudo-likelihood based alternatives. The contrast in sensitivity of the AIC criteria is especially large for small probability sample sizes and low overlap in covariates domains.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100004
    Description: Non-probability samples are being increasingly explored in National Statistical Offices as an alternative to probability samples. However, it is well known that the use of a non-probability sample alone may produce estimates with significant bias due to the unknown nature of the underlying selection mechanism. Bias reduction can be achieved by integrating data from the non-probability sample with data from a probability sample provided that both samples contain auxiliary variables in common. We focus on inverse probability weighting methods, which involve modelling the probability of participation in the non-probability sample. First, we consider the logistic model along with pseudo maximum likelihood estimation. We propose a variable selection procedure based on a modified Akaike Information Criterion (AIC) that properly accounts for the data structure and the probability sampling design. We also propose a simple rank-based method of forming homogeneous post-strata. Then, we extend the Classification and Regression Trees (CART) algorithm to this data integration scenario, while again properly accounting for the probability sampling design. A bootstrap variance estimator is proposed that reflects two sources of variability: the probability sampling design and the participation model. Our methods are illustrated using Statistics Canada’s crowdsourcing and survey data.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100005
    Description: In this rejoinder, I address the comments from the discussants, Dr. Takumi Saegusa, Dr. Jae-Kwang Kim and Ms. Yonghyun Kwon. Dr. Saegusa’s comments about the differences between the conditional exchangeability (CE) assumption for causal inferences versus the CE assumption for finite population inferences using nonprobability samples, and the distinction between design-based versus model-based approaches for finite population inference using nonprobability samples, are elaborated and clarified in the context of my paper. Subsequently, I respond to Dr. Kim and Ms. Kwon’s comprehensive framework for categorizing existing approaches for estimating propensity scores (PS) into conditional and unconditional approaches. I expand their simulation studies to vary the sampling weights, allow for misspecified PS models, and include an additional estimator, i.e., scaled adjusted logistic propensity estimator (Wang, Valliant and Li (2021), denoted by sWBS). In my simulations, it is observed that the sWBS estimator consistently outperforms or is comparable to the other estimators under the misspecified PS model. The sWBS, as well as WBS or ABS described in my paper, do not assume that the overlapped units in both the nonprobability and probability reference samples are negligible, nor do they require the identification of overlap units as needed by the estimators proposed by Dr. Kim and Ms. Kwon.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100006
    Description: In some of non-probability sample literature, the conditional exchangeability assumption is considered to be necessary for valid statistical inference. This assumption is rooted in causal inference though its potential outcome framework differs greatly from that of non-probability samples. We describe similarities and differences of two frameworks and discuss issues to consider when adopting the conditional exchangeability assumption in non-probability sample setups. We also discuss the role of finite population inference in different approaches of propensity scores and outcome regression modeling to non-probability samples.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100007
    Description: Pseudo weight construction for data integration can be understood in the two-phase sampling framework. Using the two-phase sampling framework, we discuss two approaches to the estimation of propensity scores and develop a new way to construct the propensity score function for data integration using the conditional maximum likelihood method. Results from a limited simulation study are also presented.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100008
    Description: Nonprobability samples emerge rapidly to address time-sensitive priority topics in different areas. These data are timely but subject to selection bias. To reduce selection bias, there has been wide literature in survey research investigating the use of propensity-score (PS) adjustment methods to improve the population representativeness of nonprobability samples, using probability-based survey samples as external references. Conditional exchangeability (CE) assumption is one of the key assumptions required by PS-based adjustment methods. In this paper, I first explore the validity of the CE assumption conditional on various balancing score estimates that are used in existing PS-based adjustment methods. An adaptive balancing score is proposed for unbiased estimation of population means. The population mean estimators under the three CE assumptions are evaluated via Monte Carlo simulation studies and illustrated using the NIH SARS-CoV-2 seroprevalence study to estimate the proportion of U.S. adults with COVID-19 antibodies from April 01-August 04, 2020.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100009
    Description: Our comments respond to discussion from Sen, Brick, and Elliott. We weigh the potential upside and downside of Sen’s suggestion of using machine learning to identify bogus respondents through interactions and improbable combinations of variables. We join Brick in reflecting on bogus respondents’ impact on the state of commercial nonprobability surveys. Finally, we consider Elliott’s discussion of solutions to the challenge raised in our study.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100010
    Description: This discussion summarizes the interesting new findings around measurement errors in opt-in surveys by Kennedy, Mercer and Lau (KML). While KML enlighten readers about “bogus responding” and possible patterns in them, this discussion suggests combining these new-found results with other avenues of research in nonprobability sampling, such as improvement of representativeness.
    Release date: 2024-06-25
Data (12,024)

Data (12,024) (0 to 10 of 12,024 results)

  • Data Visualization: 71-607-X2017003
    Description: This web application provides access to data on the sales of food services and drinking places for Canada, provinces and territories. This dynamic application allows users to compare provincial and territorial data with interactive maps and charts. All data in this release are seasonally adjusted and expressed in current dollars.
    Release date: 2024-06-25

  • Data Visualization: 71-607-X2018016
    Description: This interactive dashboard provides access to current and historical Consumer Price Index (CPI) data in a dynamic and customizable format. Key indicators such as the 12-month and 1-month inflation rates and price trends are presented in interactive charts, allowing users to compare and analyze price changes of all the goods and services in the CPI basket over time as well as across geography (national, provincial and territorial levels).

    Other CPI indicators available in this tool include the Bank of Canada’s core measures of inflation, seasonally adjusted inflation rates, and CPI basket weights.

    This web-based application is updated monthly, as soon as the data for the latest reference month is released in The Daily.

    Release date: 2024-06-25

  • Table: 71-607-X
    Description: Statistics Canada produces a variety of interactive visualization tools that present data in a graphical form. These tools provide a useful way of interpreting trends behind our data on various social and economic topics.
    Release date: 2024-06-25

  • Table: 10-10-0015-01
    Geography: Canada
    Frequency: Quarterly
    Description: Quarterly data by level of government.
    Release date: 2024-06-25

  • Table: 10-10-0139-01
    Geography: Canada
    Frequency: Daily
    Description: This table contains 39 series, with data for starting from 1991 (not all combinations necessarily have data for all years). This table contains data described by the following dimensions (Not all combinations are available): Geography (1 item: Canada); Financial market statistics (39 items: Government of Canada Treasury Bills, 1-month (composite rates); Government of Canada Treasury Bills, 2-month (composite rates); Government of Canada Treasury Bills, 3-month (composite rates);Government of Canada Treasury Bills, 6-month (composite rates); ...).
    Release date: 2024-06-25

  • Table: 18-10-0001-01
    Geography: Canada, Census subdivision, Census metropolitan area, Census metropolitan area part
    Frequency: Monthly
    Description:

    Monthly average retail prices for gasoline and fuel oil for Canada, selected provincial cities, Whitehorse and Yellowknife. Prices are presented for the current month and previous four months. Includes fuel type and the price in cents per litre.

    Release date: 2024-06-25

  • Table: 18-10-0004-01
    Geography: Canada, Province or territory, Census subdivision, Census metropolitan area, Census metropolitan area part
    Frequency: Monthly
    Description:

    Monthly indexes for major components and special aggregates of the Consumer Price Index (CPI), not seasonally adjusted, for Canada, provinces, Whitehorse, Yellowknife and Iqaluit. Data are presented for the current month and previous four months. The base year for the index is 2002=100.

    Release date: 2024-06-25

  • Table: 18-10-0004-02
    Geography: Canada, Province or territory, Census subdivision, Census metropolitan area, Census metropolitan area part
    Frequency: Monthly
    Description:

    Monthly indexes and percentage changes for all components and special aggregates of the Consumer Price Index (CPI), not seasonally adjusted, for Canada, provinces, Whitehorse, Yellowknife and Iqaluit. Data are presented for the corresponding month of the previous year, the previous month and the current month. The base year for the index is 2002=100. 

    Release date: 2024-06-25

  • Table: 18-10-0004-03
    Geography: Canada, Province or territory, Census subdivision, Census metropolitan area, Census metropolitan area part
    Frequency: Monthly
    Description: Monthly indexes and percentage changes for selected sub-groups of the food component of the Consumer Price Index (CPI), not seasonally adjusted, for Canada, provinces, Whitehorse and Yellowknife. Data are presented for the corresponding month of the previous year, the previous month and the current month. The base year for the index is 2002=100.
    Release date: 2024-06-25

  • Table: 18-10-0004-04
    Geography: Canada, Province or territory, Census subdivision, Census metropolitan area, Census metropolitan area part
    Frequency: Monthly
    Description: Monthly indexes and percentage changes for selected sub-groups of the shelter component of the Consumer Price Index (CPI), not seasonally adjusted, for Canada, provinces, Whitehorse and Yellowknife. Data are presented for the corresponding month of the previous year, the previous month and the current month. The base year for the index is 2002=100.
    Release date: 2024-06-25
Analysis (9,984)

Analysis (9,984) (0 to 10 of 9,984 results)

  • Articles and reports: 12-001-X202400100001
    Description: Inspired by the two excellent discussions of our paper, we offer some new insights and developments into the problem of estimating participation probabilities for non-probability samples. First, we propose an improvement of the method of Chen, Li and Wu (2020), based on best linear unbiased estimation theory, that more efficiently leverages the available probability and non-probability sample data. We also develop a sample likelihood approach, similar in spirit to the method of Elliott (2009), that properly accounts for the overlap between both samples when it can be identified in at least one of the samples. We use best linear unbiased prediction theory to handle the scenario where the overlap is unknown. Interestingly, our two proposed approaches coincide in the case of unknown overlap. Then, we show that many existing methods can be obtained as a special case of a general unbiased estimating function. Finally, we conclude with some comments on nonparametric estimation of participation probabilities.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100002
    Description: We provide comparisons among three parametric methods for the estimation of participation probabilities and some brief comments on homogeneous groups and post-stratification.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100003
    Description: Beaumont, Bosa, Brennan, Charlebois and Chu (2024) propose innovative model selection approaches for estimation of participation probabilities for non-probability sample units. We focus our discussion on the choice of a likelihood and parameterization of the model, which are key for the effectiveness of the techniques developed in the paper. We consider alternative likelihood and pseudo-likelihood based methods for estimation of participation probabilities and present simulations implementing and comparing the AIC based variable selection. We demonstrate that, under important practical scenarios, the approach based on a likelihood formulated over the observed pooled non-probability and probability samples performed better than the pseudo-likelihood based alternatives. The contrast in sensitivity of the AIC criteria is especially large for small probability sample sizes and low overlap in covariates domains.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100004
    Description: Non-probability samples are being increasingly explored in National Statistical Offices as an alternative to probability samples. However, it is well known that the use of a non-probability sample alone may produce estimates with significant bias due to the unknown nature of the underlying selection mechanism. Bias reduction can be achieved by integrating data from the non-probability sample with data from a probability sample provided that both samples contain auxiliary variables in common. We focus on inverse probability weighting methods, which involve modelling the probability of participation in the non-probability sample. First, we consider the logistic model along with pseudo maximum likelihood estimation. We propose a variable selection procedure based on a modified Akaike Information Criterion (AIC) that properly accounts for the data structure and the probability sampling design. We also propose a simple rank-based method of forming homogeneous post-strata. Then, we extend the Classification and Regression Trees (CART) algorithm to this data integration scenario, while again properly accounting for the probability sampling design. A bootstrap variance estimator is proposed that reflects two sources of variability: the probability sampling design and the participation model. Our methods are illustrated using Statistics Canada’s crowdsourcing and survey data.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100005
    Description: In this rejoinder, I address the comments from the discussants, Dr. Takumi Saegusa, Dr. Jae-Kwang Kim and Ms. Yonghyun Kwon. Dr. Saegusa’s comments about the differences between the conditional exchangeability (CE) assumption for causal inferences versus the CE assumption for finite population inferences using nonprobability samples, and the distinction between design-based versus model-based approaches for finite population inference using nonprobability samples, are elaborated and clarified in the context of my paper. Subsequently, I respond to Dr. Kim and Ms. Kwon’s comprehensive framework for categorizing existing approaches for estimating propensity scores (PS) into conditional and unconditional approaches. I expand their simulation studies to vary the sampling weights, allow for misspecified PS models, and include an additional estimator, i.e., scaled adjusted logistic propensity estimator (Wang, Valliant and Li (2021), denoted by sWBS). In my simulations, it is observed that the sWBS estimator consistently outperforms or is comparable to the other estimators under the misspecified PS model. The sWBS, as well as WBS or ABS described in my paper, do not assume that the overlapped units in both the nonprobability and probability reference samples are negligible, nor do they require the identification of overlap units as needed by the estimators proposed by Dr. Kim and Ms. Kwon.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100006
    Description: In some of non-probability sample literature, the conditional exchangeability assumption is considered to be necessary for valid statistical inference. This assumption is rooted in causal inference though its potential outcome framework differs greatly from that of non-probability samples. We describe similarities and differences of two frameworks and discuss issues to consider when adopting the conditional exchangeability assumption in non-probability sample setups. We also discuss the role of finite population inference in different approaches of propensity scores and outcome regression modeling to non-probability samples.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100007
    Description: Pseudo weight construction for data integration can be understood in the two-phase sampling framework. Using the two-phase sampling framework, we discuss two approaches to the estimation of propensity scores and develop a new way to construct the propensity score function for data integration using the conditional maximum likelihood method. Results from a limited simulation study are also presented.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100008
    Description: Nonprobability samples emerge rapidly to address time-sensitive priority topics in different areas. These data are timely but subject to selection bias. To reduce selection bias, there has been wide literature in survey research investigating the use of propensity-score (PS) adjustment methods to improve the population representativeness of nonprobability samples, using probability-based survey samples as external references. Conditional exchangeability (CE) assumption is one of the key assumptions required by PS-based adjustment methods. In this paper, I first explore the validity of the CE assumption conditional on various balancing score estimates that are used in existing PS-based adjustment methods. An adaptive balancing score is proposed for unbiased estimation of population means. The population mean estimators under the three CE assumptions are evaluated via Monte Carlo simulation studies and illustrated using the NIH SARS-CoV-2 seroprevalence study to estimate the proportion of U.S. adults with COVID-19 antibodies from April 01-August 04, 2020.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100009
    Description: Our comments respond to discussion from Sen, Brick, and Elliott. We weigh the potential upside and downside of Sen’s suggestion of using machine learning to identify bogus respondents through interactions and improbable combinations of variables. We join Brick in reflecting on bogus respondents’ impact on the state of commercial nonprobability surveys. Finally, we consider Elliott’s discussion of solutions to the challenge raised in our study.
    Release date: 2024-06-25

  • Articles and reports: 12-001-X202400100010
    Description: This discussion summarizes the interesting new findings around measurement errors in opt-in surveys by Kennedy, Mercer and Lau (KML). While KML enlighten readers about “bogus responding” and possible patterns in them, this discussion suggests combining these new-found results with other avenues of research in nonprobability sampling, such as improvement of representativeness.
    Release date: 2024-06-25
Reference (1,891)

Reference (1,891) (20 to 30 of 1,891 results)

  • Surveys and statistical programs – Documentation: 62F0072G
    Description: The primary source of data used in post index construction are periodic retail-price and cost-of-living surveys conducted at foreign locations by the foreign service personnel stationed there. Statistics Canada analyses this survey data. In addition to a variety of price information gathered from retail outlets patronized by Canadian personnel at the post, data are also obtained regarding their spending patterns, along with information on the availability of any special local purchasing facilities, and the extent to which staff make direct importation of consumer goods from other countries. The general aim is for full-scale surveys to be carried out at about three-year intervals. However, specific studies may be undertaken more or less frequently than this, depending on the volatility of retail price conditions in each particular country, the instability of exchange rates and the extent to which close monitoring of changes in the local retail price situation being faced by Canadian personnel can be achieved through reference to other statistical indicators.
    Release date: 2023-12-22

  • Surveys and statistical programs – Documentation: 37-20-00012023006
    Description: This technical reference guide is intended for users of the Education and Labour Market Longitudinal Platform (ELMLP). The data for the products associated with this issue are based on the longitudinal Postsecondary Student Information System (PSIS) administrative data files. Statistics Canada has derived a series of annual indicators of public postsecondary students including persistence rates, graduation rates, and average time to graduation by educational qualification, field of study, age group and gender for Canada, the provinces, and the three combined Territories.
    Release date: 2023-12-19

  • Classification: 12-608-X
    Description: The Standard Classification of Countries and Areas of Interest (SCCAI) has been developed to increase coherence of the list of countries used within Statistics Canada and includes countries and areas for which statistical data are compiled. The Variant of the SCCAI for Social Statistics and the Variant of the SCCAI for Travel Statistics were developed to create groupings of countries to enable the production of integrated statistics when publishing social statistics data and travel statistics data, respectively. These variants have three levels.
    Release date: 2023-12-07

  • Surveys and statistical programs – Documentation: 25-26-0002
    Description: The Consolidated Energy Statistics table (CEST) provides national level monthly estimates of supply and demand characteristics, for both primary and secondary energy sources by fuel type. The data is presented in terajoules; a common unit of measure, allowing easy comparisons between different fuel and energy types. The table is updated with new data on a monthly basis.
    Release date: 2023-12-07

  • Classification: 65-209-X
    Description: The Canadian Export Classification is a structured, hierarchical classification system based on the Harmonized Description and Coding System. The HS nomenclature is divided into 21 Sections, which in general, group goods produced in the same sector of the economy.
    Release date: 2023-12-06

  • Surveys and statistical programs – Documentation: 89-654-X2023004
    Description: The Canadian Survey on Disability (CSD) is a national survey of Canadians aged 15 and over whose everyday activities are limited because of a long-term condition or health-related problem. The 2022 CSD Concepts and Methods Guide is designed to assist CSD data users by providing relevant information on survey content and concepts, sampling design, collection methods, data processing, data quality and product availability.
    Release date: 2023-12-01

  • Classification: 68-516-X
    Description: This product presents the Public Sector Universe, defined by Statistics Canada as a list of institutional units that are included in the economic measurement of governments in Canada. An institutional unit is defined as an economic entity that is capable, in its own right, of owning assets, incurring liabilities, and engaging in economic activities and in transactions with other entities. It includes the sectors of education, health, general government and government business enterprise sectors for all levels of government in Canada, annually, since 2008.
    Release date: 2023-11-22

  • Surveys and statistical programs – Documentation: 45-20-00012023002
    Description: The Canadian Index of Multiple Deprivation (CIMD) is an area-based index which uses Census of Population microdata to measure four key dimensions of deprivation at the dissemination area (DA)-level: residential instability, economic dependency, situational vulnerability and ethno-cultural composition.

    The CIMD allows for an understanding of inequalities in various measures of health and social well-being. While it is a geographically-based index of deprivation and marginalization, it can also be used as a proxy for an individual. The CIMD has the potential to be widely used by researchers on a variety of topics related to socio-economic research. Other uses for the index may include: policy planning and evaluation, or resource allocation.
    Release date: 2023-11-10

  • Surveys and statistical programs – Documentation: 12-585-X
    Description: This product is the dictionary for the Longitudinal Administrative Databank (LAD). The dictionary contains a complete description for each of the income and demographic variables in the LAD, including name, acronym, definition, source, historical availability and historical continuity.

    The following is a partial list of LAD variables: age, sex, marital status, family type, number and age of children, total income, wages and salaries, self-employment, Employment Insurance, Old Age Security, Canada and Quebec Pension Plans, social assistance, investment income, rental income, alimony, registered retirement savings plan (RRSP) income and contributions, low-income status, full-time education deduction, provincial refundable tax credits, goods and service tax (GST) credits, Canada Child Tax Benefits, selected immigration variables, Tax Free Savings (TFSA) information and Canadian Controlled Private Corporations (CCPC) information.

    Release date: 2023-11-10

  • Surveys and statistical programs – Documentation: 45-20-0001
    Description:

    The Canadian Index of Multiple Deprivation (CIMD) is an area-based index which uses Census of Population microdata to measure four key dimensions of deprivation at the dissemination area (DA)-level: residential instability, economic dependency, situational vulnerability and ethno-cultural composition. The CIMD allows for an understanding of inequalities in various measures of health and social well-being. While it is a geographically-based index of deprivation and marginalization, it can also be used as a proxy for an individual. The CIMD has the potential to be widely used by researchers on a variety of topics related to socio-economic research. Other uses for the index may include: policy planning and evaluation, or resource allocation.

    Release date: 2023-11-10
Date modified: