Quality assurance

Skip to main content
Skip to footer

Language selection

Français

Search and menus

Search and menus

Search

Skip to filters. View results.

Results

All (250)

All (250) (0 to 10 of 250 results)

1. Income Research Paper Series
Journals and periodicals: 75F0002M
Description: This series provides detailed documentation on income developments, including survey design issues, data quality evaluation and exploratory research.
Release date: 2024-04-26
2. Census of Agriculture: Data quality report
Surveys and statistical programs – Documentation: 32-26-0007
Description: Census of Agriculture data provide statistical information on farms and farm operators at fine geographic levels and for small subpopulations. Quality evaluation activities are essential to ensure that census data are reliable and that they meet user needs.
This report provides data quality information pertaining to the Census of Agriculture, such as sources of error, error detection, disclosure control methods, data quality indicators, response rates and collection rates.
Release date: 2024-02-06
3. Distributions of Household Economic Accounts, estimates of asset, liability and net worth distributions, 2010 to 2023, technical methodology and quality report
Articles and reports: 13-604-M2024001
Description: This documentation outlines the methodology used to develop the Distributions of household economic accounts published in January 2024 for the reference years 2010 to 2023. It describes the framework and the steps implemented to produce distributional information aligned with the National Balance Sheet Accounts and other national accounts concepts. It also includes a report on the quality of the estimated distributions.
Release date: 2024-01-22
4. Distributions of Household Economic Accounts, estimates of asset, liability and net worth distributions, 2010 to 2022, technical methodology and quality report
Articles and reports: 13-604-M2023001
Description: This documentation outlines the methodology used to develop the Distributions of household economic accounts published in March 2023 for the reference years 2010 to 2022. It describes the framework and the steps implemented to produce distributional information aligned with the National Balance Sheet Accounts and other national accounts concepts. It also includes a report on the quality of the estimated distributions.
Release date: 2023-03-31
5. Distributions of Household Economic Accounts, estimates of asset, liability and net worth distributions, 2010 to 2021, technical methodology and quality report
Articles and reports: 13-604-M2022002
Description:
This documentation outlines the methodology used to develop the Distributions of household economic accounts published in August 2022 for the reference years 2010 to 2021. It describes the framework and the steps implemented to produce distributional information aligned with the National Balance Sheet Accounts and other national accounts concepts. It also includes a report on the quality of the estimated distributions.

Release date: 2022-08-03
6. The Importance of Data Quality
19-22-0009
Description:
Join us as Statistics Canada’s Quality Secretariat will give a presentation on the importance of data quality. We are living in an exciting time for data: sources are more abundant, they are being generated in innovative ways, and they are available quicker than ever. However, a data source is not only worthless if it does not meet basic quality standards – it can be misleading, and worse than having no data at all! Statistics Canada’s Quality Secretariat has a mandate to promote good quality practices within the agency, across the Government of Canada, and internationally. For quality to truly be present, it must be incorporated into each process (from design to analysis) and into the product itself – whether that product is a microdata file or estimates derived from it. We will address why data quality is important and how one can evaluate it in practice. We will cover some basic concepts in data quality (quality assurance vs. control, metadata, etc.), and present data quality as a multidimensional concept. Finally, we will show data quality in action by evaluating a data source together. All data quality literacy levels are welcome. After all, everybody plays a part in quality!

https://www.statcan.gc.ca/en/services/webinars/19220009

Release date: 2022-01-26
7. Creation of a composite quality indicator for administrative data-based estimates using clustering Archived
Articles and reports: 11-522-X202100100015
Description: National statistical agencies such as Statistics Canada have a responsibility to convey the quality of statistical information to users. The methods traditionally used to do this are based on measures of sampling error. As a result, they are not adapted to the estimates produced using administrative data, for which the main sources of error are not due to sampling. A more suitable approach to reporting the quality of estimates presented in a multidimensional table is described in this paper. Quality indicators were derived for various post-acquisition processing steps, such as linkage, geocoding and imputation, by estimation domain. A clustering algorithm was then used to combine domains with similar quality levels for a given estimate. Ratings to inform users of the relative quality of estimates across domains were assigned to the groups created. This indicator, called the composite quality indicator (CQI), was developed and experimented with in the Canadian Housing Statistics Program (CHSP), which aims to produce official statistics on the residential housing sector in Canada using multiple administrative data sources.
Keywords: Unsupervised machine learning, quality assurance, administrative data, data integration, clustering.
Release date: 2021-10-22
8. Empowering analysts to consider the ethics of their work: A case study of the UK Statistics Authority's data ethics framework Archived
Articles and reports: 11-522-X202100100023
Description:
Our increasingly digital society provides multiple opportunities to maximise our use of data for the public good – using a range of sources, data types and technologies to enable us to better inform the public about social and economic matters and contribute to the effective development and evaluation of public policy. Ensuring use of data in ethically appropriate ways is an important enabler for realising the potential to use data for public good research and statistics. Earlier this year the UK Statistics Authority launched the Centre for Applied Data Ethics to provide applied data ethics services, advice, training and guidance to the analytical community across the United Kingdom. The Centre has developed a framework and portfolio of services to empower analysts to consider the ethics of their research quickly and easily, at the research design phase thus promoting a culture of ethics by design. This paper will provide an overview of this framework, the accompanying user support services and the impact of this work.

Key words: Data ethics, data, research and statistics

Release date: 2021-10-22
9. Distributions of Household Economic Accounts, estimates of asset, liability and net worth distributions, 2010 to 2020, technical methodology and quality report
Articles and reports: 13-604-M2021001
Description:
This documentation outlines the methodology used to develop the Distributions of household economic accounts published in September 2021 for the reference years 2010 to 2020. It describes the framework and the steps implemented to produce distributional information aligned with the National Balance Sheet Accounts and other national accounts concepts. It also includes a report on the quality of the estimated distributions.
Release date: 2021-09-07
10. Data Quality in Six Dimensions Archived
Stats in brief: 89-20-00062020001
Description:
In this video, you will be introduced to the fundamentals of data quality, which can be summed up in six dimensions—or six different ways to think about quality. You will also learn how each dimension can be used to evaluate the quality of data.
Release date: 2020-09-23

Data (0)

Data (0) (0 results)

No content available at this time.

Analysis (171)

Analysis (171) (10 to 20 of 171 results)

11. Distributions of Household Economic Accounts, estimates of asset, liability and net worth distributions, 2010 to 2018, technical methodology and quality report Archived
Articles and reports: 13-604-M2019001
Description:
This documentation outlines a step towards a more complete program of annual distributional estimates for the household sector in the Canadian macroeconomic accounts. This documentation also presents the methodology used to develop distributions of wealth for the household sector of the National Balance Sheet Accounts (NBSA) for the reference years 2010 to 2018.
Release date: 2019-03-27
12. Distributions of Household Economic Accounts, estimates of asset, liability and net worth distributions, 2010 to 2017, technical methodology and quality report Archived
Articles and reports: 13-604-M2018087
Description:
Statistics Canada regularly publishes macroeconomic indicators on household assets, liabilities and net worth as part of the quarterly National Balance Sheet Accounts (NBSA). These accounts are aligned with the most recent international standards and are the source of estimates of national wealth for all sectors of the economy, including households, non-profit institutions, governments and corporations along with Canada’s wealth position vis-a-vis the rest of the world. While the NBSA provide high quality information on the overall position of households relative to other economic sectors, they lack the granularity required to understand vulnerabilities of specific groups and the resulting implications for economic wellbeing and financial stability.
Release date: 2018-03-22
13. Statistics Canada's Quality Assurance Framework Archived
Stats in brief: 11-001-X201711116381
Description: Release published in The Daily – Statistics Canada’s official release bulletin
Release date: 2017-04-21
14. Using Administrative Records to Evaluate Survey Data Archived
Articles and reports: 11-522-X201700014711
Description:
After the 2010 Census, the U.S. Census Bureau conducted two separate research projects matching survey data to databases. One study matched to the third-party database Accurint, and the other matched to U.S. Postal Service National Change of Address (NCOA) files. In both projects, we evaluated response error in reported move dates by comparing the self-reported move date to records in the database. We encountered similar challenges in the two projects. This paper discusses our experience using “big data” as a comparison source for survey data and our lessons learned for future projects similar to the ones we conducted.
Release date: 2016-03-24
15. Assimilation and Coverage of the Foreign-Born Population in Administrative Records Archived
Articles and reports: 11-522-X201700014722
Description:
The U.S. Census Bureau is researching ways to incorporate administrative data in decennial census and survey operations. Critical to this work is an understanding of the coverage of the population by administrative records. Using federal and third party administrative data linked to the American Community Survey (ACS), we evaluate the extent to which administrative records provide data on foreign-born individuals in the ACS and employ multinomial logistic regression techniques to evaluate characteristics of those who are in administrative records relative to those who are not. We find that overall, administrative records provide high coverage of foreign-born individuals in our sample for whom a match can be determined. The odds of being in administrative records are found to be tied to the processes of immigrant assimilation – naturalization, higher English proficiency, educational attainment, and full-time employment are associated with greater odds of being in administrative records. These findings suggest that as immigrants adapt and integrate into U.S. society, they are more likely to be involved in government and commercial processes and programs for which we are including data. We further explore administrative records coverage for the two largest race/ethnic groups in our sample – Hispanic and non-Hispanic single-race Asian foreign born, finding again that characteristics related to assimilation are associated with administrative records coverage for both groups. However, we observe that neighborhood context impacts Hispanics and Asians differently.
Release date: 2016-03-24
16. When Race and Hispanic Origin Reporting are Discrepant across Administrative Records Sources: Exploring Methods to Assign Responses Archived
Articles and reports: 11-522-X201700014723
Description:
The U.S. Census Bureau is researching uses of administrative records in survey and decennial operations in order to reduce costs and respondent burden while preserving data quality. One potential use of administrative records is to utilize the data when race and Hispanic origin responses are missing. When federal and third party administrative records are compiled, race and Hispanic origin responses are not always the same for an individual across different administrative records sources. We explore different sets of business rules used to assign one race and one Hispanic response when these responses are discrepant across sources. We also describe the characteristics of individuals with matching, non-matching, and missing race and Hispanic origin data across several demographic, household, and contextual variables. We find that minorities, especially Hispanics, are more likely to have non-matching Hispanic origin and race responses in administrative records than in the 2010 Census. Hispanics are less likely to have missing Hispanic origin data but more likely to have missing race data in administrative records. Non-Hispanic Asians and non-Hispanic Pacific Islanders are more likely to have missing race and Hispanic origin data in administrative records. Younger individuals, renters, individuals living in households with two or more people, individuals who responded to the census in the nonresponse follow-up operation, and individuals residing in urban areas are more likely to have non-matching race and Hispanic origin responses.
Release date: 2016-03-24
17. The challenges of linking and using administrative data from different sources Archived
Articles and reports: 11-522-X201700014724
Description:
At the Institut national de santé publique du Québec, the Quebec Integrated Chronic Disease Surveillance System (QICDSS) has been used daily for approximately four years. The benefits of this system are numerous for measuring the extent of diseases more accurately, evaluating the use of health services properly and identifying certain groups at risk. However, in the past months, various problems have arisen that have required a great deal of careful thought. The problems have affected various areas of activity, such as data linkage, data quality, coordinating multiple users and meeting legal obligations. The purpose of this presentation is to describe the main challenges associated with using QICDSS data and to present some possible solutions. In particular, this presentation discusses the processing of five data sources that not only come from five different sources, but also are not mainly used for chronic disease surveillance. The varying quality of the data, both across files and within a given file, will also be discussed. Certain situations associated with the simultaneous use of the system by multiple users will also be examined. Examples will be given of analyses of large data sets that have caused problems. As well, a few challenges involving disclosure and the fulfillment of legal agreements will be briefly discussed.
Release date: 2016-03-24
18. Measuring the Quality of a Probabilistic Linkage through Clerical-Reviews Archived
Articles and reports: 11-522-X201700014743
Description:
Probabilistic linkage is susceptible to linkage errors such as false positives and false negatives. In many cases, these errors may be reliably measured through clerical-reviews, i.e. the visual inspection of a sample of record pairs to determine if they are matched. A framework is described to effectively carry-out such clerical-reviews based on a probabilistic sample of pairs, repeated independent reviews of the same pairs and latent class analysis to account for clerical errors.
Release date: 2016-03-24
19. Data surveillance on the clinical data used for health system funding in Ontario Archived
Articles and reports: 11-522-X201700014758
Description: Several Canadian jurisdictions including Ontario are using patient-based healthcare data in their funding models. These initiatives can influence the quality of this data both positively and negatively as people tend to pay more attention to the data and its quality when financial decisions are based upon it. Ontario’s funding formula uses data from several national databases housed at the Canadian Institute for Health Information (CIHI). These databases provide information on patient activity and clinical status across the continuum of care. As funding models may influence coding behaviour, CIHI is collaborating with the Ontario Ministry of Health and Long-Term Care to assess and monitor the quality of this data. CIHI is using data mining software and modelling techniques (that are often associated with “big data”) to identify data anomalies across multiple factors. The models identify what the “typical” clinical coding patterns are for key patient groups (for example, patients seen in special care units or discharged to home care), so that outliers can be identified, where patients do not fit the expected pattern. A key component of the modelling is segmenting the data based on patient, provider and hospital characteristics to take into account key differences in the delivery of health care and patient populations across the province. CIHI’s analysis identified several hospitals with coding practices that appear to be changing or significantly different from their peer group. Further investigation is required to understand why these differences exist and to develop appropriate strategies to mitigate variations.
Release date: 2016-03-24
20. Validation of the 10-item Kessler Psychological Distress Scale (K10) in the 2012 Aboriginal Peoples Survey Archived
Articles and reports: 82-003-X201600114307
Description:
Using the 2012 Aboriginal Peoples Survey, this study examined the psychometric properties of the 10-item Kessler Psychological Distress Scale (a short measure of non-specific psychological distress) for First Nations people living off reserve, Métis, and Inuit aged 15 or older.
Release date: 2016-01-20

Reference (78)

Reference (78) (70 to 80 of 78 results)

71. An increment-decrement model of secondary school progression for Canadian provinces Archived
Surveys and statistical programs – Documentation: 11-522-X19980015034
Description:
A model of secondary school progression has been estimated using data from the 1991 School Leavers Survey conducted by Statistics Canada. The data on which the school progression model was based comprised current educational status and responses to retrospective questions on the timing of schooling events. These data were sufficient for approximate reconstruction of educational event histories of each respondent. The school progression model was designed to be included in a larger, continuous time micro-simulation model. Its main features involve estimation -- by age, month of birth and season for both sexes in each province -- of rates of graduation, of dropout, of return and of dropout graduation. Estimation was reinforced with auxiliary 1991 Census and administative data.
Release date: 1999-10-22
72. Estimation with partial overlap longitudinal samples Archived
Surveys and statistical programs – Documentation: 11-522-X19980015035
Description:
In a longitudinal survey conducted for k periods some units may be observed for less than k of the periods. Examples include, surveys designed with partially overlapping subsamples, a pure panel survey with nonresponse, and a panel survey supplemented with additional samples for some of the time periods. Estimators of the regression type are exhibited for such surveys. An application to special studies associated with the National Resources Inventory is discussed.
Release date: 1999-10-22
73. Multivariate logistic regression for data from complex surveys Archived
Surveys and statistical programs – Documentation: 11-522-X19980015036
Description:
Multivariate logistic regression, introduced by Glonek and McCullagh (1995) as a generalisation of logistic regression, is useful in the analysis of longitudinal data as it allows for dependent repeated observations of a categorical variable and for incomplete response profiles. We show how the method can be extended to deal with data from complex surveys and we illustrate it on data from the Swiss Labour Force Survey. The effect of the sampling weights on the parameter estimates and their standard errors is considered.
Release date: 1999-10-22
74. Parameter estimation for a finite mixture of distributions for dichotomous longitudinal data: Comparing algorithms Archived
Surveys and statistical programs – Documentation: 11-522-X19980015037
Description:
For longitudinal data, mixed models are often used, since they allow analysts to take account of the correlation between different observations from the same individual. The finite mixture model may be considered as a special case of a mixed model. In this document, attention will be given to the maximum likelihood method. The maximization of the likelihood function for a finite mixture of distributions is generally more difficult than in the usual case of a single distribution and can require considerable time. The objective of this project was therefore primarily to identify the one or more algorithms that best meet the criteria of run time and of efficiency in finding the solution. To achieve this objective, a simulation study was carried out. Only the situation in which the dependent variable is dichotomous was considered. This situation is very useful in practice, since among other things it can be used to model discrete durations, such as the length of time in "low income" status.
Release date: 1999-10-22
75. Qualitative Aspects of the Survey of Labour and Income Dynamics (SLID) Test 3A Data Collection Archived
Surveys and statistical programs – Documentation: 75F0002M1993007
Description:
This report presents a summary evaluation of the quality of the data collected during the Survey of Labour and Income Dynamics (SLID) field test of labour market activity data, held in January and February 1993.
Release date: 1995-12-30
76. Qualitative Aspects of the Survey of Labour and Income Dynamics (SLID) Test 3B Data Collection Archived
Surveys and statistical programs – Documentation: 75F0002M1993011
Description:
This report presents a summary evaluation of the quality of the data collected during the Survey of Labour and Income Dynamics (SLID) field test of income and wealth, held in April and May 1993.
Release date: 1995-12-30
77. Data Quality of Income Data Using Computer-assisted Interviewing: SLID Experience Archived
Surveys and statistical programs – Documentation: 75F0002M1994015
Description:
This paper describes how the computer-assisted interviewing (CAI) income application was programmed for a Survey of Labour and Income Dynamics (SLID) test conducted in 1993.
Release date: 1995-12-30
78. Some Effects of Computer-assisted Interviewing on the Data Quality of the Survey of Labour and Income Dynamics Archived
Surveys and statistical programs – Documentation: 75F0002M1995007
Description:
This paper describes the impact of computer-assisted interviewing (CAI) on the quality of the Survey of Labour and Income Dynamics (SLID) data in three content areas: labour force activity, respondent-sensitive sources of income, and household relationships.
Release date: 1995-12-30

Report a problem or mistake on this page

Date modified:: 2024-09-10