Collection and questionnaires

Skip to filters. View results.

Filter results by

Search Help
Currently selected filters that can be removed

Keyword(s)

Geography

3 facets displayed. 0 facets selected.

Content

1 facets displayed. 0 facets selected.
Sort Help
entries

Results

All (346)

All (346) (0 to 10 of 346 results)

  • Articles and reports: 12-001-X202500200002
    Description: This study examines interviewer effects on household nonresponse in three waves of the Household Finance and Consumption Survey (HFCS) in Austria using a multilevel model. Addressing nonresponse at its source is crucial for maintaining survey data quality and representativeness. Our findings indicate that the variation in response behavior explained by interviewer effects decreased from about one-third in the first wave to 7% in the third wave. Effective interviewers tend to have a university degree, be married, homeowners, and have a larger workload. Additionally, higher mean wages in the household’s municipality negatively affect survey participation. These insights suggest targeted interviewer selection and training strategies to improve response rates.
    Release date: 2025-12-23

  • Articles and reports: 11-522-X202500100003
    Description: In-person data collection is critical for the success of many large government-sponsored surveys. Despite response rate declines and increasing costs, the mode remains the gold standard for meeting the most rigorous survey requirements for federal survey programs, particularly as part of a multimode data collection strategy (Schober, 2018). However, over the last ten years critical labor market and workforce changes, exacerbated by the pandemic, have made in-person data collection efforts prohibitive for all but the largest survey organizations. Shifting ideas about job flexibility and job satisfaction alongside the increasingly technical role and demanding nature of the job have impacted recruitment and retention for survey organizations across the U.S. and Europe (Charman et al., 2024). The trends in U.S. field data collector employment are summarized and it is outlined that there are promising practices in recruiting and retaining high quality field data collectors. Additionally, broader ways to structure the field data collector labor force for continued success are considered, including supplementing field data collection with multimode alternatives such as video interviewing and updating value propositions for respondents.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100004
    Description: The Survey of Household Spending (SHS) conducted by Statistics Canada collects paper diaries and shopping receipts as a source of household expenditure data. An auto-capturing algorithm was created for SHS 2023 to reduce statistical clerks' manual work of extracting important information from scanned receipts of common store brands. The algorithm used Tesseract optical character recognition (OCR) to extract text characters from images of receipts, and it identified store and product entities using regular expressions, also known as regex. The goal of this study was to enhance the current auto-capture algorithm by experimenting with more advanced OCR and machine learning methods. As a result, PaddleOCR, an open-source OCR toolkit, was selected as the new default OCR engine due to its overall performance in recognizing texts, especially digits, accurately across receipts of various qualities. Additionally, entity classifiers based on support vector machines were trained on historical SHS records and existing regex patterns. By using classifiers to categorize different elements present on receipts instead of relying solely on regex patterns, product and store recognition improved. It is expected that this new algorithm will be used for SHS 2025 to improve the auto-capture quality and reduce the manual burden associated with capturing receipt variables.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100008
    Description: In 2020, Statistics Canada started to use probabilistic web panels as an alternate method of collecting official statistics. In a web panel, respondents to another survey are asked for contact information to participate in future short surveys. This paper will highlight Statistics Canada's experience with panels after 4 years, including what has been learned about the recruitment of panel participants and how to subsequently collect data using panel surveys. The ways in which recruitment questions are presented can result in very different rates of participation. Moreover, the wealth of auxiliary information available on the recruitment survey can be used to actively manage panel collection operations, by predicting the probability of response and using this information to target follow-up efforts.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100013
    Description: As part of answering the call to action for the United Nations' (UN) 17 Sustainable Development Goals, as well as addressing social, economic, and equity challenges within Canada, Statistics Canada's five-year development phase for the Disaggregated Data Action Plan (DDAP) was funded in 2021 to support data driven decision around these challenges. In turn, the document "Guiding Principles: Leveraging the 2021 Census of Populations Data for DDAP Groups of Interest" were created. The guiding principles document explains the organizational framework of the DDAP in the Agency, describes existing data sources, addresses ethical and privacy concerns, and centralizes sampling methods tailored for DDAP initiatives while accounting for characteristics which can complicate sampling and data collection procedures.
    Release date: 2025-09-08

  • Surveys and statistical programs – Documentation: 98-20-00052026004
    Description: This report provides detailed insight into the design and methodology of the content test component of the 2024 Census Test. This test evaluated changes to the wording and flow of some questions, as well as the potential addition of new questions, to help determine the content of the 2026 Census of Population.
    Release date: 2025-07-04

  • Surveys and statistical programs – Documentation: 32-26-0008
    Description: This report describes the main changes, additions or deletions to the Census of Agriculture questionnaire by topic and in the order they appear on the questionnaire.
    Release date: 2025-07-04

  • Articles and reports: 12-001-X202400200002
    Description: This paper investigates whether survey data quality fluctuates over the day. After laying out the argument theoretically, panel data from the Survey of Unemployed Workers in New Jersey are analyzed. Several indirect indicators of response error are investigated, including item nonresponse, interview completion time, rounding, and measures of the quality of time diary data. The evidence that we assemble for a time of day of interview effect is weak or nonexistent. Item nonresponse and the probability that interview completion time is among the 5% shortest appear to increase in the evening, but a more thorough assessment requires instrumental variables.
    Release date: 2024-12-20

  • Articles and reports: 12-001-X202400200006
    Description: As mixed-mode designs become increasingly popular, their effects on data quality have attracted much scholarly attention. Most studies focused on the bias properties of mixed-mode designs; few of them have investigated whether mixed-mode designs have heterogeneous variance structures across modes. While many characteristics of mixed-mode designs, such as varied interviewer usage, systematic differences in respondents, varying levels of social desirability bias, among others, may lead to heterogeneous variances in mode-specific point estimates of population means, this study specifically investigates whether interviewer variances remain consistent across different modes in mixed-mode studies. To address this research question, we utilize data collected from two distinct study designs. In the first design, when interviewers are responsible for either face-to-face or telephone mode, we examine whether there are mode differences in interviewer variances for 1) sensitive political questions, 2) international items, 3) and item missing indicators on international items, using the Arab Barometer wave 6 Jordan data. In the second design, we draw on Health and Retirement Study (HRS) 2016 core survey data to examine the question on three topics when interviewers are responsible for both modes. The topics cover 1) the CESD depression scale, 2) interviewer observations, and 3) the physical activity scale. To account for the lack of interpenetrated designs in both data sources, we include respondent-level covariates in our models. We find significant differences in interviewer variances on one item (twelve items in total) in the Arab Barometer study; whereas for HRS, the results are three out of eighteen. Overall, we find the magnitude of the interviewer variances larger in FTF than TEL on sensitive items. We conduct simulations to understand the power to detect mode effects in the typically modest interviewer sample sizes.
    Release date: 2024-12-20

  • Articles and reports: 11-522-X202200100011
    Description: In 2021, Statistics Canada initiated the Disaggregated Data Action Plan, a multi-year initiative to support more representative data collection methods, enhance statistics on diverse populations to allow for intersectional analyses, and support government and societal efforts to address known inequalities and bring considerations of fairness and inclusion into decision making. As part of this initiative, we are building the Survey Series on People and their Communities, a new probabilistic panel specifically designed to collect data that can be disaggregated according to racialized group. This new tool will allow us to address data gaps and emerging questions related to diversity. This paper will give an overview of the design of the Survey Series on People and their Communities.
    Release date: 2024-03-25
Data (0)

Data (0) (0 results)

No content available at this time.

Analysis (246)

Analysis (246) (0 to 10 of 246 results)

  • Articles and reports: 12-001-X202500200002
    Description: This study examines interviewer effects on household nonresponse in three waves of the Household Finance and Consumption Survey (HFCS) in Austria using a multilevel model. Addressing nonresponse at its source is crucial for maintaining survey data quality and representativeness. Our findings indicate that the variation in response behavior explained by interviewer effects decreased from about one-third in the first wave to 7% in the third wave. Effective interviewers tend to have a university degree, be married, homeowners, and have a larger workload. Additionally, higher mean wages in the household’s municipality negatively affect survey participation. These insights suggest targeted interviewer selection and training strategies to improve response rates.
    Release date: 2025-12-23

  • Articles and reports: 11-522-X202500100003
    Description: In-person data collection is critical for the success of many large government-sponsored surveys. Despite response rate declines and increasing costs, the mode remains the gold standard for meeting the most rigorous survey requirements for federal survey programs, particularly as part of a multimode data collection strategy (Schober, 2018). However, over the last ten years critical labor market and workforce changes, exacerbated by the pandemic, have made in-person data collection efforts prohibitive for all but the largest survey organizations. Shifting ideas about job flexibility and job satisfaction alongside the increasingly technical role and demanding nature of the job have impacted recruitment and retention for survey organizations across the U.S. and Europe (Charman et al., 2024). The trends in U.S. field data collector employment are summarized and it is outlined that there are promising practices in recruiting and retaining high quality field data collectors. Additionally, broader ways to structure the field data collector labor force for continued success are considered, including supplementing field data collection with multimode alternatives such as video interviewing and updating value propositions for respondents.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100004
    Description: The Survey of Household Spending (SHS) conducted by Statistics Canada collects paper diaries and shopping receipts as a source of household expenditure data. An auto-capturing algorithm was created for SHS 2023 to reduce statistical clerks' manual work of extracting important information from scanned receipts of common store brands. The algorithm used Tesseract optical character recognition (OCR) to extract text characters from images of receipts, and it identified store and product entities using regular expressions, also known as regex. The goal of this study was to enhance the current auto-capture algorithm by experimenting with more advanced OCR and machine learning methods. As a result, PaddleOCR, an open-source OCR toolkit, was selected as the new default OCR engine due to its overall performance in recognizing texts, especially digits, accurately across receipts of various qualities. Additionally, entity classifiers based on support vector machines were trained on historical SHS records and existing regex patterns. By using classifiers to categorize different elements present on receipts instead of relying solely on regex patterns, product and store recognition improved. It is expected that this new algorithm will be used for SHS 2025 to improve the auto-capture quality and reduce the manual burden associated with capturing receipt variables.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100008
    Description: In 2020, Statistics Canada started to use probabilistic web panels as an alternate method of collecting official statistics. In a web panel, respondents to another survey are asked for contact information to participate in future short surveys. This paper will highlight Statistics Canada's experience with panels after 4 years, including what has been learned about the recruitment of panel participants and how to subsequently collect data using panel surveys. The ways in which recruitment questions are presented can result in very different rates of participation. Moreover, the wealth of auxiliary information available on the recruitment survey can be used to actively manage panel collection operations, by predicting the probability of response and using this information to target follow-up efforts.
    Release date: 2025-09-08

  • Articles and reports: 11-522-X202500100013
    Description: As part of answering the call to action for the United Nations' (UN) 17 Sustainable Development Goals, as well as addressing social, economic, and equity challenges within Canada, Statistics Canada's five-year development phase for the Disaggregated Data Action Plan (DDAP) was funded in 2021 to support data driven decision around these challenges. In turn, the document "Guiding Principles: Leveraging the 2021 Census of Populations Data for DDAP Groups of Interest" were created. The guiding principles document explains the organizational framework of the DDAP in the Agency, describes existing data sources, addresses ethical and privacy concerns, and centralizes sampling methods tailored for DDAP initiatives while accounting for characteristics which can complicate sampling and data collection procedures.
    Release date: 2025-09-08

  • Articles and reports: 12-001-X202400200002
    Description: This paper investigates whether survey data quality fluctuates over the day. After laying out the argument theoretically, panel data from the Survey of Unemployed Workers in New Jersey are analyzed. Several indirect indicators of response error are investigated, including item nonresponse, interview completion time, rounding, and measures of the quality of time diary data. The evidence that we assemble for a time of day of interview effect is weak or nonexistent. Item nonresponse and the probability that interview completion time is among the 5% shortest appear to increase in the evening, but a more thorough assessment requires instrumental variables.
    Release date: 2024-12-20

  • Articles and reports: 12-001-X202400200006
    Description: As mixed-mode designs become increasingly popular, their effects on data quality have attracted much scholarly attention. Most studies focused on the bias properties of mixed-mode designs; few of them have investigated whether mixed-mode designs have heterogeneous variance structures across modes. While many characteristics of mixed-mode designs, such as varied interviewer usage, systematic differences in respondents, varying levels of social desirability bias, among others, may lead to heterogeneous variances in mode-specific point estimates of population means, this study specifically investigates whether interviewer variances remain consistent across different modes in mixed-mode studies. To address this research question, we utilize data collected from two distinct study designs. In the first design, when interviewers are responsible for either face-to-face or telephone mode, we examine whether there are mode differences in interviewer variances for 1) sensitive political questions, 2) international items, 3) and item missing indicators on international items, using the Arab Barometer wave 6 Jordan data. In the second design, we draw on Health and Retirement Study (HRS) 2016 core survey data to examine the question on three topics when interviewers are responsible for both modes. The topics cover 1) the CESD depression scale, 2) interviewer observations, and 3) the physical activity scale. To account for the lack of interpenetrated designs in both data sources, we include respondent-level covariates in our models. We find significant differences in interviewer variances on one item (twelve items in total) in the Arab Barometer study; whereas for HRS, the results are three out of eighteen. Overall, we find the magnitude of the interviewer variances larger in FTF than TEL on sensitive items. We conduct simulations to understand the power to detect mode effects in the typically modest interviewer sample sizes.
    Release date: 2024-12-20

  • Articles and reports: 11-522-X202200100011
    Description: In 2021, Statistics Canada initiated the Disaggregated Data Action Plan, a multi-year initiative to support more representative data collection methods, enhance statistics on diverse populations to allow for intersectional analyses, and support government and societal efforts to address known inequalities and bring considerations of fairness and inclusion into decision making. As part of this initiative, we are building the Survey Series on People and their Communities, a new probabilistic panel specifically designed to collect data that can be disaggregated according to racialized group. This new tool will allow us to address data gaps and emerging questions related to diversity. This paper will give an overview of the design of the Survey Series on People and their Communities.
    Release date: 2024-03-25

  • Articles and reports: 11-522-X202200100016
    Description: To overcome the traditional drawbacks of chain sampling methods, the sampling method called “network sampling with memory” was developed. Its unique feature is to recreate, gradually in the field, a frame for the target population composed of individuals identified by respondents and to randomly draw future respondents from this frame, thereby minimizing selection bias. Tested for the first time in France between September 2020 and June 2021, for a survey among Chinese immigrants in Île-de-France (ChIPRe), this presentation describes the difficulties encountered during collection—sometimes contextual, due to the pandemic, but mostly inherent to the method.
    Release date: 2024-03-25

  • Articles and reports: 12-001-X202300200014
    Description: Many things have been written about Jean-Claude Deville in tributes from the statistical community (see Tillé, 2022a; Tillé, 2022b; Christine, 2022; Ardilly, 2022; and Matei, 2022) and from the École nationale de la statistique et de l’administration économique (ENSAE) and the Société française de statistique. Pascal Ardilly, David Haziza, Pierre Lavallée and Yves Tillé provide an in-depth look at Jean-Claude Deville’s contributions to survey theory. To pay tribute to him, I would like to discuss Jean-Claude Deville’s contribution to the more day-to-day application of methodology for all the statisticians at the Institut national de la statistique et des études économiques (INSEE) and at the public statistics service. To do this, I will use my work experience, and particularly the four years (1992 to 1996) I spent working with him in the Statistical Methods Unit and the discussions we had thereafter, especially in the 2000s on the rolling census.
    Release date: 2024-01-03
Reference (100)

Reference (100) (20 to 30 of 100 results)

  • Surveys and statistical programs – Documentation: 75F0002M2007001
    Description: The Survey of Labour and Income Dynamics (SLID) is a longitudinal survey which collects information related to the standard of living of individuals and their families. By interviewing the same people over a period of six years, changes and the causes of these changes can be monitored.

    A preliminary interview of background information is collected for all respondents aged 16 and over, who enter the SLID sample. Preliminary interviews are conducted for new household members during their first labour and income interview after they join the household. A labour and income interview is collected each year for all respondents 16 years of age and over.

    The purpose of this document is to present the questions, possible responses and question flows for the 2006 preliminary, labour and income questionnaire (for the 2005 reference year).

    Release date: 2007-05-10

  • Surveys and statistical programs – Documentation: 75F0002M2007002
    Description: The Survey of Labour and Income Dynamics (SLID) conducts an annual labour and income interview in January. The data are collected using computer-assisted interviewing; thus there are no paper questionnaires required for data collection. The questions, responses and interview flow for labour and income are documented in another SLID research paper. This document presents the information for the 2006 entry and exit portions of the labour and income interview (for the 2005 reference year).

    The entry exit component consists of five separate modules. The entry module is the first set of data collected. It is information collected to update the place of residence, housing conditions and expenses, as well as the household composition. For each person identified in entry, the demographics module collects (or updates) the person's name, date of birth, sex and marital status. Then the relationships module identifies (or updates) the relationship between each respondent and every other household member. The exit module includes questions on who to contact for the next interview and the names, phone numbers and addresses of two contacts to be used only if future tracing of respondents is required. An overview of the tracing component is also included in this document.

    Release date: 2007-05-10

  • Surveys and statistical programs – Documentation: 75F0002M2006001
    Description:

    A Preliminary interview of background information is collected for all respondents aged 16 and over, who enter the sample for the Survey of Labour and Income Dynamics (SLID). For the majority of the longitudinal respondents, this occurs when a new panel is introduced and the preliminary information is collected during the first Labour interview. However, all persons living with a longitudinal respondent are also interviewed for SLID. Thus Preliminary interviews are conducted for new household members during their first Labour interview after they join the household. Longitudinal persons who have turned 16 while their household is in the SLID sample are then eligible for SLID interviews so they are asked the Preliminary interview questions during their first Labour interview.

    The purpose of this document is to present the questions, possible responses and question flows for the 2005 Preliminary questionnaire (for the 2004 reference year).

    Release date: 2006-04-06

  • Surveys and statistical programs – Documentation: 75F0002M2006003
    Description:

    The Survey of Income and Labour Dynamics (SLID) interview is conducted using computer-assisted interviewing (CAI). CAI is paperless interviewing. This document is therefore a written approximation of the CAI interview, or the questionnaire.

    In previous years, SLID conducted a Labour interview each January and a separate Income interview in May. In 2005 (reference year 2004) the two interviews were combined and collected in one interview in January.

    A labour and income interview is collected for all respondents 16 years of age and over. Respondents have the option of answering income questions during the interview, or of giving Statistics Canada permission to use their income tax records.

    In January 2005, data was collected for reference year 2004 from panels 3 and 4. Panel 3, in its sixth and final year, consisted of approximately 17,000 households and panel 4, in its third year, also consisted of approximately 17,000 households.

    This document outlines the structure of the January 2005 Labour and Income interview (for the 2004 reference year) including question wording, possible responses, and flows of questions.

    Release date: 2006-04-06

  • Surveys and statistical programs – Documentation: 75F0002M2006002
    Description:

    In previous years, the Survey of Labour and Income Dynamics (SLID) conducted a Labour interview each January and a separate Income interview in May. In 2005 (reference year 2004) the two interviews were combined and collected in one interview in January.

    The data are collected using computer-assisted interviewing. Thus there are no paper questionnaires required for data collection. The questions, responses and interview flow for Labour and Income are documented in other SLID research papers. This document presents the information for the 2005 Entry Exit portion of the Labour Income interview (for the 2004 reference year).

    The Entry Exit Component consists of five separate modules. The Entry module is the first set of data collected. It is information collected to update the place of residence, housing conditions and expenses, as well as the household composition. For each person identified in Entry, the Demographics module collects (or updates) the person's name, date of birth, sex and marital status. Then the Relationships module identifies (or updates) the relationship between each respondent and every other household member. The Exit module includes questions on who to contact for the next interview and the names, phone numbers and addresses of two contacts to be used only if future tracing of respondents is required. An overview of the Tracing component is also included in this document.

    Release date: 2006-03-27

  • Surveys and statistical programs – Documentation: 92-133-X
    Description:

    This report describes changes planned for the 2006 Census education questions. Education questions are a part of the Form 2B (the long form) of the census. This form is completed by 20% of all households. These changes were tested in the May 2004 Census test of over 300,000 households. The changes aim to address data limitations in the 2001 Census questions and to enhance their relevance to education studies by allowing a better reflection of the range of educational pathways taken by Canadians. The report includes an explanation of the reasons for modifying the 2006 Census education content, a detailed look at each of the changes, and a discussion on historical consistency.

    Release date: 2005-08-31

  • Surveys and statistical programs – Documentation: 75F0002M2005006
    Description:

    A preliminary interview of background information is collected for all respondents aged 16 and over, who enter the sample for the Survey of Labour and Income Dynamics (SLID). For the majority of the longitudinal respondents, this occurs when a new panel is introduced and the preliminary information is collected during the first Labour interview. However, all persons living with a longitudinal respondent are also interviewed for SLID. Thus Preliminary interviews are conducted for new household members during their first Labour interview after they join the household. Longitudinal persons who have turned 16 while their household is in the SLID sample are then eligible for SLID interviews so they are asked the Preliminary interview questions during their first Labour interview.

    The purpose of this document is to present the questions, possible responses and question flows for the 2004 Preliminary questionnaire (for the 2003 reference year).

    Release date: 2005-06-16

  • Surveys and statistical programs – Documentation: 75F0002M2005007
    Description:

    Every January, the Survey of Labour and Income Dynamics (SLID) Labour interview is conducted using computer-assisted interviewing (CAI). CAI is paperless interviewing. This document is therefore a written approximation of the CAI interview, or the questionnaire.

    A labour interview is collected for all respondents 16 years of age and over. In January, 2004 data was collected for reference year 2003 from panels 3 and 4. Panel 3, in its fifth year, consisted of approximately 17,000 households and panel 4, in its second year, also consisted of approximately 17,000 households.

    This document outlines the structure of the January 2004 Labour interview (for the 2003 reference year) including question wording, possible responses, and flows of questions.

    Release date: 2005-06-16

  • Surveys and statistical programs – Documentation: 75F0002M2005008
    Description:

    In May 2004 the Survey of Labour and Income Dynamics (SLID) collected data on income from both its third and fourth panels. Panel 3 was in its fifth year of collection and panel 4 was in its second year.

    Respondents had the option of answering income questions in an interview, or of giving permission to Statistics Canada to allow SLID to use the information on their income tax return.

    The purpose of this document is to present the questions, possible responses and question flows for the 2004 Income questionnaire (for the 2003 reference year).

    Release date: 2005-06-16

  • Surveys and statistical programs – Documentation: 89-552-M2005013
    Geography: Canada
    Description:

    This report documents key aspects of the development of the International Adult Literacy and Life Skills Survey (ALL) - its theoretical roots, the domains selected for possible assessment, the approaches taken to assessment in each domain and the criteria that were employed to decide which domains were to be carried in the final design. As conceived, the ALL survey was meant to build on the success of the International Adult Literacy Survey (IALS) assessments by extending the range of skills assessed and by improving the quality of the assessment methods employed. This report documents several successes including: · the development of a new framework and associated robust measures for problem solving · the development of a powerful numeracy framework and associated robust measures · the specification of frameworks for practical cognition, teamwork and information and communication technology literacy The report also provides insight into those domains where development failed to yield approaches to assessment of sufficient quality, insight that reminds us that scientific advance in this domain is hard won.

    Release date: 2005-03-24