Analysis

Results

All (138) (130 to 140 of 138 results)

131. Statistical properties of crop production estimators Archived
Articles and reports: 12-001-X198700114468
Description:
The National Agricultural Statistics Service, U.S. Department of Agriculture, conducts yield surveys for a variety of field crops in the United States. While field sampling procedures for various crops differ, the same basic survey design is used for all crops. The survey design and current estimators are reviewed. Alternative estimators of yield and production and of the variance of the estimators are presented. Current estimators and alternative estimators are compared, both theoretically and in a Monte Carlo simulation.
Release date: 1987-06-15
132. Stratification in the Canadian Labour Force Survey Archived
Articles and reports: 12-001-X198500214372
Description:
The use of a multivariate clustering algorithm to perform stratification for the Labour Force Survey is described. The algorithm developed by Friedman and Rubin (1967) is modified to allow the formation of geographically contiguous strata and to delineate heterogeneous but compact primary sampling units (PSUs) within these strata. Studies dealing with stratification variables, stratification robustness over time, and type of stratification are described.
Release date: 1985-12-16
133. Application of linear and log-linear models to data from complex samples Archived
Articles and reports: 12-001-X198400114351
Description:
Most sample surveys conducted by organizations such as Statistics Canada or the U.S. Bureau of the Census employ complex designs. The design-based approach to statistical inference, typically the institutional standard of inference for simple population statistics such as means and totals, may be extended to parameters of analytic models as well. Most of this paper focuses on application of design-based inferences to such models, but rationales are offered for use of model-based alternatives in some instances, by way of explanation for the author’s observation that both modes of inference are used in practice at his own institution.
Within the design-based approach to inference, the paper briefly describes experience with linear regression analysis. Recently, variance computations for a number of surveys of the Census Bureau have been implemented through “replicate weighting”; the principal application has been for variances of simple statistics, but this technique also facilitates variance computation for virtually any complex analytic model. Finally, approaches and experience with log-linear models are reported.
Release date: 1984-06-15
134. Least squares and related analyses for complex survey designs Archived
Articles and reports: 12-001-X198400114352
Description:
The paper shows different estimation methods for complex survey designs. Among others, estimation of mean, ratio and regression coefficient is presented. The standard errors are estimated by different methods: the ordinary least squares procedure, the stratified weighted sample procedure, the stratified unit weight procedure, etc. Theory of large samples and conditions to apply it are also presented.
Release date: 1984-06-15
135. Estimating monthly gross flows in labour force participation Archived
Articles and reports: 12-001-X198300114335
Description:
The Canadian Labour Force Survey is a household survey conducted each month for the purpose of producing point-in-time estimates of the number of persons employed, unemployed and not in the labor force. The survey has a rotating panel design in which all individuals in a sampled household location are interviewed each month, for six consecutive months. In the past, little use has been made of this longitudinal structure, although considerable interest has been expressed in the month-to-month gross flows (transitions) amongst the labour force status categories. In this paper we discuss methods being considered by Statistics Canada for the production of gross flow estimates, but from a model-based perspective.
Release date: 1983-06-15
136. Data, statistics, information - Some issues of the Canadian Social Statistics Scene Archived
Articles and reports: 12-001-X197900254833
Description:
This paper looks at the current state of development of social statistics in Canada. Some key concepts related to statistics and social information are defined and discussed. The availability and analysis of administrative data is highlighted, along with the need for social surveys. Suggestions are made about the types of data analysis needed for the development of social decision models to meet policy requirements. Finally, an outline of priorities for future work toward the effective use of social statistics is given.
Release date: 1979-12-14
137. Approximate tests of independence and goodness of fit based on stratified multi-stage samples Archived
Articles and reports: 12-001-X197800154831
Description: The impact on linear statistics of the sample design used in obtaining survey data is the subject of much of sampling literature. Recently, more attention has been paid to the design’s impact on non-linear statistics; the major factor inhibiting these investigations has been the problem of estimating at least the first two moments of such statistics. The present article examines the problem of estimating the variances of non-linear statistics from complex samples, in the light of existing literature. The behaviour of the chi-square statistic computed from a complex sample to test hypotheses of goodness of fit or independence is studied. Alternative tests are developed and their properties studied in simulation experiments.
Release date: 1978-06-15
138. Controlled random rounding Archived
Articles and reports: 12-001-X197500254825
Description:
Random rounding is a technique to ensure confidentiality of aggregate statistics. By randomly rounding all the components of a total, independently, together with the random rounding of the total itself, substantial discrepancies may arise when aggregating the published data. This paper presents a procedure which avoids substantial discrepancies while still protecting the concept of confidentiality.
Release date: 1975-12-15

Stats in brief (3) (3 results)

1. Data ethics part 2: Ethical reviews
Stats in brief: 89-20-00062022004
Description:
Gathering, exploring, analyzing and interpreting data are essential steps in producing information that benefits society, the economy and the environment. In this video, we will discuss the importance of considering data ethics throughout the process of producing statistical information.

As a pre-requisite to this video, make sure to watch the video titled “Data Ethics: An introduction” also available in Statistics Canada’s data literacy training catalogue.

Release date: 2022-10-17
2. Data ethics: An introduction Archived
Stats in brief: 89-20-00062022001
Description:
Gathering, exploring, analyzing and interpreting data are essential steps in producing information that benefits society, the economy and the environment. To properly conduct these processes, data ethics must be upheld in order to ensure the appropriate use of data.

Release date: 2022-05-24
3. FAIR data principles: What is FAIR? Archived
Stats in brief: 89-20-00062022002
Description:
This video will break down what it means to be FAIR in terms of data and metadata, and how each pillar of FAIR serves to guide data users and producers alike, as they navigate their way through the data journey, in order to gain maximum, long term value.

Release date: 2022-05-24

Articles and reports (134) (0 to 10 of 134 results)

1. A proposal for the problem of matching probabilities estimation in record linkage Archived
Articles and reports: 11-522-X202200100001
Description: Record linkage aims at identifying record pairs related to the same unit and observed in two different data sets, say A and B. Fellegi and Sunter (1969) suggest testing whether each record pair is generated from the set of matched or unmatched pairs. The decision function consists of the ratio between m(y) and u(y), the probabilities of observing a comparison y of a set of k>3 key identifying variables in a record pair under the assumptions that the pair is a match or a non-match, respectively. These parameters are usually estimated by means of the EM algorithm using as data the comparisons on all the pairs of the Cartesian product Ω=A×B. These observations (on the comparisons and on the pairs' status as match or non-match) are assumed to be generated independently of other pairs, an assumption characterizing most of the literature on record linkage and implemented in software tools (e.g. RELAIS, Cibella et al. 2012). On the contrary, comparisons y and matching status in Ω are deterministically dependent. As a result, estimates of m(y) and u(y) based on the EM algorithm are usually poor. This fact jeopardizes the effective application of the Fellegi-Sunter method, as well as the automatic computation of quality measures and the possibility of applying efficient methods for model estimation on linked data (e.g. regression functions), as in Chambers et al. (2015). We propose to explore Ω by a set of samples, each one drawn so as to preserve independence of comparisons among the selected record pairs. Simulations are encouraging.
Release date: 2024-03-25
2. A Model-based Disaggregation Method for Estimation of Adult Competency Archived
Articles and reports: 11-522-X202200100003
Description: Estimation at fine levels of aggregation is necessary to better describe society. Small area estimation model-based approaches that combine sparse survey data with rich data from auxiliary sources have been proven useful to improve the reliability of estimates for small domains. Considered here is a scenario where small area model-based estimates, produced at a given aggregation level, needed to be disaggregated to better describe the social structure at finer levels. For this scenario, an allocation method was developed to implement the disaggregation, overcoming challenges associated with data availability and model development at such fine levels. The method is applied to adult literacy and numeracy estimation at the county-by-group-level, using data from the U.S. Program for the International Assessment of Adult Competencies. In this application the groups are defined in terms of age or education, but the method could be applied to estimation of other equity-deserving groups.
Release date: 2024-03-25
3. Children born into vulnerability: Challenges encountered in a Quebec longitudinal survey Archived
Articles and reports: 11-522-X202200100010
Description: Growing Up in Québec is a longitudinal population survey that began in the spring of 2021 at the Institut de la statistique du Québec. Among the children targeted by this longitudinal follow-up, some will experience developmental difficulties at some point in their lives. Those same children often have characteristics associated with higher sample attrition (low-income family, parents with a low level of education). This article describes the two main challenges we encountered when trying to ensure sufficient representativeness of these children, in both the overall results and the subpopulation analyses.
Release date: 2024-03-25
4. Bayesian model assisted design-based estimators of the size, total and mean of a hard-to-reach population from a link-tracing sample with initial cluster sample Archived
Articles and reports: 11-522-X202200100015
Description: We present design-based Horvitz-Thompson and multiplicity estimators of the population size, as well as of the total and mean of a response variable associated with the elements of a hidden population to be used with the link-tracing sampling variant proposed by Félix-Medina and Thompson (2004). Since computing the estimators requires the inclusion probabilities of the sampled people, which are unknown, we propose a Bayesian model which allows us to estimate them, and consequently to compute the estimators of the population parameters. The results of a small numeric study indicate that the performance of the proposed estimators is acceptable.
Release date: 2024-03-25
5. Integration of existing data to develop an ethnicity indicator in the LSDDP Archived
Articles and reports: 11-522-X202200100018
Description: The Longitudinal Social Data Development Program (LSDDP) is a social data integration approach aimed at providing longitudinal analytical opportunities without imposing additional burden on respondents. The LSDDP uses a multitude of signals from different data sources for the same individual, which helps to better understand their interactions and track changes over time. This article looks at how the ethnicity status of people in Canada can be estimated at the most detailed disaggregated level possible using the results from a variety of business rules applied to linked data and to the LSDDP denominator. It will then show how improvements were obtained using machine learning methods, such as decision trees and random forest techniques.
Release date: 2024-03-25
6. Statistics Canada’s Quality of Life Statistics Program: April 2021 to March 2023
Articles and reports: 75F0002M2023001
Description: This discussion paper describes the work being achieved and undertaken by Statistics Canada, in partnership with the Treasury Board of Canada Secretariat, the Department of Finance Canada and the Privy Council Office, on developing the Quality of Life Framework for Canada and related outputs, including an online Hub. This is the first paper in a series that will provide updates on the progress of work relating to the Framework.
Release date: 2023-04-19
7. Health Utilities Index Mark 3 scores for children and youth: Population norms for Canada based on cycles 5 (2016 and 2017) and 6 (2018 and 2019) of the Canadian Health Measures Survey
Articles and reports: 82-003-X202300200003
Description: Utility scores are an important tool for evaluating health-related quality of life. Utility score norms have been published for Canadian adults, but no nationally representative utility score norms are available for non-adults. Using Health Utilities Index Mark 3 (HUI3) data from two recent cycles of the Canadian Health Measures Survey (i.e., 2016-2017 and 2018-2019), this is the first study to provide utility score norms for children aged 6 to 11 years and adolescents aged 12 to 17 years.
Release date: 2023-02-15
8. Investigating the Use of Blockchain to Authenticate Data from the Statistics Canada Website
Articles and reports: 11-633-X2022007
Description:
This paper investigates how Statistics Canada can increase trust by giving users the ability to authenticate data from its website through digital signatures and blockchain technology.

Release date: 2022-09-19
9. Bayesian inference for a variance component model using pairwise composite likelihood with survey data
Articles and reports: 12-001-X202200100002
Description:
We consider an intercept only linear random effects model for analysis of data from a two stage cluster sampling design. At the first stage a simple random sample of clusters is drawn, and at the second stage a simple random sample of elementary units is taken within each selected cluster. The response variable is assumed to consist of a cluster-level random effect plus an independent error term with known variance. The objects of inference are the mean of the outcome variable and the random effect variance. With a more complex two stage sampling design, the use of an approach based on an estimated pairwise composite likelihood function has appealing properties. Our purpose is to use our simpler context to compare the results of likelihood inference with inference based on a pairwise composite likelihood function that is treated as an approximate likelihood, in particular treated as the likelihood component in Bayesian inference. In order to provide credible intervals having frequentist coverage close to nominal values, the pairwise composite likelihood function and corresponding posterior density need modification, such as a curvature adjustment. Through simulation studies, we investigate the performance of an adjustment proposed in the literature, and find that it works well for the mean but provides credible intervals for the random effect variance that suffer from under-coverage. We propose possible future directions including extensions to the case of a complex design.

Release date: 2022-06-21
10. Measuring Social Capital at the Neighbourhood Level: Experimental Estimates of Sense of Belonging to the Local Community Measured at the Census Tract Level
Articles and reports: 11-633-X2021007
Description:
Statistics Canada continues to use a variety of data sources to provide neighbourhood-level variables across an expanding set of domains, such as sociodemographic characteristics, income, services and amenities, crime, and the environment. Yet, despite these advances, information on the social aspects of neighbourhoods is still unavailable. In this paper, answers to the Canadian Community Health Survey on respondents’ sense of belonging to their local community were pooled over the four survey years from 2016 to 2019. Individual responses were aggregated up to the census tract (CT) level.
Release date: 2021-11-16

Journals and periodicals (1) (1 result)

1. Validation Study for a Record Linkage of Births and Infant Deaths in Canada Archived
Journals and periodicals: 84F0013X
Geography: Canada, Province or territory
Description:
This study was initiated to test the validity of probabilistic linkage methods used at Statistics Canada. It compared the results of data linkages on infant deaths in Canada with infant death data from Nova Scotia and Alberta. It also compared the availability of fetal deaths on the national and provincial files.
Release date: 1999-10-08

Date modified: 2024-06-23
