Results

All (300) (60 to 70 of 300 results)

  • Articles and reports: 12-001-X201300111823
    Description:

    Although weights are widely used in survey sampling, their ultimate justification from the design perspective is often problematic. Here we will argue for a stepwise Bayes justification for weights that does not depend explicitly on the sampling design. This approach will make use of the standard kind of information present in auxiliary variables; however, it will not assume a model relating the auxiliary variables to the characteristic of interest. The resulting weight for a unit in the sample can be given the usual interpretation as the number of units in the population which it represents.

    Release date: 2013-06-28

  • Articles and reports: 12-001-X201300111824
    Description:

    In most surveys, all sample units receive the same treatment and the same design features apply to all selected people and households. This paper explains how survey designs may be tailored to optimize quality given constraints on costs. Such designs are called adaptive survey designs. The basic ingredients of such designs are introduced, discussed and illustrated with various examples.

    Release date: 2013-06-28

  • Articles and reports: 12-001-X201300111825
    Description:

    A considerable limitation of current methods for automatic data editing is that they treat all edits as hard constraints. That is to say, an edit failure is always attributed to an error in the data. In manual editing, however, subject-matter specialists also make extensive use of soft edits, i.e., constraints that identify (combinations of) values that are suspicious but not necessarily incorrect. The inability of automatic editing methods to handle soft edits partly explains why in practice many differences are found between manually edited and automatically edited data. The object of this article is to present a new formulation of the error localisation problem which can distinguish between hard and soft edits. Moreover, it is shown how this problem may be solved by an extension of the error localisation algorithm of De Waal and Quere (2003).

    Release date: 2013-06-28

  • Articles and reports: 12-001-X201300111827
    Description:

    SILC (Statistics on Income and Living Conditions) is an annual European survey that measures the population's income distribution, poverty and living conditions. It has been conducted in Switzerland since 2007, based on a four-panel rotation scheme that yields both cross-sectional and longitudinal estimates. This article examines the problem of estimating the variance of the cross-sectional poverty and social exclusion indicators selected by Eurostat. Our calculations take into account the non-linearity of the estimators, total non-response at different survey stages, indirect sampling and calibration. We adapt the method proposed by Lavallée (2002) for estimating variance in cases of non-response after weight sharing, and we obtain a variance estimator that is asymptotically unbiased and very easy to program.

    Release date: 2013-06-28

  • Articles and reports: 12-001-X201300111828
    Description:

    A question that commonly arises in longitudinal surveys is how to combine differing cohorts of the survey. In this paper we present a novel method for combining different cohorts, using all available data, to estimate parameters of a semiparametric model which relates the response variable to a set of covariates. The procedure builds upon the Weighted Generalized Estimation Equation method for handling missing waves in longitudinal studies. Our method is set up under a joint-randomization framework for estimation of model parameters, which takes into account the superpopulation model as well as the survey design randomization. We also propose a design-based and a joint-randomization variance estimation method. To illustrate the methodology we apply it to the Survey of Doctorate Recipients, conducted by the U.S. National Science Foundation.

    Release date: 2013-06-28

  • Articles and reports: 12-001-X201300111830
    Description:

    We consider two different self-benchmarking methods for the estimation of small area means based on the Fay-Herriot (FH) area level model: the method of You and Rao (2002) applied to the FH model and the method of Wang, Fuller and Qu (2008) based on augmented models. We derive an estimator of the mean squared prediction error (MSPE) of the You-Rao (YR) estimator of a small area mean that, under the true model, is correct to second-order terms. We report the results of a simulation study on the relative bias of the MSPE estimator of the YR estimator and the MSPE estimator of the Wang, Fuller and Qu (WFQ) estimator obtained under an augmented model. We also study the MSPE and the estimators of MSPE for the YR and WFQ estimators obtained under a misspecified model.

    Release date: 2013-06-28

  • Articles and reports: 12-001-X201300111831
    Description:

    We consider conservative variance estimation for the Horvitz-Thompson estimator of a population total in sampling designs with zero pairwise inclusion probabilities, known as "non-measurable" designs. We decompose the standard Horvitz-Thompson variance estimator under such designs and characterize the bias precisely. We develop a bias correction that is guaranteed to be weakly conservative (nonnegatively biased) regardless of the nature of the non-measurability. The analysis sheds light on conditions under which the standard Horvitz-Thompson variance estimator performs well despite non-measurability and where the conservative bias correction may outperform commonly-used approximations.

    Release date: 2013-06-28

  • Surveys and statistical programs – Documentation: 98-314-X2011051
    Description:

    Readers will find a complete analysis of factors affecting the comparability of language results between censuses in the Methodological Document on the 2011 Census Language Data.

    Release date: 2013-05-03

  • Articles and reports: 89-648-X2013002
    Geography: Canada
    Description:

    Data matching is a common practice used to reduce the response burden of respondents and to improve the quality of the information collected from respondents when the linkage method does not introduce bias. However, historical linkage, which consists of linking external records from previous years to the year of the initial wave of a survey, is relatively rare and, until now, had not been used at Statistics Canada. The present paper describes the method used to link the records from the Living in Canada Survey pilot to historical tax data on income and labour (T1 and T4 files). It presents the evolution of the linkage rate going back over time and compares earnings data collected from personal income tax returns with those collected from employers' files. To illustrate the new possibilities of analysis offered by this type of linkage, the study concludes with an earnings profile by age and sex for different cohorts based on year of birth.

    Release date: 2013-01-24

  • Surveys and statistical programs – Documentation: 71-544-X
    Description: This catalogue briefly describes all Labour Force Survey products offered on a monthly, annual and occasional basis. It includes products, uses, general release dates, formats available and prices, as well as special request services and Internet services. It also introduces any changes to products.
    Release date: 2012-07-06
Data (14) (10 to 20 of 14 results)

  • Table: 62F0040X1997001
    Description:

    The first in this series is the Consulting Engineering Services Price Index (CEPI), an annual index that measures changes in the prices for services provided by consulting engineers. These services encompass advisory and design work as well as construction or project management. They are provided for many types of projects (fields of specialization), and to both Canadian and foreign clients. Price indexes are published for 10 fields of specialization as well as for national, regional, and foreign markets.

    Release date: 1999-05-04

  • Table: 82-567-X
    Description:

    The National Population Health Survey (NPHS) is designed to enhance the understanding of the processes affecting health. The survey collects cross-sectional as well as longitudinal data. In 1994/95 the survey interviewed a panel of 17,276 individuals, then returned to interview them a second time in 1996/97. The response rate for these individuals was 96% in 1996/97. Data collection from the panel will continue for up to two decades. For cross-sectional purposes, data were collected for a total of 81,000 household residents in all provinces (except people on Indian reserves or on Canadian Forces bases) in 1996/97.

    This overview illustrates the variety of information available by presenting data on perceived health, chronic conditions, injuries, repetitive strains, depression, smoking, alcohol consumption, physical activity, consultations with medical professionals, use of medications and use of alternative medicine.

    Release date: 1998-07-29

  • Public use microdata: 75M0001G
    Description:

    Documentation to accompany public-use microdata files. Contains a detailed description of the survey design, content and methods, as well as the record layout and the data dictionary.

    Release date: 1997-10-31

  • Public use microdata: 12M0010X
    Description:

    Cycle 10 collected data from persons 15 years and older and concentrated on the respondent's family. Topics covered include marital history, common-law unions, biological, adopted and step children, family origins, child leaving and fertility intentions.

    The target population of the GSS (General Social Survey) consisted of all individuals aged 15 and over living in a private household in one of the ten provinces.

    Release date: 1997-02-28
Analysis (211) (60 to 70 of 211 results)

  • Articles and reports: 12-001-X201100111448
    Description:

    In two-phase sampling for stratification, the second-phase sample is selected by a stratified sample based on the information observed in the first-phase sample. We develop a replication-based bias adjusted variance estimator that extends the method of Kim, Navarro and Fuller (2006). The proposed method is also applicable when the first-phase sampling rate is not negligible and when second-phase sample selection is unequal probability Poisson sampling within each stratum. The proposed method can be extended to variance estimation for two-phase regression estimators. Results from a limited simulation study are presented.

    Release date: 2011-06-29

  • Articles and reports: 12-001-X201100111449
    Description:

    We analyze the statistical and economic efficiency of different designs of cluster surveys collected in two consecutive time periods, or waves. In an independent design, two cluster samples in two waves are taken independently from one another. In a cluster-panel design, the same clusters are used in both waves, but samples within clusters are taken independently in two time periods. In an observation-panel design, both clusters and observations are retained from one wave of data collection to another. By assuming a simple population structure, we derive design variances and costs of the surveys conducted according to these designs. We first consider a situation in which the interest lies in estimation of the change in the population mean between two time periods, and derive the optimal sample allocations for the three designs of interest. We then propose the utility maximization framework borrowed from microeconomics to illustrate a possible approach to the choice of the design that strives to optimize several variances simultaneously. Incorporating the contemporaneous means and their variances tends to shift the preferences from observation-panel towards simpler cluster-panel and independent designs if the panel mode of data collection is too expensive. We present numeric illustrations demonstrating how a survey designer may want to choose the efficient design given the population parameters and data collection cost.

    Release date: 2011-06-29

  • Articles and reports: 12-001-X201100111450
    Description:

    This paper examines the efficiency of the Horvitz-Thompson estimator from a systematic probability proportional to size (PPS) sample drawn from a randomly ordered list. In particular, the efficiency is compared with that of an ordinary ratio estimator. The theoretical results are confirmed empirically with a simulation study using Dutch data from the Producer Price Index.

    Release date: 2011-06-29

  • Articles and reports: 12-001-X201100111451
    Description:

    In the calibration method proposed by Deville and Särndal (1992), the calibration equations take only exact estimates of auxiliary variable totals into account. This article examines other parameters besides totals for calibration. Parameters that are considered complex include the ratio, median or variance of auxiliary variables.

    Release date: 2011-06-29

  • Articles and reports: 75F0002M2011002
    Description:

    In order to provide a complete picture of low income, Statistics Canada uses three complementary low income lines: the Low Income Cut-offs (LICOs), the Low Income Measures (LIMs) and the Market Basket Measure (MBM). While the first two lines were developed by Statistics Canada, the MBM is based on concepts developed by Human Resources and Skills Development Canada. Though these measures differ from one another, they give a generally consistent picture of low income status over time. None of these measures is the best. Each contributes its own perspective and its own strengths to the study of low income, so that cumulatively, the three provide a better understanding of the phenomenon of low income as a whole. These measures are not measures of poverty, but strictly measures of low income.

    Release date: 2011-06-15

  • Articles and reports: 11F0027M2010065
    Geography: Canada
    Description:

    The purpose of this paper is twofold. First, the authors provide a detailed social accounting matrix (SAM), which incorporates the income and financial flows into the standard input-output matrix, for the Canadian economy for 2004. Second, they use the SAM to assess the strength of the real-financial linkages by calculating and comparing real SAM multipliers and financial social accounting matrix (FSAM) multipliers. For FSAM multipliers, financial flows are endogenous, whereas for real SAM multipliers they are not. The results show that taking into account financial flows increases the impact of a final demand shock on Canadian output. Financial flows also play an important role in determining the cumulative effect of an income shock or the availability of investment funds. Between 2008 and the first half of 2009, financial institutions shifted their investments toward government bonds, short-term paper, and foreign investments. This shift, together with the fact that non-financial institutions were unwilling or unable to increase their financial liabilities, led to estimated declines in all GDP multipliers between 2008 and the first half of 2009 (2009H1). The main advantage of using the extended input-output analysis is that it provides a simple framework, with very few assumptions, which allows the assessment of the strength of real-financial linkages by means of multipliers. However, the methodology is subject to the Lucas critique: as shocks shift prices, agents cannot adjust. Such a framework is, nevertheless, appropriate in short-term impact analysis such as this study.

    Release date: 2011-05-20

  • Articles and reports: 12-001-X201000211375
    Description:

    The paper explores and assesses the approaches used by statistical offices to ensure effective methodological input into their statistical practice. The tension between independence and relevance is a common theme: generally, methodologists have to work closely with the rest of the statistical organisation for their work to be relevant; but they also need to have a degree of independence to question the use of existing methods and to lead the introduction of new ones where needed. And, of course, there is a need for an effective research program which, on the one hand, has a degree of independence needed by any research program, but which, on the other hand, is sufficiently connected so that its work is both motivated by and feeds back into the daily work of the statistical office. The paper explores alternative modalities of organisation; leadership; planning and funding; the role of project teams; career development; external advisory committees; interaction with the academic community; and research.

    Release date: 2010-12-21

  • Articles and reports: 12-001-X201000211378
    Description:

    One key to poverty alleviation or eradication in the third world is reliable information on the poor and their location, so that interventions and assistance can be effectively targeted to the neediest people. Small area estimation is one statistical technique that is used to monitor poverty and to decide on aid allocation in pursuit of the Millennium Development Goals. Elbers, Lanjouw and Lanjouw (ELL) (2003) proposed a small area estimation methodology for income-based or expenditure-based poverty measures, which is implemented by the World Bank in its poverty mapping projects via the involvement of the central statistical agencies in many third world countries, including Cambodia, Lao PDR, the Philippines, Thailand and Vietnam, and is incorporated into the World Bank software program PovMap. In this paper, the ELL methodology which consists of first modeling survey data and then applying that model to census information is presented and discussed with strong emphasis on the first phase, i.e., the fitting of regression models and on the estimated standard errors at the second phase. Other regression model fitting procedures such as the General Survey Regression (GSR) (as described in Lohr (1999) Chapter 11) and those used in existing small area estimation techniques: Pseudo-Empirical Best Linear Unbiased Prediction (Pseudo-EBLUP) approach (You and Rao 2002) and Iterative Weighted Estimating Equation (IWEE) method (You, Rao and Kovacevic 2003) are presented and compared with the ELL modeling strategy. The most significant difference between the ELL method and the other techniques is in the theoretical underpinning of the ELL model fitting procedure. 
An example based on the Philippines Family Income and Expenditure Survey is presented to show the differences in both the parameter estimates and their corresponding standard errors, and in the variance components generated from the different methods and the discussion is extended to the effect of these on the estimated accuracy of the final small area estimates themselves. The need for sound estimation of variance components, as well as regression estimates and estimates of their standard errors for small area estimation of poverty is emphasized.

    Release date: 2010-12-21

  • Articles and reports: 12-001-X201000111244
    Description:

    This paper considers the problem of selecting nonparametric models for small area estimation, which recently have received much attention. We develop a procedure based on the idea of fence method (Jiang, Rao, Gu and Nguyen 2008) for selecting the mean function for the small areas from a class of approximating splines. Simulation results show impressive performance of the new procedure even when the number of small areas is fairly small. The method is applied to a hospital graft failure dataset for selecting a nonparametric Fay-Herriot type model.

    Release date: 2010-06-29

  • Articles and reports: 12-001-X201000111247
    Description:

    In this paper, the problem of estimating the variance of various estimators of the population mean in two-phase sampling has been considered by jackknifing the two-phase calibrated weights of Hidiroglou and Särndal (1995, 1998). Several estimators of population mean available in the literature are shown to be the special cases of the technique developed here, including those suggested by Rao and Sitter (1995) and Sitter (1997). By following Raj (1965) and Srivenkataramana and Tracy (1989), some new estimators of the population mean are introduced and their variances are estimated through the proposed jackknife procedure. The variance of the chain ratio and regression type estimators due to Chand (1975) are also estimated using the jackknife. A simulation study is conducted to assess the efficiency of the proposed jackknife estimators relative to the usual estimators of variance.

    Release date: 2010-06-29
Reference (74) (60 to 70 of 74 results)

  • Surveys and statistical programs – Documentation: 21-525-X
    Description:

    Statistics Canada publishes several measures of farm income, each produced for a different purpose. This bulletin describes the concepts behind these different measures, the methods by which the measures are constructed, and the uses for which they were designed.

    Release date: 2000-11-29

  • Surveys and statistical programs – Documentation: 75F0002M2000010
    Description:

    This report explains the concept of income and provides definitions of the various sources of income and derived income variables. It also documents the various aspects of the census that can have an impact on census income estimates.

    Release date: 2000-07-26

  • Surveys and statistical programs – Documentation: 11-522-X19990015640
    Description:

    This paper describes how SN is preparing for a new era in the making of statistics, triggered by technological and methodological developments. An essential feature of the turn to the new era is the farewell to the stovepipe way of data processing. The paper discusses how new technological and methodological tools will affect processes and their organization. Special emphasis is put on one of the major opportunities and challenges the new tools offer: establishing coherence in the content of statistics and in the presentation to users.

    Release date: 2000-03-02

  • Surveys and statistical programs – Documentation: 11-522-X19990015642
    Description:

    The Longitudinal Immigration Database (IMDB) links immigration and taxation administrative records into a comprehensive source of data on the labour market behaviour of the landed immigrant population in Canada. It covers the period 1980 to 1995 and will be updated annually starting with the 1996 tax year in 1999. Statistics Canada manages the database on behalf of a federal-provincial consortium led by Citizenship and Immigration Canada. The IMDB was created specifically to respond to the need for detailed and reliable data on the performance and impact of immigration policies and programs. It is the only source of data at Statistics Canada that provides a direct link between immigration policy levers and the economic performance of immigrants. The paper will examine the issues related to the development of a longitudinal database combining administrative records to support policy-relevant research and analysis. Discussion will focus specifically on the methodological, conceptual, analytical and privacy issues involved in the creation and ongoing development of this database. The paper will also touch briefly on research findings, which illustrate the policy outcome links the IMDB allows policy-makers to investigate.

    Release date: 2000-03-02

  • Surveys and statistical programs – Documentation: 11-522-X19990015662
    Description:

    As the availability of both health utilization and outcome information becomes increasingly important to health care researchers and policy makers, the ability to link person-specific health data becomes a critical objective. This type of linkage of population-based administrative health databases has been realized in British Columbia. The database was created by constructing an historical file of all persons registered with the health care system, and then by probabilistically linking various program files to this 'coordinating' file. The first phase of development included the linkage of hospital discharge data, physician billing data, continuing care data, data about drug costs for the elderly, births data and deaths data. The second phase of development has seen the addition of data sources external to the Ministry of Health, including cancer incidence data, workers' compensation data, and income assistance data.

    Release date: 2000-03-02

  • Surveys and statistical programs – Documentation: 11-522-X19990015668
    Description:

    Following the problems with estimating underenumeration in the 1991 Census of England and Wales, the aim for the 2001 Census is to create a database that is fully adjusted to net underenumeration. To achieve this, the paper investigates weighted donor imputation methodology that utilises information from both the census and census coverage survey (CCS). The US Census Bureau has considered a similar approach for their 2000 Census (see Isaki et al 1998). The proposed procedure distinguishes between individuals who are not counted by the census because their household is missed and those who are missed in counted households. Census data is linked to data from the CCS. Multinomial logistic regression is used to estimate the probabilities that households are missed by the census and the probabilities that individuals are missed in counted households. Household and individual coverage weights are constructed from the estimated probabilities and these feed into the donor imputation procedure.

    Release date: 2000-03-02

  • Surveys and statistical programs – Documentation: 11-522-X19990015688
    Description:

    The geographical and temporal relationship between outdoor air pollution and asthma was examined by linking together data from multiple sources. These included the administrative records of 59 general practices widely dispersed across England and Wales for half a million patients and all their consultations for asthma, supplemented by a socio-economic interview survey. Postcode enabled linkage with: (i) computed local road density; (ii) emission estimates of sulphur dioxide and nitrogen dioxides, (iii) measured/interpolated concentration of black smoke, sulphur dioxide, nitrogen dioxide and other pollutants at practice level. Parallel Poisson time series analysis took into account between-practice variations to examine daily correlations in practices close to air quality monitoring stations. Preliminary analyses show small and generally non-significant geographical associations between consultation rates and pollution markers. The methodological issues relevant to combining such data, and the interpretation of these results will be discussed.

    Release date: 2000-03-02

  • Surveys and statistical programs – Documentation: 11-522-X19990015692
    Description:

    Electricity rates that vary by time-of-day have the potential to significantly increase economic efficiency in the energy market. A number of utilities have undertaken economic studies of time-of-use rates schemes for their residential customers. This paper uses meta-analysis to examine the impact of time-of-use rates on electricity demand, pooling the results of thirty-eight separate programs. There are four key findings. First, very large peak to off-peak price ratios are needed to significantly affect peak demand. Second, summer peak rates are relatively effective compared to winter peak rates. Third, permanent time-of-use rates are relatively effective compared to experimental ones. Fourth, demand charges rival ordinary time-of-use rates in terms of impact.

    Release date: 2000-03-02

  • Surveys and statistical programs – Documentation: 11-522-X19990015694
    Description:

    We use data on 14 populations of coho salmon to estimate critical parameters that are vital for management of fish populations. Parameter estimates from individual data sets are inefficient and can be highly biased, and we investigate methods to overcome these problems. Combination of data sets using nonlinear mixed effects models provides more useful results, however questions of influence and robustness are raised. For comparison, robust estimates are obtained. Model-robustness is also explored using a family of alternative functional forms. Our results allow ready calculation of the limits of exploitation and may help to prevent extinction of fish stocks. Similar methods can be applied in other contexts where parameter estimation is part of a larger decision-making process.

    Release date: 2000-03-02

  • Surveys and statistical programs – Documentation: 92-371-X
    Description:

    This report deals with sampling and weighting, a process whereby certain characteristics are collected and processed for a random sample of dwellings and persons identified in the complete census enumeration. Data for the whole population are then obtained by scaling up the results for the sample to the full population level. The use of sampling may lead to substantial reductions in costs and respondent burden, or alternatively, can allow the scope of a census to be broadened at the same cost.

    Release date: 1999-12-07