Section 5 Data quality

Warning View the most recent version.

Archived Content

Information identified as archived is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please "contact us" to request a format other than those available.

Lineage
Positional accuracy
Attribute accuracy
Logical consistency
Consistency with other products
Completeness

Linkage data quality elements provide information on the fitness-for-use of a spatial database by describing why, when and how the data are created, and how accurate the data are. The elements include an overview describing the purpose and usage, as well as specific quality elements reporting on the lineage, attribute accuracy, logical consistency and completeness. This information is provided to users for all linkage data products disseminated for the census.

Lineage

Lineage describes the history of the linkage data, including descriptions of the source material from which the data were derived, and the methods of derivation. It also contains the dates of the source material, and all transformations involved in producing the final digital files.

Sources

The sources used to derive the Postal Codes by Federal Ridings File (PCFRF) are as follows:

  • The May 2011 Postal Code Conversion File (PCCF) links postal codes (provided by Canada Post Corporation [CPC] on the Address Lookup File updated to May 2011) to geographic codes for all 2006 Census geographic areas, including province and federal electoral district 2003 Representative Order codes. It also provides the geographic point coordinates representing the postal codes. The May 2011 PCCF contains over 1.6 million postal code records linked to the geographic areas used in the 2006 Census. These geographical areas have a reference date of January 1, 2006, except for the Federal electoral district – 2003 Representation Order.
  • The PCFRF contains postal code data under license from Canada Post Corporation. The most recent Canada Post Corporation file from which this data is copied is dated May 2011.
  • Federal electoral district (FED) names are derived from Geography Division's Spatial Data Infrastructure. The source of the geographic names and codes of federal electoral districts is the 2003 Representation Order of the Chief Electoral Office, Elections Canada. The Spatial Data Infrastructure contains a table with the name of each federal electoral district and its associated identification code. This table is updated based on name changes provided by Elections Canada. Where changes to the electoral boundaries have been provided by Elections Canada, the correspondence between the federal electoral district and postal codes is updated.
  • The 2006 Census of Population is used as a source for deriving the weights. When a postal code is linked in the PCFRF to more than one FED, the number of persons reporting the postal code in the census may be used to derive the weights.

Method of derivation

The PCFRF is created by extracting the active postal codes and the related FED codes included in the May 2011 PCCF, containing May 2011 postal codes. Each FED code in this file is linked to the list of federal electoral districts – 2003 Representation Order codes and names. The linkage to the FED on the May 2011 PCCF is based on the dissemination block or dissemination area geocoded in the PCCF.

The resulting PCFRF file contains 841,799 active postal code records of which 826,866 are unique links to one federal electoral district. In total, 7,190 active postal codes (14,933 records) are linked to more than one federal electoral district (further details are provided in Logical consistency later in this section). The number of postal code records by federal electoral district and whether those postal codes are linked to other FEDs is provided in Table 3.1.

The unique link variable is derived based on the postal code and FED codes in the PCFRF. If the postal code is linked to only one FED, the unique link is assigned a value of 1, otherwise it is assigned a value of 2.

The 'weight' estimates the proportion of the population of a postal code that resides within each FED. If a postal code is linked to only one FED in the PCFRF, the weight is equal to 1. If the postal code is linked to more than one FED and is reported in the 2006 Census, the weight is equal to the proportion of the population that reported the postal code in each of the FEDs. If the postal code was not reported in the census, the weight is estimated using the address ranges in the service area of the postal code as found in the Address Lookup File from Canada Post Corporation. If necessary, the weights for a postal code are normalised and adjusted using the Single Link Indicator variable in the PCCF so that the sum of weights equals 1.0.

Positional accuracy

Not applicable

Attribute accuracy

Attribute accuracy refers to the accuracy of the quantitative and qualitative information attached to each feature (such as population for a population centre, street name, census subdivision name and code).

The attribute accuracy of the PCFRF is dependent on the accuracy of the geocodes for the dissemination blocks and dissemination areas in the PCCF. The linkage of the dissemination blocks or dissemination areas to the FEDs is based on the boundaries of the FEDs as found in the Spatial Data Infrastructure.

The accuracy of the weight variable is based on the linkage to the FED in the PCFRF, the population reporting the postal code in the census as well as address range data in Canada Post's Address Lookup File.

The population on which the weight variable in the PCFRF is based was derived from the total population data of the 2006 Census. Population counts are determined according to the 'de jure' method. This means that people are enumerated at their usual place of residence, regardless of where they may have been on Census Day, May 16, 2006. For more information on the quality of 2006 Census data, see Appendix B in the 2006 Census Dictionary.

If a postal code is linked to more than one FED in the PCFRF and was not reported in the census, address range data from the Address Lookup File is used to estimate the weight. This is the case for about 1% of the postal codes in the PCFRF. Because large populations residing in apartments or collective dwelling units may be represented by only one address, this method can underestimate the weight associated with these populations.

Logical consistency

Logical consistency describes the fidelity of relationships encoded in the data structure of the digital linkage data.

Of the 841,799 active postal code records found on this file, there are 826,866 active postal codes uniquely linked to one federal electoral district and 7,190 active postal codes that are linked to two or more federal electoral districts. The following table summarizes them.

Table 5.1
Count of postal codes linked to federal electoral districts
Number of federal electoral districts Active postal codes Number of records
1 826,866 826,866
2 6,726 13,452
3 401 1,203
4 47 188
5 6 30
6 10 60
Total 834,056 841,799

Consistency with other products

Data contained in the PCFRF are consistent with all 2006 Census related geographic products with the exception of the 2006 Census Forward Sortation Area Boundary File (Catalogue no. 92-170-XWE, XCE), which represents only the forward sortation areas reported in the 2006 Census. The PCFRF is derived from the Postal Code Conversion File (PCCF), and is consistent with that file.

Completeness

Completeness refers to the degree to which geographic features, their attributes and their relationships are included or omitted in a dataset. It also includes information on selection criteria, definitions used, and other relevant mapping rules.

Completeness in the context of the PCFRF is the degree to which all valid postal codes are accounted for. All postal codes, valid and active as of May 2011 according to CPC, have been linked to census geography.

There are 308 FEDs in the 2003 Representation Order of the Chief Electoral Office, Elections Canada. All of these FEDs are included in the PCFRF.

The data files are named using a file naming convention described in section 4, Technical specifications. Each file contains the following number of active postal code records:

Table 5.2
Number of postal code records per region in Postal Codes by Federal Ridings File (PCFRF) data files
File name Number of records
pcfrfEastFED2003_MAY11_fcpcefEstCEF2003.zip 101,847
pcfrfQueFED2003_MAY11_fcpcefQuéCEF2003.zip 212,235
pcfrfOntFED2003_MAY11_fcpcefOntCEF2003.zip 279,573
pcfrfWestFED2003_MAY11_fcpcefOuestCEF2003.zip 129,937
pcfrfBCFED2003_MAY11_fcpcefCBCEF2003.zip 117,207
pcfrfNatFED2003_MAY11_fcpcefNatCEF2003.zip 841,799

Table 5.3 lists abbreviations for the region names used in the data file names and the province and territories that they represent.

Table 5.3
Region abbreviations and associated province and/or territory in Postal Codes by Federal Ridings File (PCFRF) data files
English abbreviation - region name Associated province and/or territory - English French abbreviation - region name Associated province and/or territory - French
East Newfoundland and Labrador, Prince Edward Island, Nova Scotia,
New Brunswick
Est Terre-Neuve-et-Labrador, Île-du-Prince-Édouard, Nouvelle-Écosse, Nouveau-Brunswick
Que Quebec Qué Québec
Ont Ontario Ont Ontario
West Manitoba, Saskatchewan, Alberta, Northwest Territories, Nunavut Ouest Manitoba, Saskatchewan, Alberta, Territoires du Nord-Ouest, Nunavut
BC British Columbia, Yukon CB Colombie-Britannique, Yukon
Nat Canada Nat Canada