This reference guide is intended for users of the 2021 GeoSuite Data Package. This guide provides an overview of the files, the general methodology used to create them, and important technical information.
What's new?
- The downloadable version of GeoSuite is not available for the 2021 Census. The 2021 GeoSuite Data Package containing all of the 2021 tables in comma separated value (.csv) format is available for download.
- All 2021 Census geographyproducts now include the Dissemination Geography Unique Identifier (DGUID) for geographic areas.
1. About this guide
This reference guide is intended for users of the 2021 GeoSuite Data Package. A record layout is provided in the Technical specifications section.
This data product is provided 'as-is,’ and Statistics Canada makes no warranty, either express or implied, including but not limited to, warranties of merchantability and fitness for a particular purpose. In no event will Statistics Canada be liable for any direct, special, indirect, consequential or other damages, however caused.
2. Overview
The 2021 GeoSuite Data Package contains the same 2021 Census data that are available in GeoSuite Web in tabular format. It includes the 2021 Census population counts, the 2021 Census dwelling counts, representative point coordinates, land area, geographic codes, names and, in some cases, the 2016 Census population counts (both final and adjusted). It also includes lookup tables as well as reference lists.
3. About this product
Purpose of the product
The objective of the 2021 GeoSuite Data Package is to provide users with the ability to load all 2021 GeoSuite data into their own application or database, in order to format, retrieve and query data, explore the links between 2021 Census standard levels of geography, and obtain selected tabular information.
Definitions and concepts
Geographic terms and concepts are briefly defined in the Dictionary, Census of Population, 2021.
Content
The 2021 GeoSuite Data Package contains information for all 2021 Census standard geographic areas, including unique identifiers (UIDs), DGUIDs, related attributes, population and dwelling counts, land area, representative point coordinates and Census incompletely enumerated Indian reserves and Indian settlements.
Hierarchy of standard geographic areas
The 2021 GeoSuite Data Package includes data for the following the 2021 Census standard geographic areas:
- Canada (CAN)
- Provinces and territories (PRs)
- Census divisions (CDs)
- Federal electoral districts (FEDs) (2013 Representation Order)
- Census subdivisions (CSDs)
- Designated places (DPLs)
- Economic regions (ERs)
- Census consolidated subdivisions (CCSs)
- Census metropolitan areas (CMAs), census agglomerations (CAs) and census metropolitan influenced zones (MIZs)
- Census tracts (CTs)
- Population centres (POPCTRs) and rural areas (RAs)
- Dissemination areas (DAs)
- Dissemination blocks (DBs)
- Aggregate dissemination areas (ADAs)
- Place names (PNs)
The Figure 1.1, “Hierarchy of standard geographic areas for dissemination, 2021 Census,” illustrates the relationships between all standard geographic areas.
2021 Census population and private dwellings
The population and dwelling counts contained within the 2021 GeoSuite Data Package are from the 2021 Census. The counts for a particular geographic area represent the number of people whose usual place of residence is in that area, regardless of where they happened to be on census day, May 11, 2021.
2021 Census land area
Land area is the area in square kilometres of the land-based portions of 2021 Census standard geographic areas. The land area data contained within the 2021 GeoSuite Data Package may or may not be consistent with land area data provided by other sources. Land area is calculated using ArcGIS® software for the sole purpose of calculating population density.
Land area data for 2021 Census standard geographic areas reflect the boundaries in effect on January 1, 2021 (the geographic reference date for the 2021 Census of Canada).
2021 Census incompletely enumerated Indian reserves and Indian settlements flag
In 2021, some Indian reserves and Indian settlements were incompletely enumerated. For these reserves and settlements, dwelling enumeration was either not permitted or was interrupted before it could be completed.
The 2021 Census population and dwelling counts are not available for the incompletely enumerated Indian reserves and Indian settlements, and are not included in 2021 Census tabulations. Data for geographic areas containing one or more of these reserves and settlements are noted accordingly.
2016 Census population by 2016 Census boundaries
The 2016 Census population counts are as they were enumerated during the 2016 Census, according to boundaries that were in effect as of January 1, 2016. These data are provided for all standard geographic areas.
2016 Census population by 2021 Census boundaries and the adjusted population flag
Users wishing to compare the 2021 Census statistical data with those of other censuses should be aware that the boundaries of geographic areas may change from one census to another. In order to facilitate this comparison, the 2016 Census population counts are adjusted as needed to take into account boundary changes between the 2016 and the 2021 censuses. The 2016 Census population by 2021 Census boundaries is also known as the 2016 adjusted population. Where the 2016 adjusted population counts did not equal the 2016 final population counts, the adjusted population flag was set to 1.
Since data are provided by the 2021 Census boundaries and geographic structure, calculations on census data from the 2021 GeoSuite Data Package should only be done using the 2016 data adjusted to the 2021 boundaries.
Secondary province code
The secondary province (XPR) field is used to indicate which CMAs, CAs and POPCTRs cross provincial boundaries. The XPR field is read in conjunction with the PR (code) field to obtain the names of these provinces.
Positional data
The 2021 GeoSuite Data Package contains the representative point coordinates for the DAs, weighted by population data. It also contains the representative point coordinates for the CSDs, the ADAs and the DPLs. These representative point coordinates are centrally located. The representative point coordinates were projected in Lambert conformal conic projection (NAD83).
Lookup tables
All the lookup tables required to describe the coded information (e.g., CDTYPE, CSDTYPE, DPLTYPE, etc.) contained in the data files are included in the package.
Reference lists
The package contains the five following frequently used reference lists:
- DB by CT for each CMA/CA
- DB by CSD for each CMA/CA
- DB by CSD for each CD
- CSD by FED
- CT by CSD
General methodology
The National Geographic Database (NGD) is a joint Statistics Canada-Elections Canada initiative to develop and maintain a spatial database which serves the needs of both organizations. The focus of the NGD is the continual improvement of quality and currency of spatial coverage using updates from provinces, territories and local sources. The native files used for the creation of the 2021 GeoSuite Data Package reside on Statistics Canada's Spatial Data Infrastructure (SDI) which was derived directly from data stored in the NGD.
Attribute information was retrieved from SDI and tables were created for each 2021 Census standard geographic level. Each table contains attribute information for all higher level geographies, where applicable. Common attributes, such as codes and UIDs, link all standard levels of geography in order to provide the users with connections that represent relationships found in the complete geographic hierarchy.
Limitations
Not applicable
Comparison to other products/versions
The 2021 GeoSuite Data Package contains UIDs, DGUIDs, names, and where applicable, types and classes applicable to the 2021 Census.
In order to harmonize the 2021 GeoSuite Data Package with other products, it only includes place names sourced from the Geographical Names Board of Canada (GNBC).
The 2021 GeoSuite Data Package includes all the DBs, while the “2021 Dissemination Block Cartographic Boundary File” does not include the DBs located entirely within coastal waters. See the 2021 Boundary Files, Reference Guide for more information.
Users should note that even when the boundaries of standard geographic areas did not change between the 2016 and 2021 censuses, the land areas may differ due to geometry shifts. The shifts are caused by the integration of CanVec hydrographic features, as well as improvements in the absolute positional accuracy of roads.
Because of the missing counts for the incompletely enumerated Indian reserves and Indian settlements, users are cautioned that for the affected geographic areas, comparisons (e.g., percentage change) between 2016 and 2021 may not be precise. The impact of the missing data can be significant for lower-level geographic areas (e.g., CDs), where the incompletely enumerated Indian reserves and Indian settlements account for a higher proportion of the population. This is especially true for lower-level geographic areas where a particular Indian reserve or Indian settlement was incompletely enumerated for the 2021 Census and enumerated for the 2016 Census and vice versa.
Use with other products
The 2021 Census standard geographic areas in the 2021 GeoSuite Data Package can be linked to other 2021 Census products using UIDs or DGUIDs.
The 2021 Census DB unique identifiers (DBUID) included in the 2021 GeoSuite Data Package can be used with the 2021 Correspondence Files to identify corresponding 2016 Census DBs. The 2016 DBUIDs can then be linked to the 2016 Geographic Attribute File or 2016 Geosuite to retrieve the 2016 Census standard geographic areas and their attributes.
Reference date
Population and dwelling counts
The population and dwelling count data contained within the 2021 GeoSuite Data Package refer to the 2021 Census of Population which was conducted on May 11, 2021.
Standard geographic areas
The geographic reference date is a date determined by Statistics Canada to finalize the geographic framework for which 2021 Census statistical data are collected, tabulated and reported. The reference date for the 2021 Census standard geographic areas is January 1, 2021.
4. Technical specifications
Record layout and data descriptions
The following table identifies and briefly describes the selected attributes comprising the content of the 2021 GeoSuite Data Package.
Each disseminated geography includes the UID or code of their related higher level of geographies. For the CDs, the ER code is available. In 2021, one CD (CDUID 3524) belongs to two ERs. Since only one ER can be related to a given CD in GeoSuite, the most populated ER (ERUID 3530) has been assigned.
Attribute domain values
Census division type (CDTYPE)
For information on census division types, refer to the “Census division type (CDTYPE), 2021 Census” table.
Census subdivision type (CSDTYPE)
Census subdivisions are classified according to designations adopted by provincial, territorial or federal authorities.
For information on census subdivision types, refer to the “Census subdivision type (CSDTYPE), 2021 Census” table.
Designated place type (DPLTYPE)
For information on designated place types, refer to the “Designated place type (DPLTYPE), 2021 Census” table.
Statistical Area Classification type (SACTYPE)
The Statistical Area Classification type is a one-digit code that identifies whether a CSD is a component of a CMA, a CA, a MIZ or in the territories.
For information on Statistical Area Classification types, refer to the “Statistical Area Classification type (SACTYPE), 2021 Census” table.
Statistical Area Classification code (SACCODE)
The Statistical Area Classification code is a three-digit code that groups CSDs according to whether they are a component of a CMA, CA or MIZ. MIZ categories denote the degree of influence that the CMAs and/or CAs have on these zones.
For information on Statistical Area Classification codes, refer to the “Statistical Area Classification code (SACCODE), 2021 Census” table.
Census metropolitan area and census agglomeration type (CMATYPE)
For information on census metropolitan area and census agglomeration types, refer to the “Census metropolitan area and census agglomeration type (CMATYPE), 2021 Census” table.
Population centre and rural area type (POPCTRRATYPE)
For information on population centre and rural area types, refer to the “Population centre and rural area type (POPCTRRATYPE), 2021 Census” table.
Population centre and rural area size classes (POPCTRRACLASS)
For information on population centre and rural area size classes, refer to the “Population centre and rural area size classes (POPCTRRACLASS), 2021 Census” table.
Locator source (LOCSOURCE)
The locator source describes the origin of the representative point. For the 2021 GeoSuite Data Package, the Geographical Names Board of Canada (GNBC) is the locator source of all place names.
Dissolved geographic area (DISSOLVEDGA)
For the 2021 GeoSuite Data Package, no dissolved CSDs, DPLs, POPCTRs and unincorporated places were included in the data.
File specifications
The content of the 2021 GeoSuite Data Package is approximately 230 megabytes.
Software formats
This reference guide does not provide details on specific software packages that are available for use in comma separated value format (.csv). Users are advised to contact the appropriate software vendor for information. Users should be aware that the comma separated value (.csv) files contain leading zeroes which not all spreadsheet software recognize. If the software does not load the leading zeroes, users should import the files as a text/csv instead, in order to view the files properly.
File extension and accented character information
The .csv files are compressed into a WinZip® file (file extension .zip).
Geographic representation
Not applicable
File naming convention
The 2021 GeoSuite Data Package file follow a standard naming convention. The file name includes the census year, catalogue number, language and file format.
The compressed files are named as follows:
- 2021_92-150-X_eng.zip
- 2021_92-150-X_fra.zip
5. Data quality
Data quality elements provide information on the fitness-for-use of a database by describing why, when, and how the data are created, and how accurate the data are. The quality elements include an overview reporting on the lineage, positional accuracy, attribute accuracy, logical consistency and completeness. This information is provided to users for all geographic data products disseminated for the census.
Lineage
Lineage describes the history of the data, including descriptions of the source material from which the data were derived, and the methods of derivation. It also contains the dates of the source material, and all transformations involved in producing the files.
All data in the 2021 GeoSuite Data Package were originally extracted from Statistics Canada's SDI.
Positional accuracy
Positional accuracy refers to the absolute and relative accuracy of the positions of geographic features. Absolute accuracy is the closeness of the coordinate values in a dataset to values accepted as or being true. Relative accuracy is the closeness of the relative positions of features to their respective relative positions accepted as or being true. Descriptions of positional accuracy include the quality of the final file or product after all transformations.
The only positional data contained within the 2021 GeoSuite Data Package are the representative point coordinates. Within Statistics Canada's SDI, representative point coordinates were generated using ArcGIS® software in conjunction with the different geographic boundaries. The representative point coordinates were initially calculated based on the Lambert conformal conic projection; they were then transformed to latitude and longitude coordinates.
Attribute accuracy
Attribute accuracy refers to the accuracy of the quantitative and qualitative information attached to each feature (such as population counts for DBs, CSD UIDs (CSDUIDs), names and types).
The UIDs, DGUIDs names, types and classes contained within the 2021 GeoSuite Data Package, along with the relationships between all standard geographic areas, were verified against Statistics Canada's SDI and found to accurately reflect them.
Blank fields are displayed within the 2021 GeoSuite Data Package where population and dwelling counts have been suppressed due to incompletely enumerated Indian reserves and Indian settlements. Population counts for Indian reserve refusal CSDs are not included in any census counts, therefore the blank population counts at the DB levels are consistent with the 2021 Census statistical data.
Logical consistency
Logical consistency describes the fidelity of relationships encoded in the data structure of the digital spatial data.
Consistency between data at various geographic levels was verified. Verification procedures ensured that counts at lower geographic levels sum to higher geographic levels. The verification procedures also ensured that higher geographic levels include the appropriate geographic units.
Consistency with other products
The population and dwelling count data in the 2021 GeoSuite Data Package are consistent with those disseminated in other 2021 Census products. The UIDs and DGUIDs used in the 2021 GeoSuite Data Package are the same as those used in other geography products and represent the same geographic areas.
Completeness
Completeness refers to the degree to which geographic features, their attributes and their relationships are included or omitted in a dataset. It also includes information on selection criteria, definitions used, and other relevant mapping rules.
The 2021 GeoSuite Data Package contains one record for each of the 498,786 DBs. It also contains the appropriate number of geographic areas for each standard geographic level. The data in Table 1.1, “Geographic areas by province and territory, 2021 Census,” were verified within the 2021 GeoSuite Data Package.
- Date modified: