Health Reports
A pan-Canadian dataset of neighbourhood retail food environment measures using Statistics Canada’s Business Register

by Andrew C. Stevenson, Clara Kaufmann, Rachel C. Colley, Leia M. Minaker, Michael J. Widener, Thomas Burgoine, Claudia Sanmartin and Nancy A. Ross

Release date: February 16, 2022

DOI: https://www.doi.org/10.25318/82-003-x202200200001-eng

Abstract

Background

The objective of this study was to create the Canadian Food Environment Dataset (Can-FED) and to demonstrate its validity.

Data and methods

Food outlet data were extracted from Statistics Canada’s Business Register (BR) in 2018. Retail food environment access measures (both absolute and relative measures) were calculated using network buffers around the centroid of 56,589 dissemination areas in Canada. A k-medians clustering approach was used to create categorical food environment variables that were easy to use and amenable to dissemination. Validity of the measures was assessed by comparing the food environment measures from Can-FED with measures created using Enhanced Points of Interest data by DMTI Spatial Inc. and data from a municipal health inspection list. Validity was also assessed by calculating the geographic variability in food environments across census metropolitan areas (CMAs) and assessing associations between CMA-level food environments and CMA-level health indicators.

Results

Two versions of Can-FED were created: a researcher file that must be accessed within a secure Statistics Canada environment and a general-use file available online. Agreement between Can-FED food environment measures and those derived from a proprietary dataset and a municipal health inspection list ranged from rs=0.28 for convenience store density and rs=0.53 for restaurant density. At the CMA level, there is wide geographic variation in the food environment with evidence of patterning by health indicators.

Interpretation

Can-FED is a valid and accessible dataset of pan-Canadian food environment measures that was created from the BR, a data source that has not been explored fully for health research.

Keywords

food environment, built environment, diet, accessible data, body mass index, cardiometabolic health, geography, epidemiology

Authors

Andrew C. Stevenson (andrew.stevenson@mail.mcgill.ca) and Clara Kaufmann are with the Department of Geography, McGill University, Montréal, Quebec. Rachel C. Colley and Claudia Sanmartin are with the Health Analysis Division, Statistics Canada, Ottawa, Ontario. Leia M. Minaker is with the School of Planning, University of Waterloo, Waterloo, Ontario. Michael J. Widener is with the Department of Geography and Planning, University of Toronto, Toronto, Ontario. Thomas Burgoine is with the UK Clinical Research Collaboration Centre for Diet and Activity Research, MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom. Nancy A. Ross (Nancy.ross@mcgill.ca) with Queen’s University as Vice-President (Research) and work was carried out when Nancy A. Ross was with the Department of Geography, McGill University, Montréal, Quebec, and the Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montréal, Quebec.

 

What is already known on this subject?

  • Unfavourable food environments, characterized as neighbourhoods with high access to fast food and low access to healthy food, contribute to poor diet quality.
  • Previous Canadian research indicates that neighbourhoods with good access to healthy food are associated with healthier diets, lower body mass index and lower risk of type 2 diabetes.
  • Canadian food environment research to-date has been limited to certain regions of Canada as no pan-Canadian food environment dataset was available.

What does this study add?

  • This paper describes the development and validation of the Canadian Food Environment Dataset (Can-FED): a pan-Canadian dataset of retail food environment measures.
  • The Can-FED includes the densities of 19 different food store types for all dissemination areas in Canada. Two relative food environment measures are also included in the database.
  • Comparisons to secondary food environment datasets and the geographic variability of food environments in Canada are presented.

Introduction

The retail food environment is a modifiable component of the built environment with the potential to contribute to improvements in the diets of Canadians at the population level. The retail food environment is defined by geographic access to different types of retail food sources, including restaurants and food stores. Unfavourable neighbourhood retail food environments, characterized by neighbourhoods with an overabundance of less healthy food stores or a scarcity of healthier food stores, are a target for intervention because they can be a contributing factor to poor diet quality.Note 1 In Canada, neighbourhoods with easy access to healthier food options or limited access to less healthy food options have been associated with healthier diets,Note 2Note 3Note 4 lower body mass index (BMI)Note 5 and lower risks of Type 2 diabetes.Note 6 However, measurement error regarding both food environments and either diet or health outcomes has generated inconsistent findings.Note 5Note 7Note 8

Researchers typically use secondary business datasets or government sources to identify and locate food outlets to measure retail food environments.Note 5 However, validation studies of these types of secondary datasets have identified substantial errors that arise from problems such as misclassification of food outlets, incomplete coverage and inaccurate geocoding.Note 8Note 9Note 10These errors have been shown to underestimate or overestimate food access compared with ground validation.Note 8 Random error could mask true associations, and systematic error could lead researchers to inaccurate findings. Additionally, measures created from proprietary sources cannot be made available to other researchers or the public. Measures created from government sources are often specific to a local region, therefore limiting the ability of researchers to measure retail food environments across Canada and compare retail food environments across regions. Self-reported diet and health outcomes can also have substantial error from recall bias, social desirability bias, interviewer bias, or inaccurate translation of self-reported statements into the relevant measure of diet or health, which can also generate inconclusive or unexpected findings.Note 11Note 12

A high-quality and accessible Canada-wide dataset of businesses that can be used to create food environment measures has not been identified. Accessible and valid national food environment measures would contribute to a stronger evidence base with increased accuracy and reusability of exposure data. Additionally, a dataset that allows for Canada-wide analyses may be an improvement over regional studies that may suffer from a lack of heterogeneity in exposure to food environments. This is a problem because if everyone within the study has a similar level of exposure to the retail food environment, the retail food environment will have a limited influence on the distribution of the outcome, potentially masking a true association. National and valid retail food environment measures would facilitate data linkage with national health surveys, health administrative data and investigator-led cohort studies. They are intended to be made available to public health stakeholders in Canada who wish to adopt and monitor food environment interventions.

The purpose of this paper is to describe the development and validation of the Canadian Food Environment Dataset (Can-FED), a pan-Canadian dataset of retail food environment measures at the dissemination area (DA) level using food outlet data from the 2018 Statistics Canada Business Register (BR).

Two versions of the dataset were developed: (1) a researcher Can-FED that includes continuous absolute and relative densities measured as counts per kilometre within a buffer, which is accessible through Statistics Canada’s secure data environments of the Canadian Research Data Centre Network; and (2) a general-use Can-FED that includes categorical measures of the food environment, which is publicly available for download. The objectives of this paper are to describe the approach used to create Can-FED and to demonstrate its validity as a food environment dataset by comparing the food environment measures from Can-FED with measures derived using a secondary business dataset and a public health inspection list, calculating the geographic variability of food environments across Canada, and assessing ecological associations with health indicators at the census metropolitan area (CMA) level.

Methods

Development of the Canadian Food Environment Dataset

Food outlet data

Food outlet data come from the BR, a central repository of information on businesses operating in Canada.Note 13 Information on businesses is compiled from mandatory tax data collected by the Canada Revenue Agency (CRA). Responding to the survey is mandatory, and all outlets are consistently classified with a North American Industry Classification System (NAICS) code that identifies the primary function of a business. A research contract was signed with Statistics Canada to provide access to the BR at a secure site at the Statistics Canada headquarters in Ottawa.

Food outlet classification

Detailed classification of food outlets was based on a level 5 NAICS code, augmented with a name-based assignment method. Businesses that needed to be further defined from their assigned NAICS code were extracted by querying that NAICS code, then categorized based on the outlet name. For example, outlets with NAICS code 722512 (“limited-service eating places”) were extracted from the BR. Then, keyword searches indicating a fast-food outlet (e.g., the name of a chain or the word “burger”) were conducted on the outlet name to further define “fast-food outlets.” In total, 19 food outlet types were derived from NAICS codes and name-based assignments (Table 1).


Table 1
Overview of retail food environment measures in the researcher Canadian Food Environment Dataset and the general-use Canadian Food Environment Dataset
Table summary
This table displays the results of Overview of retail food environment measures in the researcher Canadian Food Environment Dataset and the general-use Canadian Food Environment Dataset . The information is grouped by Absolute densities (#/km2)
(continuous variable) (appearing as row headers), Definition and Assignment and method (appearing as column headers).
Absolute densities (#/km2)
(continuous variable)
Definition Assignment and method
NAICS code and additional keyword or other specification (if applicable)
Researcher dataset
Chain supermarkets Stores that primarily sell a variety of fresh and prepared food products, have multiple locations, and are owned by large retail companies
445110
Chain supermarket brand name
Grocery stores Stores that primarily sell a variety of fresh and prepared food products
445110
Not a chain supermarket (as defined above)
Convenience stores Stores that primarily sell convenience goods and food products that are already prepared and packaged
445120
N/A
Convenience stores at a gas station Stores located at a gas station that primarily sell convenience goods and food products that are already prepared and packaged
447110
N/A
Bakeries Retail bakeries that sell fresh baked goods on the premises
311811
N/A
Fruit and vegetable markets Stores that primarily sell fresh fruits and vegetables
445230
N/A
Meat markets Stores that primarily sell meat and poultry 445210
N/A
Fish markets Stores that primarily sell fish and seafood products
445220
N/A
Specialty stores Stores that primarily sell specialty food products (e.g., coffee store, spice and herb store, dietary supplement store)
445299
N/A
Confectionery Stores that primarily sell either packaged or ready-to-eat sweets, such as chocolates, ice cream or candy
445292
N/A
Full-service restaurants Eating places where patrons typically order from a waiter, can be seated for dine-in and pay after eating
722511
N/A
Fast-food outlets Eating places that sell pre-prepared or quickly prepared food at a counter that is likely highly processed 722512
Chain fast-food brand name, or businesses with a name alluding to fast-food (e.g., "burger," "pizza," "fried," "fries")
Cafés Limited-service eating places that serve coffee and tea beverages and typically do not offer a full menu
722512
Chain café brand name, or businesses with a name alluding to cafés (e.g., "coffee," "java," "café")
Other limited-service food outlets Eating places that typically sell pre-prepared or quickly prepared food at a counter, are not fast-food outlets and are not cafes
722512
Not a fast-food outlet or café (as defined above)
Bars Drinking places primarily engaged in preparing and serving alcoholic beverages for immediate consumption
722410
N/A
Liquor stores Stores that primarily sell alcoholic beverages, including liquor, beer and wine 445310
N/A
Dollar stores Variety and dollar stores that primarily sell pre-packaged snack foods
452999
Dollar store chain name
Superstores Large stores that sell a variety of food products and other non-food products
452910
Super or mega store chain name
Chain pharmacies Large chain pharmacies that offer a selection of different food products
446110
Pharmacy chain name
Relative densities (continuous variable) Definition Formula
mRFEI Proportion of food outlets that sell a wide selection of fresh and nutritious food ( chain supermarkets+grocery stores+fruit and vegetable markets ) ( chain supermarkets+grocery stores+fruit and vegetable markets + fast food outlets+convenience stores+ convenience stores at a gas station )  x 100 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qadaWcaaWdaeaapeWaaeWaa8aabaWdbiaadogacaWGObGaamyyaiaa dMgacaWGUbGaaeiiaiaadohacaWG1bGaamiCaiaadwgacaWGYbGaam yBaiaadggacaWGYbGaam4AaiaadwgacaWG0bGaam4CaiabgUcaRiaa dEgacaWGYbGaam4BaiaadogacaWGLbGaamOCaiaadMhacaqGGaGaam 4CaiaadshacaWGVbGaamOCaiaadwgacaWGZbGaey4kaSIaamOzaiaa dkhacaWG1bGaamyAaiaadshacaqGGaGaamyyaiaad6gacaWGKbGaae iiaiaadAhacaWGLbGaam4zaiaadwgacaWG0bGaamyyaiaadkgacaWG SbGaamyzaiaabccacaWGTbGaamyyaiaadkhacaWGRbGaamyzaiaads hacaWGZbaacaGLOaGaayzkaaaapaqaa8qadaqadaWdaqaabeqaa8qa caWGJbGaamiAaiaadggacaWGPbGaamOBaiaabccacaWGZbGaamyDai aadchacaWGLbGaamOCaiaad2gacaWGHbGaamOCaiaadUgacaWGLbGa amiDaiaadohacqGHRaWkcaWGNbGaamOCaiaad+gacaWGJbGaamyzai aadkhacaWG5bGaaeiiaiaadohacaWG0bGaam4BaiaadkhacaWGLbGa am4CaiabgUcaRiaadAgacaWGYbGaamyDaiaadMgacaWG0bGaaeiiai aadggacaWGUbGaamizaiaabccacaWG2bGaamyzaiaadEgacaWGLbGa amiDaiaadggacaWGIbGaamiBaiaadwgacaqGGaGaamyBaiaadggaca WGYbGaam4AaiaadwgacaWG0bGaam4CaaqaaiabgUcaRiaabccacaWG MbGaamyyaiaadohacaWG0bGaeyOeI0IaaeiiaiaadAgacaWGVbGaam 4BaiaadsgacaqGGaGaam4BaiaadwhacaWG0bGaamiBaiaadwgacaWG 0bGaam4CaiabgUcaRiaadogacaWGVbGaamOBaiaadAhacaWGLbGaam OBaiaadMgacaWGLbGaamOBaiaadogacaWGLbGaaeiiaiaadohacaWG 0bGaam4BaiaadkhacaWGLbGaam4CaiabgUcaRaqaaiaadogacaWGVb GaamOBaiaadAhacaWGLbGaamOBaiaadMgacaWGLbGaamOBaiaadoga caWGLbGaaeiiaiaadohacaWG0bGaam4BaiaadkhacaWGLbGaam4Cai aabccacaWGHbGaamiDaiaabccacaWGHbGaaeiiaiaadEgacaWGHbGa am4CaiaabccacaWGZbGaamiDaiaadggacaWG0bGaamyAaiaad+gaca WGUbaaaiaawIcacaGLPaaaaaGaaeiOaiaabIhacaqGGcGaaGymaiaa icdacaaIWaaaaa@F274@
Rmix Proportion of fast-food outlets relative to fast-food outlets and full-service restaurants   ( fast-food outlets ) ( fast-food outlets+full-service restaurants )  x 100 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaqGGcWaaSaaa8aabaWdbmaabmaapaqaa8qacaWGMbGaamyyaiaa dohacaWG0bGaaeylaiaadAgacaWGVbGaam4BaiaadsgacaGGGcGaam 4BaiaadwhacaWG0bGaamiBaiaadwgacaWG0bGaam4CaaGaayjkaiaa wMcaaaWdaeaapeWaaeWaa8aabaWdbiaadAgacaWGHbGaam4Caiaads hacaqGTaGaamOzaiaad+gacaWGVbGaamizaiaacckacaWGVbGaamyD aiaadshacaWGSbGaamyzaiaadshacaWGZbGaey4kaSIaamOzaiaadw hacaWGSbGaamiBaiaab2cacaWGZbGaamyzaiaadkhacaWG2bGaamyA aiaadogacaWGLbGaaiiOaiaadkhacaWGLbGaam4CaiaadshacaWGHb GaamyDaiaadkhacaWGHbGaamOBaiaadshacaWGZbaacaGLOaGaayzk aaaaaiaabckacaqG4bGaaeiOaiaaigdacaaIWaGaaGimaaaa@77D6@
Absolute densities (categorical variable) Definition Assignment and method
NAICS code and additional keyword or other specification (if applicable)
General-use dataset
Chain supermarkets Stores that primarily sell a variety of fresh and prepared food products, have multiple locations, and are owned by large retail companies 445110
Chain supermarket brand name
Grocery stores Stores that primarily sell a variety of fresh and prepared food products 445110
Not a chain supermarket (as defined above)
Fruit and vegetable markets Stores that primarily sell fresh fruits and vegetables 445230
N/A
Fast-food outlets Eating places that sell pre-prepared or quickly prepared food at a counter that is likely highly processed 722512
Chain fast-food brand name, or businesses with a name alluding to fast-food (e.g., "burger," "pizza," "fried," "fries")
All convenience stores Stores including those located at a gas station that primarily sell convenience goods and food products that are already prepared and packaged 445120 and 47110
N/A
Full-service restaurants Eating places where patrons typically order from a waiter, can be seated for dine-in and pay after eating 722511
N/A
Relative densities (categorical variable) Definition Formula
mRFEI Proportion of food outlets that have a wide selection of fresh and nutritious food ( chain supermarkets+grocery stores+fruit and vegetable markets ) ( chain supermarkets+grocery stores+fruit and vegetable markets + fast food outlets+convenience stores+ convenience stores at a gas station )  x 100 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qadaWcaaWdaeaapeWaaeWaa8aabaWdbiaadogacaWGObGaamyyaiaa dMgacaWGUbGaaeiiaiaadohacaWG1bGaamiCaiaadwgacaWGYbGaam yBaiaadggacaWGYbGaam4AaiaadwgacaWG0bGaam4CaiabgUcaRiaa dEgacaWGYbGaam4BaiaadogacaWGLbGaamOCaiaadMhacaqGGaGaam 4CaiaadshacaWGVbGaamOCaiaadwgacaWGZbGaey4kaSIaamOzaiaa dkhacaWG1bGaamyAaiaadshacaqGGaGaamyyaiaad6gacaWGKbGaae iiaiaadAhacaWGLbGaam4zaiaadwgacaWG0bGaamyyaiaadkgacaWG SbGaamyzaiaabccacaWGTbGaamyyaiaadkhacaWGRbGaamyzaiaads hacaWGZbaacaGLOaGaayzkaaaapaqaa8qadaqadaWdaqaabeqaa8qa caWGJbGaamiAaiaadggacaWGPbGaamOBaiaabccacaWGZbGaamyDai aadchacaWGLbGaamOCaiaad2gacaWGHbGaamOCaiaadUgacaWGLbGa amiDaiaadohacqGHRaWkcaWGNbGaamOCaiaad+gacaWGJbGaamyzai aadkhacaWG5bGaaeiiaiaadohacaWG0bGaam4BaiaadkhacaWGLbGa am4CaiabgUcaRiaadAgacaWGYbGaamyDaiaadMgacaWG0bGaaeiiai aadggacaWGUbGaamizaiaabccacaWG2bGaamyzaiaadEgacaWGLbGa amiDaiaadggacaWGIbGaamiBaiaadwgacaqGGaGaamyBaiaadggaca WGYbGaam4AaiaadwgacaWG0bGaam4CaaqaaiabgUcaRiaabccacaWG MbGaamyyaiaadohacaWG0bGaeyOeI0IaaeiiaiaadAgacaWGVbGaam 4BaiaadsgacaqGGaGaam4BaiaadwhacaWG0bGaamiBaiaadwgacaWG 0bGaam4CaiabgUcaRiaadogacaWGVbGaamOBaiaadAhacaWGLbGaam OBaiaadMgacaWGLbGaamOBaiaadogacaWGLbGaaeiiaiaadohacaWG 0bGaam4BaiaadkhacaWGLbGaam4CaiabgUcaRaqaaiaadogacaWGVb GaamOBaiaadAhacaWGLbGaamOBaiaadMgacaWGLbGaamOBaiaadoga caWGLbGaaeiiaiaadohacaWG0bGaam4BaiaadkhacaWGLbGaam4Cai aabccacaWGHbGaamiDaiaabccacaWGHbGaaeiiaiaadEgacaWGHbGa am4CaiaabccacaWGZbGaamiDaiaadggacaWG0bGaamyAaiaad+gaca WGUbaaaiaawIcacaGLPaaaaaGaaeiOaiaabIhacaqGGcGaaGymaiaa icdacaaIWaaaaa@F274@
Rmix Proportion of fast-food outlets relative to fast-food outlets and full-service restaurants   ( fast-food outlets ) ( fast-food outlets+full-service restaurants )  x 100 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qacaqGGcWaaSaaa8aabaWdbmaabmaapaqaa8qacaWGMbGaamyyaiaa dohacaWG0bGaaeylaiaadAgacaWGVbGaam4BaiaadsgacaGGGcGaam 4BaiaadwhacaWG0bGaamiBaiaadwgacaWG0bGaam4CaaGaayjkaiaa wMcaaaWdaeaapeWaaeWaa8aabaWdbiaadAgacaWGHbGaam4Caiaads hacaqGTaGaamOzaiaad+gacaWGVbGaamizaiaacckacaWGVbGaamyD aiaadshacaWGSbGaamyzaiaadshacaWGZbGaey4kaSIaamOzaiaadw hacaWGSbGaamiBaiaab2cacaWGZbGaamyzaiaadkhacaWG2bGaamyA aiaadogacaWGLbGaaiiOaiaadkhacaWGLbGaam4CaiaadshacaWGHb GaamyDaiaadkhacaWGHbGaamOBaiaadshacaWGZbaacaGLOaGaayzk aaaaaiaabckacaqG4bGaaeiOaiaaigdacaaIWaGaaGimaaaa@77D6@

Access metrics: Researcher dataset

Retail food environment measures were calculated in ArcMap (version 10.7.1, ESRI) using network buffers (created from Statistics Canada’s 2016 Census Road Network FileNote 14) around the population-weighted centroid (calculated by Statistics CanadaNote 15) of all 56,589 DAs in Canada. DAs are the smallest standard geographic area for which census data are disseminated across Canada—they have populations of 400 to 700 people.Note 16 Two network buffer sizes (1 km and 3 km) were calculated from the population-weighted centroid of each DA (Figure 1). The 1-km network buffers represent an approximate 10- to 15-minute walk from the centroid to the edge and are useful for researchers to assess food and how people access food stores in their immediate neighbourhood or by foot.Note 17 The 3-km buffers were included in this study because they may more accurately capture how people access food in areas that are less densely populated and where people are more likely to drive to stores.

Example of a 1-km buffer around the centroid of a dissemination area

Description for Figure 1

Two buffer sizes were created: a 1-km and 3-km network buffer calculated from the population-weighted centroid of each dissemination area. 1-km network buffers represent an approximate 10 to 15-minute walk from the centroid to the edge and are useful for researchers to assess food and how people access food stores in their immediate neighbourhood or by foot.) 3-km buffers were included in this study because they may more accurately capture how people access food in areas that are less densely populated and where people are more likely to drive to stores.

Each food outlet was spatially joined to the buffer or buffers that it falls within. Nineteen absolute measures were calculated by summing the total number of each outlet type that fell within a buffer and dividing the sum of each outlet type by the total area of the buffer, represented by the following equation:

( 1 )Densit y Outlet type  = Coun t Outlet type k m 2 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qadaqadaWdaeaapeGaaGymaaGaayjkaiaawMcaaiaadseacaWGLbGa amOBaiaadohacaWGPbGaamiDaiaadMhapaWaaSbaaSqaa8qacaWGpb GaamyDaiaadshacaWGSbGaamyzaiaadshacaGGGcGaamiDaiaadMha caWGWbGaamyzaiaacckaa8aabeaak8qacqGH9aqpdaWcaaWdaeaape Gaam4qaiaad+gacaWG1bGaamOBaiaadshapaWaaSbaaSqaa8qacaWG pbGaamyDaiaadshacaWGSbGaamyzaiaadshacaGGGcGaamiDaiaadM hacaWGWbGaamyzaaWdaeqaaaGcbaWdbiaadUgacaWGTbWdamaaCaaa leqabaWdbiaaikdaaaaaaaaa@5EF4@

Several of the absolute measures, such as fast-food density and supermarket density, have been commonly employed in food environment studies,Note 5Note 7 while others, such as pharmacy density and superstore density, are new additions. Two relative measures were calculated using the counts of selected outlet types within each buffer: the modified retail food environment index (mRFEI)Note 18 and the fast-food restaurant mix (Rmix).Note 3 Measures that capture the relative mix of different types of outlets that people are exposed to were used in past research.Note 6Note 19Note 20 The mRFEI calculates the proportion of outlets that offer a wide range of fresh and nutritious options within each buffer, and it is defined as:

( 2 )mRFEI=  ( chain supermarkets+grocery stores + fruit and vegetable markets ) ( chain supermarkets+grocery stores + fruit and vegetable markets + fast-food outlets + convenience stores + convenience stores at a gas station )  x 100 MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qadaqadaWdaeaapeGaaGOmaaGaayjkaiaawMcaaiaad2gacaWGsbGa amOraiaadweacaWGjbGaeyypa0JaaeiOamaalaaapaqaa8qadaqada WdaeaapeGaam4yaiaadIgacaWGHbGaamyAaiaad6gacaGGGcGaam4C aiaadwhacaWGWbGaamyzaiaadkhacaWGTbGaamyyaiaadkhacaWGRb GaamyzaiaadshacaWGZbGaey4kaSIaam4zaiaadkhacaWGVbGaam4y aiaadwgacaWGYbGaamyEaiaacckacaWGZbGaamiDaiaad+gacaWGYb GaamyzaiaadohacaGGGcGaey4kaSIaaiiOaiaadAgacaWGYbGaamyD aiaadMgacaWG0bGaaiiOaiaadggacaWGUbGaamizaiaacckacaWG2b GaamyzaiaadEgacaWGLbGaamiDaiaadggacaWGIbGaamiBaiaadwga caGGGcGaamyBaiaadggacaWGYbGaam4AaiaadwgacaWG0bGaam4Caa GaayjkaiaawMcaaaWdaeaapeWaaeWaa8aaeaqabeaapeGaam4yaiaa dIgacaWGHbGaamyAaiaad6gacaGGGcGaam4CaiaadwhacaWGWbGaam yzaiaadkhacaWGTbGaamyyaiaadkhacaWGRbGaamyzaiaadshacaWG ZbGaey4kaSIaam4zaiaadkhacaWGVbGaam4yaiaadwgacaWGYbGaam yEaiaacckacaWGZbGaamiDaiaad+gacaWGYbGaamyzaiaadohacaGG GcGaey4kaSIaaiiOaiaadAgacaWGYbGaamyDaiaadMgacaWG0bGaai iOaiaadggacaWGUbGaamizaiaacckacaWG2bGaamyzaiaadEgacaWG LbGaamiDaiaadggacaWGIbGaamiBaiaadwgacaGGGcGaamyBaiaadg gacaWGYbGaam4AaiaadwgacaWG0bGaam4CaaqaaiabgUcaRiaaccka caWGMbGaamyyaiaadohacaWG0bGaaeylaiaadAgacaWGVbGaam4Bai aadsgacaGGGcGaam4BaiaadwhacaWG0bGaamiBaiaadwgacaWG0bGa am4CaiaacckacqGHRaWkcaGGGcGaam4yaiaad+gacaWGUbGaamODai aadwgacaWGUbGaamyAaiaadwgacaWGUbGaam4yaiaadwgacaGGGcGa am4CaiaadshacaWGVbGaamOCaiaadwgacaWGZbGaaiiOaiabgUcaRi aacckacaWGJbGaam4Baiaad6gacaWG2bGaamyzaiaad6gacaWGPbGa amyzaiaad6gacaWGJbGaamyzaiaacckacaWGZbGaamiDaiaad+gaca WGYbGaamyzaiaadohacaGGGcGaamyyaiaadshacaGGGcGaamyyaiaa cckacaWGNbGaamyyaiaadohacaGGGcGaam4CaiaadshacaWGHbGaam iDaiaadMgacaWGVbGaamOBaaaacaGLOaGaayzkaaaaaiaabckacaqG 4bGaaeiOaiaaigdacaaIWaGaaGimaaaa@0C7D@

Rmix calculates the proportion of fast-food restaurants relative to fast-food and full-service restaurants within each buffer, defined as:

( 3 )Rmix=  ( fast-food outlets ) ( fast-food outlets+full-service restaurants )  x 100 MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xbbL8F4rqqrFfpeea0xe9Lq=Jc9 vqaqpepm0xbba9pwe9Q8fs0=yqaqpepae9pg0FirpepeKkFr0xfr=x fr=xb9adbaqaaeGaciGaaiaabeqaamaabaabaaGcbaaeaaaaaaaaa8 qadaqadaWdaeaapeGaaG4maaGaayjkaiaawMcaaiaadAeacaWGgbGa amOuaiaad2eacqGH9aqpcaqGGcWaaSaaa8aabaWdbmaabmaapaqaa8 qacaWGMbGaamyyaiaadohacaWG0bGaaeylaiaadAgacaWGVbGaam4B aiaadsgacaGGGcGaam4BaiaadwhacaWG0bGaamiBaiaadwgacaWG0b Gaam4CaaGaayjkaiaawMcaaaWdaeaapeWaaeWaa8aabaWdbiaadAga caWGHbGaam4CaiaadshacaqGTaGaamOzaiaad+gacaWGVbGaamizai aacckacaWGVbGaamyDaiaadshacaWGSbGaamyzaiaadshacaWGZbGa ey4kaSIaamOzaiaadwhacaWGSbGaamiBaiaab2cacaWGZbGaamyzai aadkhacaWG2bGaamyAaiaadogacaWGLbGaaiiOaiaadkhacaWGLbGa am4CaiaadshacaWGHbGaamyDaiaadkhacaWGHbGaamOBaiaadshaca WGZbaacaGLOaGaayzkaaaaaiaabckacaqG4bGaaeiOaiaaigdacaaI WaGaaGimaaaa@7E80@

Statistical methods

Agreement with other secondary food outlet datasets

The neighbourhood-level food environment measures in the researcher Can-FED were compared with measures created from the 2013 Enhanced Points of Interest file developed by DMTI and the 2020 public health inspection list compiled in Peel, Ontario. The DMTI file is a proprietary dataset that contains over 1 million businesses and points of interest in Canada. These data are made available to researchers at Canadian universities for research purposes, and several Canadian food environment association studies have used DMTI data to calculate food environment exposure measures.Note 5 A Data Release Agreement was signed at McGill University. The Peel public health inspection list is maintained by the local public health authority, and it is freely available to download online (https://data.peelregion.ca/data-categories/food-check/food-check-peel.aspx). Outlets in the secondary datasets were categorized in different ways, such as by facility type or Standard Industrial Classification code rather than by NAICS code. The data were recoded based on the Can-FED food outlet types using keywords and the classification attributes in the DMTI dataset or the Peel public health inspection list.

Food outlet data from the Peel public health inspection list and from the DMTI file were mapped in ArcMap (10.7.1). Food outlet density measures were created using the same 1-km buffers as those used to create Can-FED measures. Spearman’s correlation coefficients were calculated to assess the association between the measures from the DMTI file, the Peel public health inspection list and Can-FED. Associations were assessed for chain supermarkets, grocery plus fruit and vegetable stores, restaurants, fast-food outlets, convenience stores, the mRFEI and the Rmix. These types of outlets were included in the general-use file because proximity to them is likely to determine whether people use them, and they have been shown to be associated with diet and health in previous studies.Note 5Note 6 Additionally, the attributes included for each outlet in the Peel public health inspection list did not allow for a more extensive list of food outlet categories.

Census metropolitan area-level food environments and correlations with health outcomes

Descriptive statistics (mean and 95% confidence interval [CI], standard deviation, and interquartile range) were generated from DA measures in the researcher version of Can-FED. Two relative food environment measures, the mRFEI and the Rmix, were calculated for each CMA in Canada (n=35) by aggregating the DA-level relative measures across each CMA boundary. A CMA is formed by one or more adjacent municipalities centred on a population centre, and it must have a population of at least 100,000, of which 50,000 or more live in an urban core.Note 21

Tests of association using Pearson correlation coefficients were run between the two relative food environment variables and five CMA-level indicators of health for 35 CMAs in Canada: the percentage of the population living with diabetes, the percentage of the population living with high blood pressure, the percentage of the population who consumed fruits and vegetables five or more times per day, the percentage of the population with a BMI considered overweight or obese, and the percentage of households in the population that are considered food insecure. Health data were derived by Statistics Canada from the 2017 and 2018 Canadian Community Health Survey (CCHS) based on self-reports from respondents aged 12 and older on having been diagnosed with high blood pressure, having been diagnosed with Type 1 or Type 2 diabetes, and their height and weight.Note 22 The number of times that respondents reported eating fruits and vegetables per day came from the 2015 and 2016 CCHS reference period. Food insecurity data were obtained from a PROOF reportNote 23 that used the 2017 and 2018 CCHS.

Cluster analysis for the general-use Canadian Food Environment Dataset

A k-medians clustering approach was used to create cluster groupings for all absolute and relative densities above zero. A k-medians approach finds k-number of cluster centres to minimize within-group variation of observations (DAs) and to maximize between-group variation. K-medians are more resistant to outliers than k-means.Note 24 Zeros were isolated and put into their own category because zero is a unique and meaningful value for the food environment measures, representing no access to an outlet type. Any value above zero indicates that there is some access to that outlet type in the buffer. This dichotomy can represent two types of different food environments that would not have been captured if zero was not removed from the k-median groupings. A general-use file was created for all of Canada and for each province and territory using 1-km and 3-km buffers around the centroid of each DA.

Results

Researcher and general-use Canadian Food Environment Dataset

In the researcher Can-FED, 19 continuous absolute density measures (count/km2) and 2 continuous relative density measures (mRFEI and Rmix) were calculated within 1-km and 3-km network buffers of 56,312 DAs across Canada (Table 1). In the general-use Can-FED, six categorical absolute density measures and two categorical relative density measures were calculated within 1-km and 3-km network buffers of 56,312 DAs (Table 1). Five grouping clusters were created for each variable in the general-use Can-FED: zero, class 1, class 2, class 3 and class 4. Class 1 represents the DAs with the lowest food environment variable densities, and class 4 represents the DAs with the highest food environment variable densities, as determined from the k-medians cluster analysis.

Agreement with other secondary datasets

Across both the DMTI file and the Peel public health inspection list, the absolute density measures at the DA level were generally modestly (r=0.20 to 0.39) to moderately (r=0.40 to 0.59) correlated with measures from Can-FED (Table 2). Spearman’s correlation coefficients between the DMTI file or the Peel public health inspection list and Can-FED ranged from rs=0.28 for convenience store density derived from the DMTI file to rs=0.53 for full-service restaurant density derived from the Peel public health inspection list. Relative density measures at the DA level were generally less strongly correlated with measures from Can-FED, ranging from rs=0.26 for the Rmix derived from the Peel public health inspection list to rs=0.31 for the mRFEI derived from the Peel public health inspection list.


Table 2
Spearman’s correlation measures between food environment measures
(1-km network buffer)
Table summary
This table displays the results of Spearman’s correlation measures between food environment measures
(1-km network buffer) Peel region, Ontario (appearing as column headers).
Peel region, Ontario
DMTI PHL Can-FED
Convenience store density
DMTI Note ...: not applicable Note ...: not applicable Note ...: not applicable
PHL 0.59 Note ...: not applicable Note ...: not applicable
Can-FED 0.28 0.34 Note ...: not applicable
Chain supermarket density
DMTI Note ...: not applicable Note ...: not applicable Note ...: not applicable
PHL 0.42 Note ...: not applicable Note ...: not applicable
Can-FED 0.34 0.45 Note ...: not applicable
Restaurant density
DMTI Note ...: not applicable Note ...: not applicable Note ...: not applicable
PHL 0.71 Note ...: not applicable Note ...: not applicable
Can-FED 0.49 0.53 Note ...: not applicable
Fast-food density
DMTI Note ...: not applicable Note ...: not applicable Note ...: not applicable
PHL 0.68 Note ...: not applicable Note ...: not applicable
Can-FED 0.48 0.49 Note ...: not applicable
Grocery and fruit and vegetable store density
DMTI Note ...: not applicable Note ...: not applicable Note ...: not applicable
PHL 0.46 Note ...: not applicable Note ...: not applicable
Can-FED 0.33 0.38 Note ...: not applicable
Rmix
DMTI Note ...: not applicable Note ...: not applicable Note ...: not applicable
PHL 0.53 Note ...: not applicable Note ...: not applicable
Can-FED 0.27 0.26 Note ...: not applicable
mRFEI
DMTI Note ...: not applicable Note ...: not applicable Note ...: not applicable
PHL 0.42 Note ...: not applicable Note ...: not applicable
Can-FED 0.28 0.31 Note ...: not applicable

Census metropolitan area-level food environments and correlations with health outcomes

The mean mRFEI using the 1-km buffer size across the CMAs was 15.1% (95% CI: 13.3%, 16.9%), which indicates about 15% of food establishments, on average, provide fresh and nutritious food. Vancouver (30.8%; 95% CI: 29.9%, 31.7%), Victoria (27.0%; 95% CI: 24.6%, 29.4%) and Québec (22.7%; 95% CI: 21.3%, 24.2%) had the highest mRFEI scores, while Saint John (8.0%; 95% CI: 5.6%, 10.3%), Barrie (7.7%; 95% CI: 6.0%, 8.5%) and Belleville (6.4%; 95% CI: 4.3%, 8.5%) had the lowest (Figure 2; Appendix 1).

Figure 2 Description

Data table for Figure 2 
Data table for Figure 2
Table summary
This table displays the results of Data table for Figure 2. The information is grouped by CMA (appearing as row headers), Modified retail food environment index percent (appearing as column headers).
CMA Modified retail food environment index percent
Belleville 6.43
Barrie 7.69
Saint John 7.99
Greater Sudbury / Grand Sudbury 8.93
Peterborough 9.51
Kingston 9.54
Moncton 10.08
St. John's 10.52
St. Catharines–Niagara 10.76
Oshawa 11.38
Windsor 12.19
Halifax 12.45
Abbotsford–Mission 13.16
Sherbrooke 13.53
London 13.89
Edmonton 13.97
Guelph 14.03
Kitchener–Cambridge–Waterloo 14.26
Kelowna 14.60
Brantford 14.72
Trois-Rivières 15.10
Calgary 15.35
Hamilton 15.82
Ottawa–Gatineau 15.88
Saguenay 16.60
Thunder Bay 17.00
Saskatoon 17.56
Regina 19.60
Toronto 20.29
Winnipeg 20.67
Lethbridge 21.22
Montréal 21.90
Québec 22.72
Victoria 27.01
Vancouver 30.82

The mean Rmix across the CMAs using the 1-km buffer siz e was 46.6% (95% CI: 43.8%, 49.4%), meaning that, on average, almost half of all food establishments are fast-food outlets. St. John’s (62.4%; 95% CI: 57.8%, 67.0%), Brantford (59.6%; 95% CI: 55.0%, 64.2%) and Moncton (58.9%; 95% CI: 53.7%, 64.2%) had the highest Rmix scores, while Victoria (33.3%; 95% CI: 30.8%, 35.9%), Montréal (28.8%; 95% CI: 28.1%, 29.4%) and Vancouver (26.4%; 95% CI: 25.6%, 27.3%) had the lowest (Figure 3; Appendix 2).

Figure 3 Description

Data table for Figure 3 
Data table for Figure 3
Table summary
This table displays the results of Data table for Figure 3. The information is grouped by CMA (appearing as row headers), Mean fast-food restaurant mix percent (appearing as column headers).
CMA Mean fast-food restaurant mix percent
Vancouver 26.45
Montréal 28.80
Victoria 33.35
Sherbrooke 35.91
Toronto 36.06
Québec 38.49
Calgary 38.51
Winnipeg 39.79
Windsor 40.93
Saskatoon 43.53
Ottawa–Gatineau 43.82
Guelph 44.99
Edmonton 45.04
Kelowna 45.30
Abbotsford–Mission 46.44
Halifax 46.53
Trois-Rivières 46.54
Barrie 47.16
Peterborough 47.62
Saguenay 47.79
St. Catharines–Niagara 47.94
Regina 48.48
London 50.00
Belleville 50.59
Hamilton 51.69
Kitchener–Cambridge–Waterloo 51.84
Thunder Bay 52.69
Oshawa 53.25
Lethbridge 53.49
Saint John 53.91
Kingston 56.11
Greater Sudbury / Grand Sudbury 58.32
Moncton 58.93
Brantford 59.59
St. John's 62.42

The mRFEI showed a strong negative correlation with the percentage of the population reporting overweight or obese categories of BMI, rp=-0.65 (95% CI: -0.90, -0.40); a moderate negative correlation with the percentage of the population who reported living with diabetes, rp=-0.44 (95% CI: -0.67, -0.12), and high blood pressure, rp=-0.46 (95% CI: -0.69, -0.16); a modest negative correlation with households experiencing food insecurity, rp=-0.34 (95% CI: -0.60, -0.01); and a modest positive (but not statistically significant) correlation with the percentage of the population who reported eating fruits and vegetables five times or more per day, rp=0.32 (95% CI: -0.02, 0.60).

The Rmix showed a strong positive correlation with the percentage of the population who reported overweight or obese categories of BMI, rp=0.74 (95% CI: 0.54, 0.86); a moderate positive correlation with the percentage of the population who reported living with high blood pressure, rp=0.50 (95% CI: 0.19, 0.71); a moderate negative correlation with the percentage of the population who reported eating fruits and vegetables five times or more per day, rp=-0.46 (95% CI: -0.69, -0.14); and a modest positive association with the percentage of the population who reported living with diabetes, rp=0.36 (95% CI: 0.05, 0.63) (Table 3).


Table 3
Pearson correlation coefficients between the census metropolitan area-level relative food environment variables and health
Table summary
This table displays the results of Pearson correlation coefficients between the census metropolitan area-level relative food environment variables and health . The information is grouped by Variable (appearing as row headers), mRFEI, 95% confidence interval , Rmix and 95% confidence interval (appearing as column headers).
Variable 95% confidence interval Rmix 95% confidence interval
Lower Upper Lower Upper
mRFEI Note ...: not applicable Note ...: not applicable Note ...: not applicable -0.67Table 3 Note  -0.82Table 3 Note  -0.44Table 3 Note 
Rmix -0.67Table 3 Note  -0.82Table 3 Note  -0.44Table 3 Note  Note ...: not applicable Note ...: not applicable Note ...: not applicable
Percentage overweight or obese -0.65Table 3 Note  -0.8Table 3 Note  -0.4Table 3 Note  0.74Table 3 Note  0.54Table 3 Note  0.86Table 3 Note 
Percentage diabetic -0.44Table 3 Note  -0.67Table 3 Note  -0.12Table 3 Note  0.36Table 3 Note  0.05Table 3 Note  0.63Table 3 Note 
Percentage with high blood pressure -0.46Table 3 Note  -0.69Table 3 Note  -0.16Table 3 Note  0.5Table 3 Note  0.19Table 3 Note  0.71Table 3 Note 
Percentage who consumed F/V >= 5 times per day 0.32 -0.02 0.6 -0.46Table 3 Note  -0.69Table 3 Note  -0.14Table 3 Note 
Percentage food insecure -0.34Table 3 Note  -0.6Table 3 Note  -0.01Table 3 Note  0.2 -0.13 0.51

Discussion

This study was motivated by the desire to create a pan-Canadian dataset of food environment measures with high accuracy that is accessible to researchers and the public health community. Two versions of Can-FED were created using geo-coded food outlet data from the 2018 Statistics Canada BR—a researcher file and a general-use file. The researcher file contains a wide range of continuous variables that must be accessed in a secure Statistics Canada environment. The general-use file contains a more limited number of categorical variables and is publicly available for download. Results show that there was generally modest to moderate agreement between Can-FED and food environment measures derived using the DMTI proprietary dataset and the Peel public health inspection list. At the CMA level, there is wide geographic variation in the retail food environment, and the food environment measures are correlated with several CMA-level and health indicators.

The researcher Can-FED contains 19 absolute density variables and 2 relative density variables. Several food outlet categories were created so that researchers could address a broad range of research questions. Relative density measures were provided alongside absolute density measures because they account for different types of food outlets that operate in a neighbourhood.Note 25 Additional relative measures can be calculated by researchers using the absolute densities. A recent review from Canada and a review from the United States determined that relative measures were more consistently associated with health-related outcomes.Note 5Note 7

Variables were created by calculating the density of outlets within 1-km and 3-km network buffers around the centroids of each DA in Canada. Network buffers represent the routes people can take to access outlets. In Canada, food environment research has generally used a buffer size of 0.4 km to 1.6 km.Note 5 A study of adults from five American cities (Chapel Hill, Albuquerque, Columbus, Philadelphia and Los Angeles) determined that the average distance travelled to a food establishment was 4.2 km and that a 1.6 km buffer covered 64% of food establishments visited by participants.Note 26 These findings indicate that employing a larger buffer size may result in more consistent and significant associations with the diet and health outcomes of residents in certain regions. Providing multiple buffers allows users to choose the best conceptual fit for their application.Note 27 Densities (count divided by the area) were calculated to standardize the access measures because network buffer areas varied in size based on the density of the road network.

Can-FED measures tended to have modest to moderate agreement with measures derived from the DMTI file and the Peel public health inspection list. Some agreement is lost because of year mismatch, which may be especially important in quickly growing regions like Peel, Ontario. Importantly, the DMTI file and the Peel public health inspection list did not contain the same classification attributes, so individual outlets had to be categorized differently in each dataset to create the food environment categories, leading to possible misclassification. Many duplicates needed to be removed. For example, about 25% of the food outlets in the categories had to be removed because they were listed twice. The records may have been duplicated intentionally in the dataset for inspection purposes, but they needed to be removed before calculating correlations with Can-FED to ensure they were only counted once. It also appears that differential accuracy of food environment variables exists within the datasets. For example, convenience store density derived from the DMTI file was modestly correlated with convenience store density from Can-FED (rs=0.28), while restaurant density derived from the DMTI file was moderately correlated with restaurant density from Can-FED (rs=0.49). These results align with previous research that assessed the validity of secondary datasets. A systematic review and meta-analysis of 20 validation studies of commercially available data determined that there was high variability in data quality depending on the data source assessed, with most of the data sources falling between moderate and substantial validity when compared with a gold standard.Note 9 Because of the advantages of the data collection process in the BR, the Can-FED data are likely to be a gold standard. This can be determined by future validation work.

Canadian cities vary in their abundance of retail food outlets offering highly processed and nutrient-poor food and those offering a wide selection of fresh and nutritious food. The mean mRFEI score ranged from 6.4% in Belleville, Ontario, to 30.8% in Vancouver, British Columbia. This score was higher than most American state-level mRFEI scores, which ranged from 4% to 16%.Note 18 Evidence suggests that neighbourhoods without access to outlets that are likely to offer a wide selection of fresh and nutritious food are less widespread in Canada than in the United States.Note 28 In Canada, areas with access to some outlets that are likely to offer a wide selection of fresh and nutritious food but an overabundance of outlets that offer highly processed and nutrient-poor food are common.Note 28 Additionally, state-level scores include more rural areas than CMA-level scores, and this may tend to have many zeros because of a lack in food outlet access, leading to lower mRFEI scores. The mean Rmix score among DAs with at least one fast-food outlet ranged from 26.4% in Vancouver, British Columbia, to 62.4% in St. John’s, Newfoundland and Labrador. Variations in the food environment across cities could be because of differences in cultural preferences and norms that change the type of food outlets demanded. For example, cities in Quebec have a higher proportion of convenience stores (known as “dépanneurs”) than other cities, which may be, in part, because they sell select alcoholic beverages and have a unique cultural importance. Additionally, differences in urban design such as density, zoning differences and walkability may favour a certain type of outlet.

At the level of CMAs, the percentage of the population living with overweight or obesity, diabetes, or high blood pressure and who experience household food insecurity was lower with higher (more favourable) mRFEI scores. Some evidence showed that the percentage of people who eat fruits and vegetables five or more times per day was higher with higher mRFEI scores. However, this association was inconclusive as the lower bound of the confidence interval fell just below zero. The percentage of the population living with overweight or obesity, diabetes, or high blood pressure was higher in CMAs with higher (less favourable) Rmix scores, and the percentage of the population who ate five or more fruits and vegetables per day decreased with higher Rmix scores. These ecological relationships suggest that food environments have modest to strong relationships with important indicators of cardiometabolic health at the CMA level. Further exploration of using multivariate models and individual-level health data is warranted to understand the independent effect of neighbourhood retail food environments on the health of residents.

Limitations

The CMA-level health outcomes relied on self-reports that can introduce error in different ways, such as recall bias and social desirability bias.Note 11Note 12 Additionally, overweight and obesity were determined by BMI, which cannot distinguish between people with higher muscle mass and those with high fat mass. This may reduce the validity of BMI to measure overweight and obesity.Note 29 Furthermore, the 1-km and 3-km buffer sizes for the retail food environment measured in Can-FED may be too small to capture rural residents’ food shopping behaviour. Further research is needed to determine the average distance Canadians in rural areas travel to access food outlets. An American study determined that the distance to the nearest supermarket for people in rural areas was estimated to be 2.1 km at the 20th percentile, 5.6 km at the median and 10.2 km at the 80th percentile.Note 30 More considerations will be needed to adapt for rural areas since distance, density and presence alone vary considerably by rural area.

Conclusion

Can-FED is a pan-Canadian dataset of food environment measures using food outlet data derived from the BR. Previous research in this field has used secondary datasets from proprietary or local government sources. There are concerns about the accuracy, accessibility, timeliness and geographic coverage of these types of secondary datasets.Note 8Note 9Note 10 The BR stores data on businesses across Canada that are identified from mandatory business tax data collected by the CRA. All establishments are coded with the most up-to-date industry classification (NAICS) codes, and Statistics Canada staff continues to perform ongoing quality evaluation. Can-FED provides new, high-quality and flexible national measures of the food environment and comes in two versions. The researcher dataset contains continuous variables that can be accessed in a secure Statistics Canada environment. The general-use file contains a selected number of easy-to-use categorical variables and is available publicly. The datasets will allow for surveillance of the spatial variation of food environments across Canada and will open up opportunities for etiological studies when linked to national or investigator-led surveys.

The neighbourhood retail food environment measures can be adapted in future versions of Can-FED based on the best new evidence on how people access food outlets and what types of food outlets are relevant to diet and health outcomes. Researchers may also wish to consider how to weigh food outlets based on their size and opening hours, and how to incorporate new statistical and geographic methods that are available to future versions. A consensus on how often the measures need to be updated is warranted.

Acknowledgements

This study is supported by a grant from the Canadian Institutes of Health Research (CIHR) (grant reference number DA2-162516) and by funding from the Canadian Urban Environmental Health Research Consortium. Research by Andrew C. Stevenson is supported by a Fonds de recherche du Québec – Santé doctoral training award. Research by Nancy A. Ross is supported through a Canada Research Chair. Research by Thomas Burgoine is funded by the Centre for Diet and Activity Research, a UK Clinical Research Collaboration Public Health Research Centre of Excellence. Funding from the British Heart Foundation, Cancer Research UK, the Economic and Social Research Council, the Medical Research Council, the National Institute for Health Research, and the Wellcome Trust (grant reference number MR/K023187/1), under the auspices of the UK Clinical Research Collaboration, is gratefully acknowledged. Research by Thomas Burgoine is also supported by the MRC Epidemiology Unit, University of Cambridge (grant reference number MC/UU/00006/7).

The analysis presented in this paper was conducted at the Quebec Interuniversity Centre for Social Statistics (QICSS), which is part of the Canadian Research Data Centre Network (CRDCN). The services and activities provided by the QICSS are made possible by the financial or in-kind support of the Social Sciences and Humanities Research Council, the CIHR, the Canada Foundation for Innovation, Statistics Canada, the Fonds de recherche du Québec, and Quebec universities. The views expressed in this paper are those of the authors and not necessarily those of the CRDCN, the QICSS or their partners.

We would like to thank the staff of the Data Integration Infrastructure Division and the Health Analysis Division at Statistics Canada for their ongoing support and guidance on this project. The classification of food outlets into the food environment categories were based on the subjective assessment by the research team, and not by Statistics Canada.

Appendix


Appendix Table A.1
Mean census metropolitan area-level modified retail food environment index and 95% confidence interval, standard deviation, first quartile and third quartile values, interquartile range, and total number of dissemination areas scores using the 1-km buffer size
Table summary
This table displays the results of Mean census metropolitan area-level modified retail food environment index and 95% confidence interval. The information is grouped by Census metropolitan areas (appearing as row headers), Mean mRFEI, 95% confidence interval, Standard deviation, First quartile, Third quartile, Interquartile range and Number of dissemination areas (appearing as column headers).
Census metropolitan areas Mean mRFEI 95% confidence interval Standard deviation First quartile Third quartile Interquartile range Number of dissemination areas
Lower Upper
Abbotsford–Mission 13.2 10.0 16.3 26.2 0 16.7 16.7 265
Barrie 7.7 6.0 9.4 15.8 0 10.0 10.0 333
Belleville 6.4 4.3 8.5 14.3 0 0.0 0.0 178
Brantford 14.7 12.2 17.3 20.1 0 25.0 25.0 238
Calgary 15.4 14.3 16.4 22.5 0 25.0 25.0 1,759
Edmonton 14.0 12.9 15.0 22.0 0 22.2 22.2 1,688
Greater Sudbury / Grand Sudbury 8.9 6.5 11.3 20.1 0 0.0 0.0 269
Guelph 14.0 11.4 16.6 20.3 0 25.0 25.0 234
Halifax 12.4 10.7 14.2 21.9 0 20.0 20.0 601
Hamilton 15.8 14.7 17.0 20.3 0 25.0 25.0 1,199
Kelowna 14.6 11.2 18.0 27.7 0 20.0 20.0 255
Kingston 9.5 6.9 12.2 21.6 0 9.4 9.4 255
Kitchener–Cambridge–Waterloo 14.3 12.7 15.8 21.5 0 25.0 25.0 736
Lethbridge 21.2 16.6 25.9 31.4 0 33.3 33.3 175
London 13.9 12.3 15.4 21.8 0 25.0 25.0 760
Moncton 10.1 7.1 13.0 22.6 0 12.5 12.5 226
Montréal 21.9 21.4 22.4 20.5 0 33.3 33.3 6,469
Oshawa 11.4 9.9 12.9 18.4 0 20.0 20.0 580
Ottawa–Gatineau 15.9 14.8 16.9 23.6 0 25.0 25.0 1,947
Peterborough 9.5 6.8 12.2 19.4 0 14.3 14.3 198
Québec 22.7 21.3 24.2 26.6 0 37.5 37.5 1,291
Regina 19.6 16.6 22.6 29.9 0 30.0 30.0 381
Saguenay 16.6 13.3 19.9 28.9 0 25.0 25.0 295
Saint John 8.0 5.6 10.3 18.7 0 0.0 0.0 242
Saskatoon 17.6 15.2 19.9 24.8 0 28.6 28.6 428
Sherbrooke 13.5 11.4 15.6 19.4 0 25.0 25.0 327
St. Catharines–Niagara 10.8 9.2 12.3 20.6 0 16.7 16.7 678
St. John's 10.5 8.2 12.8 21.2 0 14.3 14.3 326
Thunder Bay 17.0 13.7 20.3 26.0 0 33.3 33.3 238
Toronto 20.3 19.8 20.8 22.1 0 31.3 31.3 7,525
Trois-Rivières 15.1 12.4 17.8 22.7 0 25.0 25.0 272
Vancouver 30.8 29.9 31.7 27.0 0 50.0 50.0 3,450
Victoria 27.0 24.6 29.4 29.3 0 50.0 50.0 574
Windsor 12.2 10.7 13.7 17.9 0 20.0 20.0 548
Winnipeg 20.7 19.3 22.0 24.1 0 33.3 33.3 1,229

Appendix Table A.2
Mean census metropolitan area-level fast-food restaurant mix scores and 95% confidence interval, standard deviation, first quartile and third quartile values, interquartile range, and total number of dissemination areas using the 1-km buffer around dissemination areas with one or more fast-food outlets
Table summary
This table displays the results of Mean census metropolitan area-level fast-food restaurant mix scores and 95% confidence interval. The information is grouped by Census metropolitan areas (appearing as row headers), Mean Rmix, 95% confidence interval, Standard deviation, First quartile, Third quartile, Interquartile range and Number of dissemination areas (appearing as column headers).
Census metropolitan areas Mean Rmix 95% confidence interval Standard deviation First quartile Third quartile Interquartile range Number of dissemination areas
Lower Upper
Vancouver 26.4 25.6 27.3 20.3 0.0 25.0 25.0 2,183
Montréal 28.8 28.1 29.4 21.6 0.0 28.57 28.57 4,226
Victoria 33.3 30.8 35.9 21.1 0.0 33.3 33.3 263
Sherbrooke 35.9 31.8 40.0 24.8 0.0 33.3 33.3 140
Toronto 36.1 35.5 36.6 21.0 11.1 40.0 28.9 5,598
Calgary 38.5 37.1 39.9 22.4 0.0 37.5 37.5 986
Québec 38.5 36.2 40.8 28.9 0.0 33.3 33.3 605
Winnipeg 39.8 38.1 41.5 23.7 0.0 40.0 40.0 749
Windsor 40.9 38.2 43.7 25.6 10.0 40.0 30.0 332
Saskatoon 43.5 40.6 46.5 24.1 4.8 50.0 45.2 257
Ottawa–Gatineau 43.8 42.3 45.3 25.7 0.0 50.0 50.0 1,127
Edmonton 45.0 43.4 46.7 26.0 0.0 47.8 47.8 956
Guelph 45.0 40.6 49.4 26.0 0.0 50.0 50.0 134
Kelowna 45.3 39.7 50.9 27.8 0.0 40.0 40.0 95
Abbotsford–Mission 46.4 42.3 50.5 20.4 0.0 50.0 50.0 95
Halifax 46.5 43.4 49.7 26.7 9.2 50.0 40.8 277
Trois-Rivières 46.5 42.1 51.0 26.0 0.0 50.0 50.0 131
Barrie 47.2 43.8 50.5 22.3 0.0 50.0 50.0 170
Peterborough 47.6 40.8 54.4 32.0 0.0 35.7 35.7 85
Saguenay 47.8 41.1 54.5 27.3 0.0 33.3 33.3 64
St. Catharines–Niagara 47.9 45.3 50.6 26.4 0.0 50.0 50.0 382
Regina 48.5 44.7 52.2 27.1 0.0 42.9 42.9 201
London 50.0 47.6 52.4 24.9 0.0 50.0 50.0 414
Belleville 50.6 44.4 56.8 26.7 0.0 50.0 50.0 71
Hamilton 51.7 50.1 53.3 23.5 18.2 52.9 34.7 832
Kitchener–Cambridge–Waterloo 51.8 49.3 54.4 27.7 0.0 50.0 50.0 454
Thunder Bay 52.7 47.5 57.9 29.4 16.7 50.0 33.3 123
Oshawa 53.2 50.4 56.1 26.3 0.0 50.0 50.0 326
Lethbridge 53.5 47.7 59.3 25.8 0.0 50.0 50.0 76
Saint John 53.9 48.0 59.9 25.6 0.0 55.6 55.6 71
Kingston 56.1 51.1 61.1 28.4 0.0 50.0 50.0 124
Greater Sudbury / Grand Sudbury 58.3 52.9 63.7 26.3 0.0 50.0 50.0 91
Moncton 58.9 53.7 64.2 27.1 0.0 57.1 57.1 102
Brantford 59.6 55.0 64.2 27.6 0.0 57.1 57.1 138
St. John's 62.4 57.8 67.0 28.9 0.0 57.1 57.1 152
Date modified: