Technical Reference Guides for the Education and Labour Market Longitudinal Platform (ELMLP)
Overview of the Education and Labour Market Longitudinal Platform and Associated Datasets, 2024

Release date: August 15, 2024

Skip to text

Text begins

1. Introduction

Statistics Canada, in collaboration with the provincial and territorial ministries of education, Employment and Social Development Canada (ESDC), and other stakeholders, has developed the Education and Labour Market Longitudinal Platform (ELMLP).

The ELMLP allows longitudinal integration of administrative data related to education with other data sources to create anonymized, customized datasets for analytical purposes.

The ELMLP fills data gaps and enables a greater understanding of student and apprenticeship pathways, transitions to the labour market and outcomes over time. Data from the ELMLP can help address a wide range of policy questions pertaining to student and apprenticeship persistence, completion, mobility and pathways as well as their labour market outcomes.

These data allow policy makers to understand the different types of trajectories that students can take through their postsecondary education or apprenticeship training as well as student characteristics that may be related to these trajectories.

The target users for the ELMLP include provincial and territorial ministries of education, apprenticeship authorities, postsecondary educational institutions, federal government departments, members of the academic community, researchers, students and parents interested in graduate outcomes and other stakeholder groups involved in education and the labour market.

2. The Education and Labour Market Longitudinal Platform (ELMLP)

2.1 The key features of the ELMLP

  1. Platform – The ELMLP allows researchers to integrate selected datasets to access information about past cohorts of college/university students and registered apprentices, in order to better understand their pathways and the ways in which education and training affected their career prospects.
  2. Securely integrable datasets - These datasets that can be integrated allow researchers to know more than what a single dataset or survey can provide. Securely integrable datasets means that any of the datasets in the ELMLP may be combined with any other dataset using an anonymous linkage identifier found on each file. The integration of datasets is carried out within the Statistics Canada Social Data Linkage Environment (SDLE), which maintains the highest privacy and data security standards. The ELMLP is not a large integrated dataset, but rather a relational data environment. Researchers are only given access to the files necessary for their analysis. After identifying which ELMLP datasets are needed to answer a specific research or policy question, researchers can use the anonymous identifier located on each file to bring these datasets together.
  3. Longitudinal data – The data available within the Platform can be linked longitudinally, allowing researchers to better understand the characteristics, pathways and outcomes of students and apprentices over time.
  4. Accessible data – Datasets prepared for the Platform are made available to researchers through the Research Data Centre (RDC) network across Canada.

2.2 Accessibility, confidentiality and privacy

Datasets that are integrated in the ELMLP are deemed sensitive statistical information and are subject to the confidentiality requirements of the Statistics Act. Statistics Canada employees who work with the integrated datasets for research purposes have access to the data only after they have been stripped of personal identifiers. Furthermore, only Statistics Canada employees and deemed employees who have an approved need to access the data for their analytical work are allowed access to the files.

These data are treated with the same level of confidentiality as surveys administered by Statistics Canada.

Indicators, analysis and findings from the ELMLP are released through Statistics Canada’s website.

The anonymized ELMLP data are also available in Statistics Canada’s Research Data Centres (RDC) to researchers with approved projects only. These researchers are provided with access in a secure setting at the RDCs, which are staffed by Statistics Canada employees. The RDCs operate under the provisions of the Statistics Act in accordance with all confidentiality rules, and are only accessible to researchers once they have been sworn in under the Statistics Act as “deemed employees.” All analytical results are vetted before they can be taken out of the RDCs to ensure that confidentiality is maintained and individuals cannot be identified.

2.3 Core and supplementary datasets

The ELMLP consists of two types of datasets: core and supplementary.

Core datasets are updated in the ELMLP on an annual basis to include the most recently released data.  

Supplementary datasets are updated with a frequency that varies according to a number of factors and may not always include the most recent data years. Concordance key files are necessary when integrating many of the supplementary datasets, to either the ELMLP core datasets or other supplementary datasets.

Core datasets

The core datasets are:

  1. The Postsecondary Student Information System (PSIS) – an annual administrative dataset of all Canadian (provincial and territorial) public college and university enrolments and graduations by type of program, type of credential, and field of study for each reporting year. PSIS collects information pertaining to the programs and courses offered at an institution, as well as demographic information for the students and the program(s) and course(s) in which they were registered, or from which they have graduated. The ELMLP includes longitudinal PSIS data from academic year 2009/2010 onwards. For more information, see Statistics Canada’s PSIS information page.
  2. The Registered Apprenticeship Information System (RAIS) – an annual administrative dataset of all Canadian (provincial and territorial) registered apprentices and trade qualifiers. The RAIS compiles data on the number of registered apprentices taking in-class and/or on-the-job training in trades that are either Red Seal or non-Red Seal and where apprenticeship training is either compulsory or voluntary. It also compiles data on the number of provincial and interprovincial certificates granted to apprentices or trade qualifiers (challengers). The purpose of the RAIS is to gather information on individuals who receive training and those who obtain certification within a trade where apprenticeship training is being offered. The ELMLP includes RAIS data from 2008 onward. For more information, see Statistics Canada’s RAIS information page.
  3. Income tax from the T1 Family File (T1FF), subset for the PSIS and RAIS populationsan annual administrative dataset which includes selected information from income tax data from 1992 onwards for the PSIS and RAIS populations. The source T1 Family File (T1FF) covers individuals who filed taxes in Canada and their families and is derived from their T1 income tax returns (when they are available). For the most part, tax returns were filed in the spring of the year following the reference year. Income information is based on the calendar year. There is no T1FF datafile available in the ELMLP that includes the total tax-filing population of Canada. For more information, see Statistics Canada’s T1FF information page.

Supplementary datasets

The supplementary files are:

1.  Census of Population, Long Form, 2016 and 2021 cycles– datasets that provide information about the Canadian population, including characteristics such as age, sex, type of dwelling, families, households and marital status, and language. For the 2016 Census and 2021 Census, Canadian households were enumerated using two main types of questionnaires: the short-form questionnaire and the long-form questionnaire. The long-form questionnaire includes the same questions as the short form, as well as a series of questions, such as those about visible minorities, aimed at providing a more comprehensive portrait of the Canadian population and Canadian households. The long-form questionnaires were sent to a sample of the population covering approximately 1 in 4 households. For more information, see Statistics Canada’s Census information page.

2. Longitudinal Administrative Databank (LAD), 1982 to 2021 – an administrative longitudinal dataset designed as a research tool on income and demographics. It uses a longitudinal sample of 20% of the records in the annual T1 Family File representing the total population of individuals who filed taxes in Canada and their families. It can be used as a control group or for residual group analysis. For more information, see Statistics Canada’s LAD information page.

3. Longitudinal Immigration Database (IMDB), 1952 to 2022 – an administrative longitudinal dataset that includes immigration data for all immigrants since 1952 and non-permanent residents since 1980. Outcomes for this population are available from tax files since 1982. The IMDB provides detailed information on socioeconomic outcomes of immigrants after their admission. For more information, see Statistics Canada’s IMDB information page.

4. National Graduates Survey (NGS), Class of 2015 and Class of 2020 – datasets containing information from a sample survey of graduates of public postsecondary educational institutions in Canada for the class of 2015 and the class of 2020. The questions focus on academic path, funding for postsecondary education (including government-sponsored student loans), and transition into the labour market. New cycles of this survey take place every 5 years. For more information, see Statistics Canada’s NGS information page.

5. Canada Student Financial Assistance Program (CSFA), 2009/2010 to 2020/2021 (formerly called Canada Student Loans Program (CSLP), 2003/2004 to 2015/2016) – an administrative dataset which provides information on both non-repayable and repayable student financial assistance (grants and loans) to eligible students to help them access and afford postsecondary education. The CSFA data provides information on needs assessment, disbursement and repayment of loans and grants. All provinces and territories participate in the CSFA, except Quebec, the Northwest Territories and Nunavut, who receive alternative payments from the Government of Canada to administer their own student financial aid programs. The CSFA data at Statistics Canada only reflects the federal portion of student financial aid (i.e. not any provincial or territorial student financial aid). Federal-provincial/territorial arrangements vary by jurisdiction.

6. Canada Education Savings Program (CESP), 1998 to 2021 – administrative datasets which include information on the CESP and the Canada Learning Bond (CLB) program, both of which are federal savings incentive programs that are available for eligible Registered Education Savings Plan (RESP) beneficiaries. It includes information on the RESP, an Education Savings Plan that has been registered with the Canada Revenue Agency. It is a savings vehicle used by individuals and families to save for children's postsecondary education. The CESP and the CLB programs provide additional government grants for postsecondary studies based on the original RESP contributions and other criteria. The files provide information on the beneficiaries, the subscribers and the primary caregivers and on contributions, grants and withdrawals.

7. Canada Apprentice Loans (CAL), 2015 to 2023 – an administrative datafile which provides information on those who are eligible to receive loans during the period in which they are registered in Red Seal apprenticeships and are taking technical training. The loans help cover the costs of apprenticeship training.

8. Apprenticeship Incentive Grants (AIG), 2007 to 2023 – an administrative data file which provides information on taxable cash grants for those registered with their provincial/territorial apprenticeship authority as an apprentice. To receive it, applicants are required to prove that they have successfully completed either the first and/or second year or level (or equivalent) in a designated Red Seal tradeNote 1.

9. Apprenticeship Completion Grant (ACG), 2009 to 2023 – an administrative data file that provides information about one-time taxable cash grants designed to encourage apprentices to complete their apprenticeship programNote 1.

10.  Apprenticeship Incentive Grant for Women (AIG-W), April 1st 2018 to 2023 - an administrative data file which provides information on taxable cash grants for women registered with their provincial/territorial apprenticeship authority as an apprentice. To receive it, applicants are required to prove that they have successfully completed either the first and/or second year or level (or equivalent) in a designated Red Seal trade where women are underrepresentedNote 1.

11. National Apprenticeship Survey (NAS), 2015 – a dataset containing information from a sample survey that looks at factors affecting the completion, certification and transition of apprentices to the labour market. For more information, see Statistics Canada’s NAS information page.

12. Employment Insurance Status Vector (EISV), 2007 to 2022 – an administrative dataset that follows the trajectory of Employment Insurance (EI) claims and benefit payments from application to termination. It describes the characteristics of the EI claimants, provides summary information for the claim and a week by week account of claimant activity during the life of the claim.

13. Emergency and recovery benefits (ERB), March 2020 to 2022 – this dataset is intended to cover the major income support measures offered to Canadians as a result of the COVID-19 pandemic in 2020 through 2022. These direct support programs include the Canada Emergency Response Benefit (CERB), the Canada Emergency Student Benefit (CESB), the Canada Recovery Benefit (CRB), the Canada Recovery Caregiving Benefit (CRCB), the Canada Recovery Sickness Benefit (CRSB), and the Canada Worker Lockdown Benefit (CWLB). Currently, concordance keys are only available in the RDCs to integrate selected years of PSIS and RAIS with the CERB, the CRB, the CRCB, and the CRSB. For more information on the ERB, see Statistics Canada’s ERB information page.

14. British Columbia Kindergarten to grade 12 data (BC K-12), 1991/1992 to 2021/2022 – an administrative dataset which provides student-level information for students who attended British Columbia’s public and independent schools during the reference period. The data includes information about student characteristics (gender, age, geographic location, etc.) and progress through the education system (graduation, Foundation Skills Assessment results, exam marks, etc.).

15. Ontario grade 9 to 12 data (ON 9-12), grade 9 entrant cohorts of 2009/2010 to 2012/2013; grade 12 cohorts of 2013/2014 to 2015/2016 – administrative dataset which provide anonymized, longitudinal, student-level information for specific cohorts of students who attended a public secondary school or a publicly-funded Roman Catholic secondary school in Ontario. The data includes information about student characteristics (gender, age, exceptionality, etc.), and progress through the secondary school system (graduation, course marks, Education Quality and Accountability Office testing results, etc.).

16. Survey of Approaches to Educational Planning (SAEP), 2020 – a dataset containing information from a sample survey that gathers information from parents and guardians of children aged 0 to 17 about their attitudes and expectations with respect to their child’s postsecondary education, the strategies they use to prepare for their children's postsecondary education, their financial plans for paying for their schooling, and the barriers to saving for higher education. The dataset also includes demographic and labour force status information. For more information, see Statistics Canada’s SAEP information page.

17. T1FF for PSIS and RAIS —parental income variant (Consumer Price Index-adjusted), 1992 to 2021 – an administrative dataset derived from the T1 Family File (T1FF) subset file, for the Postsecondary Student Information System (PSIS) and the Registered Apprenticeship Information System (RAIS), that provides information on the income of parents of postsecondary students, registered apprentices and trade qualifiers. The dataset is available for analytical convenience, as all of the variables can be re-derived using the T1 Family File of the ELMLP core files.

18. Income tax from the T1 Family File (T1FF), subsets for various ELMLP supplementary dataset populations - The T1FF has also been integrated with some of the ELMLP supplementary data sources, with records only available for those subsets of the Canadian population. Currently, a T1FF subset is available for the ELMLP core datasets (PSIS and RAIS), the British Columbia Kindergarten to grade 12 (BC K-12) data, the BC K-12 parents, the Canada Education Savings Program (beneficiaries and primary caregivers), the Ontario grade 9 to 12 data (ON 9-12), and the Canada Student Financial Assistance Program. These supplementary T1FF subsets are not necessarily updated on an annual basis.


Date modified: