Integration of existing data to develop an ethnicity indicator in the LSDDP
Articles and reports: 11-522-X202200100018Description: The Longitudinal Social Data Development Program (LSDDP) is a social data integration approach aimed at providing longitudinal analytical opportunities without imposing additional burden on respondents. The LSDDP uses a multitude of signals from different data sources for the same individual, which helps to better understand their interactions and track changes over time. This article looks at how the ethnicity status of people in Canada can be estimated at the most detailed disaggregated level possible using the results from a variety of business rules applied to linked data and to the LSDDP denominator. It will then show how improvements were obtained using machine learning methods, such as decision trees and random forest techniques. Issue Number: 2022001Author(s): Saidi, Abdelnasser; Farah, Aziz; Diagne, BassirouMain Product:Statistics Canada International Symposium Series: Proceedings