Overview of record linkage

Surveys and statistical programs – Documentation: 11-522-X19990015660
Description:

There are many different situations in which one or more files need to be linked. With one file, the purpose of the linkage is to locate duplicates within the file. With two files, the linkage identifies the units that are the same on both files and thus creates matched pairs. Often the records that need to be linked do not have a unique identifier. Hierarchical record linkage, probabilistic record linkage and statistical matching are three methods that can be used when the files to be linked have no unique identifier. We describe the major differences between these methods. We consider how to choose the linkage variables, how to prepare files for linkage and how the links are identified. We also review tips and tricks used when linking files. Two examples are presented: the probabilistic record linkage used in the reverse record check, and the hierarchical record linkage of the Business Number (BN) master file to the Statistical Universe File (SUF) of unincorporated tax filers (T1).

Issue Number: 1999001
Author(s): Bernier, Julie; Nobrega, Karla
Main Product: Statistics Canada International Symposium Series: Proceedings
Format: CD-ROM
Release date: March 2, 2000
