Survey Methodology
A layered perturbation method for the protection of tabular outputs
Archived Content
Information identified as archived is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please "contact us" to request a format other than those available.
- Release date: June 22, 2017
Abstract
The protection of data confidentiality in tables of magnitude can become extremely difficult when working in a custom tabulation environment. A relatively simple solution consists of perturbing the underlying microdata beforehand, but the negative impact on the accuracy of aggregates can be too high. A perturbative method is proposed that aims to better balance the needs of data protection and data accuracy in such an environment. The method works by processing the data in each cell in layers, applying higher levels of perturbation for the largest values and little or no perturbation for the smallest ones. The method is primarily aimed at protecting personal data, which tend to be less skewed than business data.
Key Words: Confidentiality; Data perturbation; Tabular outputs.
Table of contents
- Section 1. Introduction
- Section 2. Background
- Section 3. The Layered Perturbation Method (LPM)
- Section 4. Empirical investigation
- Section 5. Discussion and challenges
- References
How to cite
Tambay, J.-L. (2017). A layered perturbation method for the protection of tabular outputs. Survey Methodology, Statistics Canada, Catalogue No. 12-001-X, Vol. 43, No. 1. Paper available at http://www.statcan.gc.ca/pub/12-001-x/2017001/article/14818-eng.htm.
Note
- Date modified: