
Re-identifying register data by survey data: an empirical study


Bender, Stefan; Brand, Ruth; Bacher, Johann


Please always refer to the following Persistent Identifier (PID) when citing this document: http://nbn-resolving.de/urn:nbn:de:0168-ssoar-236119

Further details:
Abstract "More and more empirical researchers from universities or research centres would like to use register data collected by statistical agencies or the social security system, because these data can be used for several empirical studies, e.g. the analysis of special groups or quantitative effects of economic policies. Most of the register data required have to be (factually) anonymised before they are disseminated to preserve confidentiality. Therefore re-identification risks for register data are examined by matching a sample of register data with survey data, collected especially for scientific purposes. Three methods were applied: the uniqueness approach, a simple distance estimation and a cluster analysis. The data sets used were two birth cohorts (1964 and 1971) of the German employment statistics (register data) and the German Life History Study. The analysis show that a re-identification of real persons may be possible by a standard-cluster analysis or a simple distance criterion if an intruder has access to additional information. The number of re-identifiable persons is remarkably high although the proportion of re-identifiable persons is less than expected on the basis of the uniqueness-approach." (author's abstract)
Thesaurus keywords data; empirical social research; official statistics; survey; interview; comparison of methods; data preparation; analysis; anonymity; personal data; data protection
Classification Data collection and data analysis techniques in the social sciences
Method basic research; development of methods
Document language English
Publication year 2001
Page range pp. 373-381
Journal title Statistical journal of the United Nations Economic Commission for Europe, 18 (2001) 4
Licence Deposit Licence - No redistribution, no modifications
Data provider This metadata record was created by the Sondersammelgebiet Sozialwissenschaften (Special Subject Collection for Social Sciences, USB Köln).
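
Illustrative note: the abstract names a uniqueness approach and a simple distance criterion for matching survey records against register records. The following is a minimal sketch of those two ideas, not the authors' implementation. The variable names, the toy records, the city-block distance and the rule of rejecting tied matches are all assumptions made for illustration only; the record does not state which key variables or distance measure the study used.

```python
from collections import Counter

def unique_fraction(records, keys):
    """Share of records whose combination of key variables occurs exactly once."""
    combos = Counter(tuple(r[k] for k in keys) for r in records)
    unique = sum(1 for r in records if combos[tuple(r[k] for k in keys)] == 1)
    return unique / len(records)

def nearest_match(survey_row, register, keys):
    """Return (index, distance) of the closest register record.

    Uses a city-block distance (an assumption). The index is None when
    several register records are equally close, since a tie gives no
    unambiguous match.
    """
    dists = [sum(abs(survey_row[k] - r[k]) for k in keys) for r in register]
    best = min(dists)
    idx = dists.index(best) if dists.count(best) == 1 else None
    return idx, best

# Hypothetical toy data; not taken from the study.
keys = ["birth_year", "sex", "years_employed"]
register = [
    {"birth_year": 1964, "sex": 0, "years_employed": 10},
    {"birth_year": 1964, "sex": 0, "years_employed": 10},
    {"birth_year": 1964, "sex": 1, "years_employed": 7},
    {"birth_year": 1971, "sex": 1, "years_employed": 3},
]

share_unique = unique_fraction(register, keys)
match = nearest_match(
    {"birth_year": 1971, "sex": 1, "years_employed": 4}, register, keys
)
tied = nearest_match(
    {"birth_year": 1964, "sex": 0, "years_employed": 10}, register, keys
)
```

In this toy setting, records that are unique on the key variables give an upper bound on how many could be singled out, while the distance criterion tolerates small discrepancies between survey and register values. This matches the abstract's remark that the proportion actually re-identified is lower than the uniqueness approach suggests.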