SDC harmonisation (Finished)

Coordinator: Anco Hundepool
Statistics Netherlands
E-mail: ahnl@cbs.nl

The main purpose of this ESSNet is to pave the way for future cooperation in Europe. The main outcomes of the project will therefore be case studies investigating new directions to be realised by future projects, and an architecture that makes the ARGUS software more open so that the team of developers can be enlarged.


Task 1. Framework for the harmonisation of microdata protection

Task 1-1: Choice of measure for the output
A crucial step in defining the methodologies that may be applied to the data is to define the set of objective measures for the output that every candidate method must maintain. Identifying these benchmarking statistics will require, in particular, the participation of survey experts. Views on possible candidate statistics will be shared with experts on the CVTS (Continuing Vocational Training Survey) to reach a general agreement from the side of the data producers.
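
As an illustration of what a benchmarking statistic could look like in practice, the sketch below computes a few summary statistics on an original and a protected version of the same variable and reports their relative deviation. The choice of statistics (mean, standard deviation, total), the toy values and the function names are assumptions made for this example only; the actual measures are to be agreed with the survey experts.

```python
from statistics import mean, pstdev


def benchmark_stats(values):
    """Return a small, assumed set of candidate benchmarking statistics."""
    return {"mean": mean(values), "std": pstdev(values), "total": sum(values)}


def relative_deviation(original, protected):
    """Relative absolute deviation of each statistic after protection."""
    before = benchmark_stats(original)
    after = benchmark_stats(protected)
    return {
        name: abs(after[name] - before[name]) / abs(before[name])
        for name in before
        if before[name] != 0  # skip statistics that cannot be normalised
    }


# Hypothetical values for five enterprises (not real CVTS data).
original = [120.0, 80.0, 100.0, 150.0, 50.0]
# Hypothetical protected version of the same records.
protected = [115.0, 85.0, 100.0, 140.0, 60.0]

deviations = relative_deviation(original, protected)
```

In this toy case the total and the mean are preserved exactly while the standard deviation shrinks, which is the kind of trade-off a set of agreed benchmarking statistics would make visible.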

Task 1-2: User needs
User needs are essential when identifying possible anonymisation methods. An analysis of the projects that researchers have undertaken on the data, together with contacts with users of CVTS data, will contribute to the definition of the benchmarking statistics and to the prioritisation of features.

Task 1-3: Definition of methodologies
Destatis and Statistics Austria have already presented proposals for protecting CVTS data (Brandt and Hafner, 2007; Meindl and Templ, 2008; Lenz et al., 2006), as have others (Belsby et al., 2005). These methods will be reviewed to study ways of achieving anonymisation that take into account the new framework and the benchmarking statistics. Istat will also contribute by defining a further methodology.

Task 1-4: Implementation and reporting
The method will be implemented for CVTS data from a few participating member states (Sweden, Austria, Germany, Italy and the Netherlands) to show feasibility. A report on the implementation of the above framework will be produced at the end of the project, detailing the pros and cons, the critical points encountered by the member states, the viability of the general proposal and future directions.

D1.1 – A report describing and motivating the chosen objective measures of the output and the users’ needs. This document will be shared with the participating member states (month 2).
D1.2 – A report describing the new methodology that satisfies the benchmarking statistics described in D1.1 (month 5).
D1.3 – Routines to perform the framework methodology proposed by Istat (month 5)
D1.4 – Documentation for implementation (month 5)
D1.5 – Report on the experience and recommendations for the future (month 12)

Task 2. Case studies on tabular data

Task 2-1: Case studies cell suppression

Task 2-1a will follow the approaches of Capobianchi and Franconi (2009) and Virgili and Franconi (2009), describing and comparing the analytical process and its outcome for several project partner countries (Germany, the Netherlands) for relevant parts of the SBS (for instance, manufacturing industry) and FATS publications. A description of the current implementation for SBS is also envisaged, in order to share experiences on the production side.
Task 2-1b will use the software implementations in τ-ARGUS (De Wolf and Hundepool, 2010) and the wrapper function from SAS to τ-ARGUS developed by Schmidt and Giessing (2010) to process the data sets resulting from task 2-1a. Destatis will also explain how to use the implementation of Schmidt and Giessing (2010) at stage 2 and compare this approach with that of the Eurostat software CIF.
Task 2-1c will draw conclusions from the results of tasks 2-1a and 2-1b, producing ideas and recommendations for the essential building blocks of a methodological framework for the harmonisation of tabular SDC methods based on cell suppression.
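
Cell suppression starts from a rule that decides which cells of a magnitude table are sensitive. The source does not state which rule the case studies use, so the following is only a minimal sketch of one common choice, the p% rule, under which a cell is sensitive when the second-largest contributor could estimate the largest contribution to within p percent. The function name, the value p = 10 and the toy cells are assumptions for illustration; this is not the τ-ARGUS implementation.

```python
def is_sensitive_p_percent(contributions, p=10.0):
    """Apply the p% rule to the individual contributions of one cell.

    The cell is flagged when the total minus the two largest contributions
    is smaller than p percent of the largest contribution.
    """
    ordered = sorted(contributions, reverse=True)
    if not ordered:
        return False  # an empty cell discloses nothing
    largest = ordered[0]
    second = ordered[1] if len(ordered) > 1 else 0.0
    remainder = sum(ordered) - largest - second
    return remainder < (p / 100.0) * largest


# Hypothetical turnover contributions per cell (not real SBS data).
cells = {
    "A": [500, 30, 10, 5],    # dominated by one enterprise
    "B": [100, 90, 80, 70],   # evenly spread
    "C": [900],               # single contributor
}

primary_suppressions = sorted(
    name for name, contribs in cells.items() if is_sensitive_p_percent(contribs)
)
```

Cells flagged this way are only the primary suppressions; additional (secondary) suppressions are then needed so that the flagged values cannot be recalculated from the table margins, which is the optimisation step handled by dedicated software.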

Task 2-2: Case studies on perturbative methods

Task 2-2a will introduce the new stochastic perturbation methods and demonstrate them for both stages (1 and 2) using the data sets of task 2-1, comparing the results with cell suppression and with the rounding/perturbation method, respectively.
Task 2-2b will evaluate the results of task 2-2a and recommend a framework for the development of perturbation-based SDC methods.

Task 2-3: Case studies on census tables

Task 2-3a will analyse a number of test Census hypercubes and protect these tables according to different rules and methods (in particular cell suppression and rounding) available in τ-ARGUS.
Task 2-3b will evaluate the results of task 2-3a and give recommendations on how to protect the harmonised EU Census tables of the member states with minimum information loss, using state-of-the-art SDC methods while respecting the different legal frameworks in Europe.
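
To make the comparison of rules and methods concrete, the sketch below applies two simple protections to a toy frequency table: suppression of small non-zero counts and deterministic rounding to a fixed base. It then computes a crude information-loss figure for each. The threshold and base of 3, the loss measures and all names are assumptions chosen for illustration; the sketch ignores secondary suppression and does not reproduce the procedures available in τ-ARGUS.

```python
def suppress_small(count, threshold=3):
    """Return None (suppressed) for non-zero counts below the threshold."""
    return None if 0 < count < threshold else count


def round_to_base(count, base=3):
    """Round a count to the nearest multiple of base (ties round up)."""
    return base * ((count + base // 2) // base)


# Hypothetical one-dimensional slice of a census hypercube.
table = {"region_1": 1, "region_2": 7, "region_3": 0, "region_4": 14, "region_5": 2}

suppressed = {cell: suppress_small(n) for cell, n in table.items()}
rounded = {cell: round_to_base(n) for cell, n in table.items()}

# Crude information-loss measures: number of cells withheld,
# and total absolute change introduced by rounding.
n_suppressed = sum(1 for value in suppressed.values() if value is None)
rounding_loss = sum(abs(rounded[cell] - n) for cell, n in table.items())
```

Here suppression withholds two cells entirely, while rounding publishes every cell but changes four of the five counts by one unit each; which outcome counts as less information loss depends on the measure agreed for the evaluation.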

D2.1 – SBS report (month 10)
D2.2 – FATS report (month 10)
D2.3 – Report on harmonisation of tabular SDC methods based on cell suppression (month 12)
D2.4 – Report on a framework for the development of perturbation-based SDC methods (month 12)
D2.5 – Report on protected Census tables (month 7)
D2.6 – Report on the evaluation of protection strategies and recommendations for harmonised protection measures for European Census hypercubes (month 12)

Task 3. Future directions of the software tools

The aim of this work package is to describe the future directions of the development of SDC tools, for both microdata and tabular data. The following issues, listed in arbitrary order, have to be addressed:

  1. Required functionality
  2. User friendliness
  3. Integration into the production process
  4. Sustainability
  5. Documentation of current versions
  6. Maintenance/Governance model
  7. Platforms
  8. Metadata
  9. Data file formats
  10. LP-solvers

Task 4. Dissemination towards the ESS


D4.1 – The Wiki will become operational during the project
D4.2 – Presentations of Task 1 and Task 2 results at the relevant Task Force meetings
D4.3 – Presentation of the architecture document (Task 3) at the Working Group on Statistical Confidentiality (WGSC) or another group
D4.4 – Presentation at the NTTS Conference on the SBS case
D4.5 – Presentation at the joint Eurostat/UNECE conference on microdata harmonisation

Task 5. Project management