PRIVACY IN STATISTICAL DATABASES 2024

Antibes Juan-les-Pins, France. September 25-27, 2024

PROGRAMME

Please, upload your slides to the room laptop before your session.

Wednesday, 25 September

08h30 Registration
09h15 Opening Session. Melek Önen and Josep Domingo-Ferrer
Session: Disclosure risk assessment I. Session Chair: Marieke de Vries
09:20"The Statbarn: A New Model for Output Statistical Disclosure Control", Elizabeth Green, Felix Ritchie and Paul White
09:45"Attribute Disclosure Risk in Smart Meter Data: A Position Paper", Guillermo Navarro-Arribas and Vicenç Torra
10:10"An Examination of the Alleged Privacy Threats of Confidence-Ranked Reconstruction of Census Microdata", David Sanchez, Najeeb Jebreel, Krish Muralidhar, Josep Domingo-Ferrer and Alberto Blanco-Justicia
10:35BREAK
Session: Disclosure risk assessment II. Session Chair: Matthias Templ
10:55"Synthetic Data: Comparing Utility and Risk in Microdata and Tables", Simon Xi Ning Kolb, Jui Andreas Tang and Sarah Giessing
11:20"Synthetic Data Outliers: Navigating Identity Disclosure", Carolina Trindade, Luís Antunes, Tânia Carvalho and Nuno Moniz
11:45"Privacy Risk from Synthetic Data: Practical Proposals", Gillian Raab
12:10LUNCH BREAK
Session: Privacy models and concepts. Session Chair: Peter-Paul de Wolf
14:00"From Isolation to Identification", Giuseppe D'Acquisto, Aloni Cohen, Maurizio Naldi and Kobbi Nissim
14:25"Differentially Private Quantile Regression", Tran Tran, Matthew Reimherr and Aleksandra Slavkovic
14:50"Utility Analysis of Differentially Private Anonymized Data Based on Random Sampling", Takumi Sugiyama, Hiroto Osugi, Io Yamanaka and Kazuhiro Minami
15:15"Privacy- & Utility-Preserving Data Releases over Fragmented Data Using Differential Privacy via Individual Ranking", Luis Del Vasto Terrientes, Sergio Martinez and David Sanchez
15:30BREAK
Session: Statistical table protection. Session Chair: Sarah Giessing
15:50"Secondary Cell Suppression by Gaussian Elimination: An Algorithm Suitable for Handling Issues with Zeros and Singletons", Øyvind Langsrud
16:15"Obtaining (ɛ,δ)-Differential Privacy Guarantees When Using the Poisson Distribution to Synthesize Tabular Data", James Jackson, Robin Mitra, Brian Francis and Iain Dove
Session: Microdata protection. Session Chair: Anna Oganian
16:40"Asymptotic Utility of Spectral Anonymization", Katariina Perkonoja and Joni Virta
17:05"Robin Hood: A De-identification Method to Preserve Minority Representation for Disparities Research", James Thomas Brown, Ellen Clayton, Michael Matheny, Murat Kantarcioglu, Yevgeniy Vorobeychik and Bradley Malin
17:30"An Optimization Approach to Privacy Preserving Dynamic Data Publishing", Jordi Castro, Claudio Gentile and Adrian Tobar-Nicolau *

Thursday, 26 September

Session: Synthetic data generation methods I. Session Chair: Paul Francis
09:00"The Production of Bespoke Synthetic Teaching Datasets without Access to the Original Data", Mark Elliot, Claire Little and Richard Allmendinger
09:25"Evaluating the Pseudo Likelihood Approach for Synthesizing Surveys under Informative Sampling", Anna Oganian, Joerg Drechsler and Mehtab Iqbal
09:50"Hidden Power of Quasi-multinomial Sampling: Utility Analysis and Bias Correction", Hajime Ono and Nobuaki Hoshino *
10:05"Evaluation of Synthetic Data Quality Using the Quantile at Risk", Michel Béra, Vasiliki Daskalaki, Spiros Kolovos, Antonis Spinakis, Konstantinos Spinakis and Photis Stavropoulos *
10:20"A Cautionary Reflection on (Pseudo-)Synthetic Data from Deep Learning on Personal Data", Fabio Ricciato *
10:35BREAK
Session: Synthetic data generation methods II. Session Chair: Bradley Malin
10:55"Generating Synthetic Data Is Complicated: Know Your Data and Know Your Generator", Jonathan Latner, Marcel Neunhoeffer and Jörg Drechsler
11:20"Developing Synthetic Microdata through Machine Learning for the Annual Business Survey", Jorge Cisneros, Audrey Kindlon, Timothy Wojan, Matthew Williams, Jennifer Ozawa, Christine Task, Damon Streat and Heather Madray *
Session: Synthetic data generation software. Session Chair: Gillian Raab
11:35"A Comparison of SynDiffix Multi-table versus Single-table Synthetic Data", Paul Francis
12:00"An Evaluation of Synthetic Data Generators Implemented in the Python Library Synthcity", Emma Fössing and Jörg Drechsler
12:25"Evaluation of Synthetic Data Generators on Complex Tabular Data", Oscar Thees, Jiří Novák and Matthias Templ
12:50LUNCH BREAK
Session: Case studies. Session Chair: Jörg Drechsler
14:45"A Case Study Exploring Data Synthesis Strategies on Tabular vs. Aggregated Data Sources for Official Statistics", Mohamed Aghaddar, Liu Nuo Su, Manel Slokom and Peter-Paul de Wolf
15:10"Relational Or Single: A Comparative Analysis of Data Synthesis Approaches for Privacy and Utility on a Use Case from Statistical Office", Shruti Agrawal, Manel Slokom, Nynke C. Krol and Peter-Paul de Wolf
15:35"Escalation of Commitment: A Case Study of the United States Census Bureau Efforts to Implement Differential Privacy for the 2020 Decennial Census", Krish Muralidhar and Steven Ruggles
16:00"Applications of Statistical Disclosure Control Methods to Protect the Confidentiality in Agricultural Census Microdata", Andrzej Młodak and Tomasz Józefowski *
16:45 Guide visit to Antibes Old Town
19:30 Gala Dinner at restaurant Le Café de la Plage

Friday, 27 September

Session: Spatial and georeferenced data. Session Chair: Krish Muralidhar
09:30"Masking Georeferenced Health Data - An Analysis Taking the Example of Partially Synthetic Data on Sleep Disorder", Simon Cremer, Lydia Jehmlich and Rainer Lenz
09:55"Privacy and Disclosure Risks in Spatial Dynamic Microsimulations", Hanna Brenzel, Martin Palm, Jan Weymeirsch and Ralf Münnich
Session: Machine Learning and privacy I. Session Chair: Jordi Castro
10:20"Combinations of AI Models and XAI Metrics Vulnerable to Record Reconstruction Risk", Ryotaro Toma and Hiroaki Kikuchi
10:45"DISCOLEAF: Personalized DIScretization of COntinuous Attributes for LEArning with Federated Decision Trees", Saloni Kwatra and Vicenc Torra
11:10BREAK
Session: Machine Learning and privacy II. Session Chair: Sébastien Gambs
11:30"Node Injection Link Stealing Attack", Oualid Zari, Javier Parra-Arnau, Ayşe Ünsal and Melek Önen
11:55"Assessing the Potentials of LLMs and GANs as State-of-the-art Tabular Synthetic Data Generation Methods", Marko Miletic and Murat Sariyar
12:20"Active Learning for Human Annotation of Privacy-Preserved Synthetic Data", Osamu Saisho, Takayuki Miura, Kazuki Iwahana, Masanobu Kii and Rina Okada *
12:35"Improving Utility in a DP-Fied ML Algorithm with a Metaheuristics-Based Privacy Budget Allocation Strategy", Marianne Abi Kanaan, Jean-François Couchot, Talar Atechian and Rony Darazi *
12:50 Closing remarks. Melek Önen and Josep Domingo-Ferrer

Important Dates

  • Submission deadline MAY 19, 2024
    MAY 26, 2024
  • Acceptance notification June 21, 2024
  • Proceedings version due June 30, 2024
  • USB-only submission deadline June 30, 2024
  • USB-only acceptance notification July 11, 2024
  • USB-only proceedings version due July 17, 2024

Previous editions