Privacy-by-design GEN-RWD Sandbox for Distributed Multicentric Data Analysis in Healthcare: The Proxy Module

Leonardo Nucciarelli, Benedetta Gottardelli, Andrada Mihaela Tudor, Mauro Vallati, Erica Tavazzi, Stefania Orini, Roberto Gatta, Giovanni Arcuri, Andrea Damiani

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Modern data analysis techniques are transforming the healthcare landscape from both research and operational perspectives. Applying statistical analysis, machine learning, data science, and process mining to large databases enables data analysts and clinicians to enhance the quality and precision of their research and care efforts. However, the ingestion and maintenance of high-quality datasets present significant challenges, primarily due to the resource-intensive nature of the task and the difficulty of collecting comprehensive and statistically significant information sets. Multicentric cross-organizational studies, which involve the analysis of datasets collected from various independent data nodes, address this issue; however, concerns regarding privacy and data ownership often hinder the free exchange of data. Among the existing technologies, Distributed Analytics and Federated Learning offer promising solutions to these challenges by facilitating the analysis of large, decentralized datasets while safeguarding patient privacy.In this paper, we present and release as open-source the code of the Proxy Module within the GEN-RWD Sandbox platform, an infrastructure designed for privacy-preserving distributed analytics in healthcare. The module implements essential infrastructural management functions to ensure privacy in a distributed learning environment. A detailed explanation of the module functioning within the platform and test results are provided. The code is available at https://github.com/leonucciarelli/gsproxy.git.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE 13th International Conference on Healthcare Informatics
Subtitle of host publicationICHI 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages471-477
Number of pages7
ISBN (Electronic)9798331520946
ISBN (Print)9798331520953
DOIs
Publication statusPublished - 22 Jul 2025
Event13th IEEE International Conference on Healthcare Informatics - Rende, Italy
Duration: 18 Jun 202521 Jun 2025
https://events.dimes.unical.it/ichi2025/

Publication series

NameProceedings of International Conference on Healthcare Informatics
PublisherIEEE
ISSN (Print)2575-2626
ISSN (Electronic)2575-2634

Conference

Conference13th IEEE International Conference on Healthcare Informatics
Abbreviated titleICHI 2025
Country/TerritoryItaly
CityRende
Period18/06/2521/06/25
Internet address

Fingerprint

Dive into the research topics of 'Privacy-by-design GEN-RWD Sandbox for Distributed Multicentric Data Analysis in Healthcare: The Proxy Module'. Together they form a unique fingerprint.

Cite this