2022
DOI: 10.21203/rs.3.rs-2015205/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Mining Multi-Center Heterogeneous Medical Data with Distributed Synthetic Learning

Abstract: Overcoming barriers of multi-center data analysis is challenging due to privacy protection and data heterogeneity in the healthcare system. In this study, we propose Distributed Synthetic Learning (DSL) architecture to learn across multi-medical centers without leaking sensitive personal information. DSL emphasizes the building of a homogeneous data center with entirely synthetic medical images via a form of GAN-based synthetic learning. In particular, DSL architecture is extensible with three key variances:… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 52 publications
0
2
0
Order By: Relevance
“…The clinical practice of TVTCS has benefited not only from individual imaging technical progress, but also from fusion technology from multiple imaging modes, including computational modeling and mobile detection modes [ 77 ]. Overcoming the barriers stemming from data heterogeneity among the sorts of detection instruments to establish the common standard for clinicians is a clinical challenge [ 78 ]. A realistic stereoscopically anatomical model from the visible imaging datasets permits the developments of custom-tailored procedure strategies.…”
Section: Commentsmentioning
confidence: 99%
“…The clinical practice of TVTCS has benefited not only from individual imaging technical progress, but also from fusion technology from multiple imaging modes, including computational modeling and mobile detection modes [ 77 ]. Overcoming the barriers stemming from data heterogeneity among the sorts of detection instruments to establish the common standard for clinicians is a clinical challenge [ 78 ]. A realistic stereoscopically anatomical model from the visible imaging datasets permits the developments of custom-tailored procedure strategies.…”
Section: Commentsmentioning
confidence: 99%
“…The acquisition of massive medical data is extremely expensive and time-consuming, especially in fields requiring high-precision equipment, such as MRI imaging, and long-term patient tracking, such as oncology [7] and neurodegenerative diseases [8]. In addition, the large-scale pre-training medical data are typically collected from multi-center into a centralized institution, which significantly increases the risk of exposing patient privacy as they have access to a rich set of personal patient information and routine data anonymization can not guarantee data privacy protection [9][10][11]. Given these challenges, it is critical to develop a new pre-training paradigm with high data efficiency for building medical foundation models from limited real-world pre-training datasets, which can effectively mitigate the issues of data scarcity, extensive resource requirements, and privacy concerns that currently impede the development of medical foundation models.…”
mentioning
confidence: 99%