2022
DOI: 10.1038/s41746-022-00615-8
|View full text |Cite
|
Sign up to set email alerts
|

Distributed learning for heterogeneous clinical data with application to integrating COVID-19 data across 230 sites

Abstract: Integrating real-world data (RWD) from several clinical sites offers great opportunities to improve estimation with a more general population compared to analyses based on a single clinical site. However, sharing patient-level data across sites is practically challenging due to concerns about maintaining patient privacy. We develop a distributed algorithm to integrate heterogeneous RWD from multiple clinical sites without sharing patient-level data. The proposed distributed conditional logistic regression (dCL… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(3 citation statements)
references
References 46 publications
(32 reference statements)
0
3
0
Order By: Relevance
“… 20) OMOP CDM can be a solution for standardizing the feature extraction process while assuring feature quality. 21) ATLAS can improve code production, and a pre-established OMOP CDM network, such as FeederNet, can solve the problem of network connection, which is another major problem of FL. 20) …”
Section: Patient-level Predictionmentioning
confidence: 99%
“… 20) OMOP CDM can be a solution for standardizing the feature extraction process while assuring feature quality. 21) ATLAS can improve code production, and a pre-established OMOP CDM network, such as FeederNet, can solve the problem of network connection, which is another major problem of FL. 20) …”
Section: Patient-level Predictionmentioning
confidence: 99%
“…Further work in semi- and weakly-supervised deep learning methods is necessary. [143,144] Moreover, given the privacy constraints associated with EHRs and other health data sources, leveraging interoperable and multimodal data calls for advancements in federated learning methods that can accommodate distributed data sources stored locally across institutions [145]…”
Section: Discussionmentioning
confidence: 99%
“…Fortunately, multiple rs-fMRI datasets from different sites provide the possibility for enhancing the quality of learned BNs. However, multi-site data are often subject to inter-site heterogeneity and data-sharing policies [Li et al, 2020, Tong et al, 2022. How to learn more helpful heterogeneous information from multi-site data with considering data-sharing policies is still one of the research hotspots of BN learning.…”
Section: Introductionmentioning
confidence: 99%