2019
DOI: 10.1093/jamia/ocz199
|View full text |Cite
|
Sign up to set email alerts
|

Learning from electronic health records across multiple sites: A communication-efficient and privacy-preserving distributed algorithm

Abstract: Objectives We propose a one-shot, privacy-preserving distributed algorithm to perform logistic regression (ODAL) across multiple clinical sites. Materials and Methods ODAL effectively utilizes the information from the local site (where the patient-level data are accessible) and incorporates the first-order (ODAL1) and second-order (ODAL2) gradients of the likelihood function from other sites to construct an estimator without … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
52
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 76 publications
(55 citation statements)
references
References 16 publications
1
52
0
Order By: Relevance
“…The consistent findings from our analysis suggested the ODAL method can be used to identify clinically relevant risk factors for OUD using distributed real-world data. Moreover, due to the low event rates of OUD in the clinical sites, the estimates provided by the ODAL algorithm outperform those of meta-analysis, which is consistent with the findings in Duan et al 20 . ODAL may be especially valuable for studying rare outcomes or exposures in a multicenter analysis.…”
Section: Figuresupporting
confidence: 89%
See 3 more Smart Citations
“…The consistent findings from our analysis suggested the ODAL method can be used to identify clinically relevant risk factors for OUD using distributed real-world data. Moreover, due to the low event rates of OUD in the clinical sites, the estimates provided by the ODAL algorithm outperform those of meta-analysis, which is consistent with the findings in Duan et al 20 . ODAL may be especially valuable for studying rare outcomes or exposures in a multicenter analysis.…”
Section: Figuresupporting
confidence: 89%
“…In multicenter studies, sharing data is a major challenge due to privacy concerns 16 . To circumvent the issue of sharing individual patient-level data, many distributed algorithms have been developed to jointly study multiple datasets by communicating only summary-level statistics [17][18][19][20] . Among them, Duan et al [19][20] proposed a privacy-preserving Oneshot Distributed Algorithm for Logistic regression (ODAL), which can be used to identify risk factors of a binary healthcare outcome of interest.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…A radically different concept in privacy preserving analysis is distributed computation. A University of Pennsylvania-led team describes two performant algorithms [99,100], differentiated by resource requirements and performance, which analyze data behind their home firewalls and aggregate statistical results, mainly regression models. Their particular success lies in controlling for data source heterogeneity and maintaining high faithfulness to gold standard (i.e., analysis of aggregated data).…”
Section: Privacy: Deidentification Distributed Computation Blockchainmentioning
confidence: 99%