2020
DOI: 10.1093/jamiaopen/ooaa060
|View full text |Cite
|
Sign up to set email alerts
|

Spot the difference: comparing results of analyses from real patient data and synthetic derivatives

Abstract: Background Synthetic data may provide a solution to researchers who wish to generate and share data in support of precision healthcare. Recent advances in data synthesis enable the creation and analysis of synthetic derivatives as if they were the original data; this process has significant advantages over data deidentification. Objectives To assess a big-data platform with data-synthesizing capabilities (MDClone Ltd., Beer S… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
45
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8
1
1

Relationship

3
7

Authors

Journals

citations
Cited by 38 publications
(45 citation statements)
references
References 15 publications
0
45
0
Order By: Relevance
“…There have been limited replications of clinical studies using synthetic data, with only a handful of examples in the context of observational research 42 43 and larger clinical trial data. 44 The current study adds to this body of work and contributes to the evidence base for enabling more access to clinical trial data through synthesis.…”
Section: Introductionmentioning
confidence: 99%
“…There have been limited replications of clinical studies using synthetic data, with only a handful of examples in the context of observational research 42 43 and larger clinical trial data. 44 The current study adds to this body of work and contributes to the evidence base for enabling more access to clinical trial data through synthesis.…”
Section: Introductionmentioning
confidence: 99%
“…The synthetic data generation platform creates a computationally derived data set which is statistically identical to that of the original patients. The computationally-derived variables and their pairwise correlations had the same or very similar distributions as the relationships among variables in the original data ( 20 ). We included a Spearman's correlation comparison between the variables in the original compared to the variables derived from the MDClone synthetic data platform ( Supplementary Figure 1 ).…”
Section: Methodsmentioning
confidence: 96%
“…In this study, we utilized electronic health record (EHR) data from a large academic liver transplant center. Our institution partnered with MDClone [ 18 , 19 ] (Beer Sheva, Israel) for the data storage and retrieval. MDClone platform is a data engine by storing EHR medical events in a time order for each patient.…”
Section: Methodsmentioning
confidence: 99%