2023
DOI: 10.48550/arxiv.2303.01230
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

What Is Synthetic Data? The Good, The Bad, and The Ugly

Abstract: Sharing data can often enable compelling applications and analytics. However, more often than not, valuable datasets contain information of sensitive nature, and thus sharing them can endanger the privacy of users and organizations. A possible alternative gaining momentum in the research community is to share synthetic data instead. The idea is to release artificially generated datasets that resemble the actual data -more precisely, having similar statistical properties.So how do you generate synthetic data? W… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 9 publications
(9 reference statements)
0
1
0
Order By: Relevance
“…One of the major drawbacks of these synthetic data generation techniques is from the context of usable privacy [15]. It is difficult to predict what information a synthetic dataset will preserve and what information will be perturbed and to which extent.…”
Section: Introductionmentioning
confidence: 99%
“…One of the major drawbacks of these synthetic data generation techniques is from the context of usable privacy [15]. It is difficult to predict what information a synthetic dataset will preserve and what information will be perturbed and to which extent.…”
Section: Introductionmentioning
confidence: 99%