2022
DOI: 10.48550/arxiv.2201.05400
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Synthesising Electronic Health Records: Cystic Fibrosis Patient Group

Abstract: Class imbalance can often degrade predictive performance of supervised learning algorithms. Balanced classes can be obtained by oversampling exact copies, with noise, or interpolation between nearest neighbours (as in traditional SMOTE methods). Oversampling using augmentation, as is typical in computer vision tasks, can be achieved with deep generative models. Deep generative models are effective data synthesisers due to their ability to capture complex underlying distributions. Synthetic data in healthcare c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
0
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 19 publications
0
0
0
Order By: Relevance
“…In a recent paper by Muller et al [11], the authors take a research approach similar to this paper, examining the applicability of synthetic EHR created by GANs for the prediction of the patient outcome. However, they mainly focus on the value of the artificially created data through GANs for solving the problem of class imbalance.…”
Section: Synthesizing Electronic Health Records: Cystic Fibrosis Pati...mentioning
confidence: 99%
See 3 more Smart Citations
“…In a recent paper by Muller et al [11], the authors take a research approach similar to this paper, examining the applicability of synthetic EHR created by GANs for the prediction of the patient outcome. However, they mainly focus on the value of the artificially created data through GANs for solving the problem of class imbalance.…”
Section: Synthesizing Electronic Health Records: Cystic Fibrosis Pati...mentioning
confidence: 99%
“…Thus, similar to Section 2.4, a smaller data set is used here to address the problem of data scarcity in the healthcare domain and, thus, also answering the raised research question of Fernandes [12] (see Section 2.3). Furthermore, to the authors' knowledge, only the very recent work of Muller et al [11] examines how to overcome the problem of imbalance through the application of GANs in the context of a binary classification task. This is extended in this paper by examining this issue regarding a multi-class classification problem.…”
Section: Comparison To the Underlying Workmentioning
confidence: 99%
See 2 more Smart Citations