2023
DOI: 10.1109/jbhi.2022.3196697
|View full text |Cite
|
Sign up to set email alerts
|

Synthetic Patient Data Generation and Evaluation in Disease Prediction Using Small and Imbalanced Datasets

Abstract: The increasing prevalence of chronic noncommunicable diseases makes it a priority to develop tools for enhancing their management. On this matter, Artificial Intelligence algorithms have proven to be successful in early diagnosis, prediction and analysis in the medical field. Nonetheless, two main issues arise when dealing with medical data: lack of high-fidelity datasets and maintenance of patient's privacy. To face these problems, different techniques of synthetic data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 49 publications
0
2
0
Order By: Relevance
“…This approach is in line with the AI field pivoting away from the common perception that it is almost synonymous with ‘big data’ [ 36 ] to a focus on small data to deliver valuable biological insights. This could be achieved using synthetic data [ 37 ], artificially generated data that imitate the characteristics and patterns of real-world data without containing actual information from individual observations. Such data are produced using algorithms to simulate the statistical properties, distributions and relationships present in ‘authentic’ clinical datasets.…”
Section: Using Artificial Intelligence To Drive a Precision Medicine ...mentioning
confidence: 99%
“…This approach is in line with the AI field pivoting away from the common perception that it is almost synonymous with ‘big data’ [ 36 ] to a focus on small data to deliver valuable biological insights. This could be achieved using synthetic data [ 37 ], artificially generated data that imitate the characteristics and patterns of real-world data without containing actual information from individual observations. Such data are produced using algorithms to simulate the statistical properties, distributions and relationships present in ‘authentic’ clinical datasets.…”
Section: Using Artificial Intelligence To Drive a Precision Medicine ...mentioning
confidence: 99%
“…In the domain of medical data analysis, this study provided a framework by utilizing algorithms for generating synthetic data on eight different tabular medical datasets [37]. With an eye on ensuring the fidelity of the synthetic data, the research used valid statistical metrics to measure the integrity of the data.…”
Section: Healthcarementioning
confidence: 99%
“…Synthetic data generation is an emerging field that needs to be studied further but has shown promising results in the medical field. Using a framework based on synthetic data generation combined with different machine learning classifiers, Rodriguez-Almeida et al [128] tested 8 medical tabular data sets and assessed the feasibility of using synthetic data to preserve data integrity and maintain classification performance using AI algorithms in the medical domain.…”
Section: Ai Implementation In Mhealthmentioning
confidence: 99%