2022
DOI: 10.3390/electronics11050812
|View full text |Cite
|
Sign up to set email alerts
|

Incorporation of Synthetic Data Generation Techniques within a Controlled Data Processing Workflow in the Health and Wellbeing Domain

Abstract: To date, the use of synthetic data generation techniques in the health and wellbeing domain has been mainly limited to research activities. Although several open source and commercial packages have been released, they have been oriented to generating synthetic data as a standalone data preparation process and not integrated into a broader analysis or experiment testing workflow. In this context, the VITALISE project is working to harmonize Living Lab research and data capture protocols and to provide controlle… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 18 publications
(21 citation statements)
references
References 33 publications
0
18
0
Order By: Relevance
“…In terms of privacy, the use of Wilcoxon signed-rank tests, the analysis of re-identification risks and computation of similarity to real data will be considered. Finally, it is intended to incorporate the proposed approaches in the VITALISE controlled data processing workflow presented by Hernandez et al [4]. This workflow enables…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In terms of privacy, the use of Wilcoxon signed-rank tests, the analysis of re-identification risks and computation of similarity to real data will be considered. Finally, it is intended to incorporate the proposed approaches in the VITALISE controlled data processing workflow presented by Hernandez et al [4]. This workflow enables…”
Section: Discussionmentioning
confidence: 99%
“…In this context, a workflow to make LL data accessible for external researchers by generating synthetic data (SD) has been proposed by Hernandez et al [4]. As explained in the proposed workflow, SD is created with a clear purpose, enabling researchers to develop algorithms and analyses locally using it.…”
Section: Introductionmentioning
confidence: 99%
“…Synthetic data generation is also available as part of other privacy preserving frameworks like OpenSAFELY [ 15 ] and other packages [ 16 – 18 ]. A version of synthpop for DataSHIELD is offered in [ 16 ] but the simstudy method is not available, nor are the detailed workflows for generating synthetic data.…”
Section: Discussionmentioning
confidence: 99%
“…The SDV generates synthetic data by applying mathematical techniques and machine learning models, such as the deep learning model. Several studies have embraced the use of synthetic data, which can be applied for real-world applications [ 21 ]. An additional 900 new rows were generated for testing.…”
Section: Methodsmentioning
confidence: 99%