2023
DOI: 10.1093/jamiaopen/ooad052
|View full text |Cite
|
Sign up to set email alerts
|

A novel method to create realistic synthetic medication data

Abstract: Objective Synthea is a synthetic patient generator that creates synthetic medical records, including medication profiles. Prior to our work, Synthea produced unrealistic medication data that did not accurately reflect prescribing patterns. This project aimed to create an open-source synthetic medication database that could integrate with Synthea to create realistic patient medication profiles. Materials and Methods The Medica… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 9 publications
0
1
0
Order By: Relevance
“…Early efforts have demonstrated that even though the generated text was of poorer quality relative to the original text, when it was used to augment the training data, it still boosted the performance of downstream NLP tasks ( 46 ). Domain-specific databases can be used to mitigate the inherently stochastic nature of large language models ( 47 ) in an attempt to improve the accuracy, diversity and complexity of generated clinical data ( 48 ). Nonetheless, the data annotation bottleneck still persists but may be addressed with strategic prompt engineering.…”
Section: Discussionmentioning
confidence: 99%
“…Early efforts have demonstrated that even though the generated text was of poorer quality relative to the original text, when it was used to augment the training data, it still boosted the performance of downstream NLP tasks ( 46 ). Domain-specific databases can be used to mitigate the inherently stochastic nature of large language models ( 47 ) in an attempt to improve the accuracy, diversity and complexity of generated clinical data ( 48 ). Nonetheless, the data annotation bottleneck still persists but may be addressed with strategic prompt engineering.…”
Section: Discussionmentioning
confidence: 99%