2021
DOI: 10.1093/jamia/ocab112
|View full text |Cite
|
Sign up to set email alerts
|

Are synthetic clinical notes useful for real natural language processing tasks: A case study on clinical entity recognition

Abstract: Objective : Developing clinical natural language processing systems often requires access to many clinical documents, which are not widely available to the public due to privacy and security concerns. To address this challenge, we propose to develop methods to generate synthetic clinical notes and evaluate their utility in real clinical natural language processing tasks. Materials and Methods : We implemented 4 state-of-the-a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(16 citation statements)
references
References 22 publications
0
16
0
Order By: Relevance
“…To address this challenge, Li et al proposed to develop methods to generate synthetic clinical notes and evaluate their utility in real clinical natural language processing tasks. Thanks to the development of deep learning, recent advances in text generation have made it possible to generate synthetic clinical notes that could be useful for training NER models for information extraction from natural clinical notes, thus lowering the privacy concern and increasing data availability [ 48 ].…”
Section: Tasks and Methodsmentioning
confidence: 99%
“…To address this challenge, Li et al proposed to develop methods to generate synthetic clinical notes and evaluate their utility in real clinical natural language processing tasks. Thanks to the development of deep learning, recent advances in text generation have made it possible to generate synthetic clinical notes that could be useful for training NER models for information extraction from natural clinical notes, thus lowering the privacy concern and increasing data availability [ 48 ].…”
Section: Tasks and Methodsmentioning
confidence: 99%
“…language generation. Current applications are towards creating synthetic clinical notes to increase data size 6 , generating summaries of medical trials 7 , and generating simplified clinical notes for patients 8 .…”
Section: Accepted Manuscriptmentioning
confidence: 99%
“…The first one is the challenge of developing a benchmarking standard for synthetic text ( 28 , 29 ). As yet, in the clinical and mental health domains the utility of synthetic text has been evaluated for only some types of AI, such as automated diagnoses prediction ( 21 ) or named entity recognition ( 7 ). There are no best established practices or systematic criteria on how to assess synthetic data.…”
Section: Ethical Challengesmentioning
confidence: 99%
“…Resolving, or at least significantly assuaging privacy concerns will ease this data sparsity constraint. Relevant research investigates how to make this synthetic data statistically-relevant, as well as useful for algorithmic analysis and training ( 7 , 8 ). Such data show potential to improve data accessibility, model performance and eventually boost scientific progress.…”
Section: Introductionmentioning
confidence: 99%