The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
Proceedings of the 2nd Clinical Natural Language Processing Workshop 2019
DOI: 10.18653/v1/w19-1905
|View full text |Cite
|
Sign up to set email alerts
|

Towards Automatic Generation of Shareable Synthetic Clinical Notes Using Neural Language Models

Abstract: Large-scale clinical data is invaluable to driving many computational scientific advances today. However, understandable concerns regarding patient privacy hinder the open dissemination of such data and give rise to suboptimal siloed research. De-identification methods attempt to address these concerns but were shown to be susceptible to adversarial attacks. In this work, we focus on the vast amounts of unstructured natural language data stored in clinical notes and propose to automatically generate synthetic … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
28
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(28 citation statements)
references
References 34 publications
(41 reference statements)
0
28
0
Order By: Relevance
“…We compare two state-of-the-art language modeling approaches for the generation of synthetic EHR notes: a Long Short-Term Memory (LSTM) network [7] and a transformerbased network (GPT-2) [8]. To train these language models, we use a large and heterogeneous corpus of one million Dutch EHR notes.…”
Section: [<Namestart> Maria <Nameend>]mentioning
confidence: 99%
See 3 more Smart Citations
“…We compare two state-of-the-art language modeling approaches for the generation of synthetic EHR notes: a Long Short-Term Memory (LSTM) network [7] and a transformerbased network (GPT-2) [8]. To train these language models, we use a large and heterogeneous corpus of one million Dutch EHR notes.…”
Section: [<Namestart> Maria <Nameend>]mentioning
confidence: 99%
“…The generation of synthetic EHR text for use in medical NLP is still at an early stage [3]. Most studies focus on the creation of English EHR text, using hospital discharge summaries from the MIMIC-III database [7,8,13,14]. In addition, a corpus of English Mental Health Records was explored [15].…”
Section: Generating Synthetic Ehr Notesmentioning
confidence: 99%
See 2 more Smart Citations
“…Medical text generation has been a hot topic recently, such as electronic medical record (EMR) generation (Guan et al, 2018), medical question generation , clinical notes generation (Melamud and Shivade, 2019), etc. However, compared to the research in the general domain, there is still a lot of space for exploration, especially with the assistance of specific knowledge graph.…”
Section: Introductionmentioning
confidence: 99%