The World Wide Web Conference 2019
DOI: 10.1145/3308558.3313630

Evaluating Neural Text Simplification in the Medical Domain

Abstract: Health literacy, i.e., the ability to read and understand medical text, is a relevant component of public health. Unfortunately, many medical texts are hard for the general population to grasp, as they are targeted at highly skilled professionals and use complex language and domain-specific terms. Here, automatic text simplification, which makes text commonly understandable, would be very beneficial. However, research and development into medical text simplification is hindered by the lack of openly available training a…

Cited by 43 publications (30 citation statements)
References 27 publications
“…Deléger and Zweigenbaum (2008) detect paraphrases from comparable medical corpora of specialized and lay texts, and Kloehn et al (2018) explore UMLS (Bodenreider, 2004) and WordNet (Miller, 2009) with word embedding techniques. Furthermore, Van den Bercken et al (2019) directly align sentences from medical terminological articles in Wikipedia and Simple Wikipedia 2 , which confines the editors' vocabulary to only 850 basic English words. Then, they refine these aligned sentences by experts towards automatic evaluation.…”
Section: Text Simplification (mentioning)
Confidence: 99%
“…The resulting medical corpus has 3.3k sentence pairs. This corpus is larger than previously generated corpora (by over 1k sentence pairs) and has stricter quality control (Van den Bercken et al, 2019). Our corpus requires a medical sentence to contain 4 or more medical words and belong to medical titles as compared to the no title requirement and needing to contain only 1 medical word, as described in Van den Bercken et al (2019).…”
Section: Simple (mentioning)
Confidence: 99%
“…The final medical parallel corpus has 3.3k aligned sentence pairs 1 . Van den Bercken et al (2019) also created a parallel medical corpus by filtering sentence pairs from Wikipedia. Our corpus is significantly larger (45% larger; 2,267 pairs vs. 3,300 pairs) and uses a stricter criteria for identifying sentences: they only required a single word match in the text itself (not the title) and used a lower similarity threshold of 0.75 (vs. our approach of 0.85).…”
Section: Medical Parallel English Wikipedia Corpus Creation (mentioning)
Confidence: 99%
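The citation statement above describes corpus creation by aligning Wikipedia and Simple Wikipedia sentences and keeping only pairs whose similarity clears a threshold (0.75 in Van den Bercken et al., 0.85 in the citing work). As a rough illustration of that filtering step, here is a minimal sketch using bag-of-words cosine similarity; the actual papers likely use stronger, embedding-based similarity measures, and all function names here are hypothetical.

```python
from collections import Counter
import math

def cosine(a: str, b: str) -> float:
    """Bag-of-words cosine similarity between two sentences (illustrative only)."""
    ca, cb = Counter(a.lower().split()), Counter(b.lower().split())
    num = sum(ca[w] * cb[w] for w in ca)
    den = math.sqrt(sum(v * v for v in ca.values())) * math.sqrt(sum(v * v for v in cb.values()))
    return num / den if den else 0.0

def align_pairs(complex_sents, simple_sents, threshold=0.85):
    """Pair each complex sentence with its most similar simple sentence,
    keeping the pair only if similarity meets the threshold."""
    pairs = []
    for c in complex_sents:
        best = max(simple_sents, key=lambda s: cosine(c, s))
        if cosine(c, best) >= threshold:
            pairs.append((c, best))
    return pairs
```

Raising the threshold (0.85 vs. 0.75) trades corpus size for pair quality, which is the design difference the quoted statement highlights.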
“…While machine translation-based approaches have not yet been proposed for translating eprescription directions, prior works such as (Yolchuyeva et al, 2018;Shardlow and Nawaz, 2019;Van den Bercken et al, 2019) have suggested solving machine translation tasks without the need for explicitly-defined rules. Neural machine translation (NMT) models have been shown to be able to learn contextual rules automatically from large corpora and produce higher quality translations (Bahdanau et al, 2014;Wu et al, 2016b;Lee et al, 2017).…”
Section: Related Work (mentioning)
Confidence: 99%