Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2020
DOI: 10.18653/v1/2020.acl-main.410
Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset

Abstract: Machine reading comprehension has made great progress in recent years owing to large-scale annotated datasets. In the clinical domain, however, creating such datasets is quite difficult due to the domain expertise required for annotation. Recently, Pampari et al. (2018) tackled this issue by using expert-annotated question templates and existing i2b2 annotations to create emrQA, the first large-scale dataset for question answering (QA) based on clinical notes. In this paper, we provide an in-depth analysis of …

Cited by 26 publications (43 citation statements)
References 38 publications
“…and relations-related questions, as Yue et al. 75 found that the two subsets are more consistent. We utilized both F1-score and exact match score for evaluation.…”
Section: Acknowledgments
Mentioning confidence: 94%
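The F1 and exact match scores named in this statement are the standard SQuAD-style extractive QA metrics, which are also the ones conventionally reported for emrQA. A minimal sketch of both follows; the normalization steps (lowercasing, stripping punctuation and articles) are common conventions assumed here, not details taken from the citing paper.

```python
# Minimal sketch of SQuAD-style exact match and token-level F1 for
# extractive QA. Normalization choices below are illustrative assumptions.
import re
import string
from collections import Counter

def normalize(text: str) -> str:
    """Lowercase, strip punctuation and articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match(prediction: str, gold: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(gold))

def f1_score(prediction: str, gold: str) -> float:
    """Token-overlap F1 between a predicted and a gold answer span."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    common = Counter(pred_tokens) & Counter(gold_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)
```

Exact match is binary per example, while token-level F1 gives partial credit for overlapping answer spans, which is why both are typically reported together.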
“…SeaReader is proposed to answer questions in clinical medicine using documents extracted from publications in the medical domain. Yue et al. (2020) conduct a thorough analysis of the emrQA dataset (Pampari et al., 2018) and explore the ability of QA systems to utilize clinical domain knowledge and to generalize to unseen questions. Jin et al. (2019) introduce PubMedQA, where questions are derived from article titles and can be answered with their respective abstracts.…”
Section: Related Work
Mentioning confidence: 99%
“…Some recent works have tried to construct medical MRC datasets, such as PubMedQA (Jin et al., 2019), emrQA (Pampari et al., 2018) and HEAD-QA (Vilares and Gómez-Rodríguez, 2019). However, these datasets are either noisy (e.g., because they were generated semi-automatically or via heuristic rules) or too small in annotated scale (Yoon et al., 2019; Yue et al., 2020). Instead, we construct a large-scale medical MRC dataset by collecting 21.7k multiple-choice problems with human-annotated answers from the National Licensed Pharmacist Examination in China.…”
Section: Introduction
Mentioning confidence: 99%
“…To prevent the model from overfitting to specific cases and to encourage it to learn general language patterns, one possible way is to enlarge the training data (Yang et al., 2019). However, clinical texts are usually difficult to obtain, not to mention the tremendous expert effort required for annotation (Yue et al., 2020). To solve this, we introduce our data augmentation method PHICON, which consists of PHI augmentation and Context augmentation.…”
Section: Introduction
Mentioning confidence: 99%
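The statement above names PHICON's two components, PHI augmentation and Context augmentation. As a rough illustration of the PHI-augmentation half only, the sketch below swaps each labeled PHI span in a clinical sentence for a surrogate of the same category while keeping the token labels aligned; the SURROGATES lexicon and the augment_phi helper are hypothetical stand-ins for this sketch, not the authors' code.

```python
# Hedged sketch of PHI-style augmentation for clinical NER/de-identification
# training data: replace each PHI span with a sampled surrogate of the same
# category, preserving the label sequence. Lexicons here are toy examples.
import random

SURROGATES = {
    "NAME": ["John Smith", "Maria Garcia", "Wei Chen"],
    "DATE": ["01/12/2019", "March 3, 2020"],
    "HOSPITAL": ["St. Mary Medical Center", "Lakeside General Hospital"],
}

def augment_phi(tokens, labels):
    """Replace each contiguous PHI span with a surrogate of the same type.

    tokens: list of words; labels: parallel list of PHI categories or "O".
    Returns a new (tokens, labels) pair with surrogates substituted.
    """
    out_tokens, out_labels = [], []
    i = 0
    while i < len(tokens):
        label = labels[i]
        if label in SURROGATES:
            # Consume the whole contiguous span carrying this PHI label.
            j = i
            while j < len(tokens) and labels[j] == label:
                j += 1
            surrogate = random.choice(SURROGATES[label]).split()
            out_tokens.extend(surrogate)
            out_labels.extend([label] * len(surrogate))
            i = j
        else:
            out_tokens.append(tokens[i])
            out_labels.append(label)
            i += 1
    return out_tokens, out_labels
```

Context augmentation, the other half of the method, would analogously perturb the non-PHI context tokens (e.g., via synonym replacement) while leaving the PHI spans untouched.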