CliCR: a Dataset of Clinical Case Reports for Machine Reading Comprehension

Šuster, Simon; Daelemans, Walter

doi:10.18653/v1/n18-1140

Cited by 70 publications

(60 citation statements)

References 43 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The performances of the baselines rand-entity and maxfreq-entity presented in [7] are very poor because a random entity and the most frequent entity in the passage are used as answers, respectively. The lang-model method performs poor because it is based on queries only, without reading the document, it is difficult to provide accurate answers.…”

Section: Results Analysismentioning

confidence: 99%

“…These tasks have attracted some researchers to carry out various researches, and have played dramatic roles in promoting researches in the clinical medical field [6]. And some related data sets have been proposed, such as CliCR [7], PubMedQA [8], Chimed [2] and emrQA [3] etc. Besides, the clinical field has accumulated extensive experience and knowledge, some of which have been uploaded to PubMed, one of the literature databases in the biomedical field, and has nearly 2 million publications with case types [9,10].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

A Gated Dilated Convolution with Attention Model for Clinical Cloze-Style Reading Comprehension

Wang

Zhang

Zhou

et al. 2020

IJERPH

View full text Add to dashboard Cite

The machine comprehension research of clinical medicine has great potential value in practical application, but it has not received sufficient attention and many existing models are very time consuming for the cloze-style machine reading comprehension. In this paper, we study the cloze-style machine reading comprehension in the clinical medical field and propose a Gated Dilated Convolution with Attention (GDCA) model, which consists of a gated dilated convolution module and an attention mechanism. Our model has high parallelism and is capable of capturing long-distance dependencies. On the CliCR data set, our model surpasses the present best model on several metrics and obtains state-of-the-art result, and the training speed is 8 times faster than that of the best model.

show abstract

Section: Results Analysismentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

A Gated Dilated Convolution with Attention Model for Clinical Cloze-Style Reading Comprehension

Wang

Zhang

Zhou

et al. 2020

IJERPH

View full text Add to dashboard Cite

show abstract

“…As questions are not proposed directly from documents, this task is challenging and some information extraction methods fail to deal with it. This methodology of creating MRC datasets enlightens lots of other researches [77,52,69]. In order to avoid that questions can be answered by knowledge out of the documents, all entities in documents are anonymized by random markers.…”

Section: -Cnn and Daily Mailmentioning

confidence: 99%

“…-CliCR To address the problem that there are scarce datasets for specific domains, Suster et al [77] build a large-scale cloze-style dataset based on clinical case reports for healthcare and medicine. Similar to the CNN & Daily Mail, summary points of each case reports are used to create queries by blanking out a medical entity.…”

Section: Ms Marco[51]mentioning

confidence: 99%

Neural Machine Reading Comprehension: Methods and Trends

et al. 2019

View full text Add to dashboard Cite

Machine Reading Comprehension (MRC), which requires the machine to answer questions based on the given context, has gained increasingly wide attention with the incorporation of various deep learning techniques over the past few years. Although the research of MRC based on deep learning is flourishing, there remains a lack of a comprehensive survey to summarize existing approaches and recent trends, which motivates our work presented in this article. Specifically, we give a thorough review of this research field, covering different aspects including (1) typical MRC tasks: their definitions, differences and representative datasets; (2) general architecture of neural MRC: the main modules and prevalent approaches to each of them; and (3) new trends: some emerging focuses in neural MRC as well as the corresponding challenges. Last but not least, in retrospect of what has been achieved so far, the survey also envisages what the future may hold by discussing the open issues left to be addressed. * Work in progress.

show abstract

“…• We improve Japanese PAS analysis by combining the PAS-QA and RC-QA datasets. (Welbl et al, 2017;Suster and Daelemans, 2018;Pampari et al, 2018).…”

Section: Introductionmentioning

confidence: 99%

Machine Comprehension Improves Domain-Specific Japanese Predicate-Argument Structure Analysis

Takahashi¹,

Shibata²,

Kawahara³

et al. 2019

Proceedings of the 2nd Workshop on Machine Reading for Question Answering

View full text Add to dashboard Cite

To improve the accuracy of predicateargument structure (PAS) analysis, large-scale training data and knowledge for PAS analysis are indispensable. We focus on a specific domain, specifically Japanese blogs on driving, and construct two wide-coverage datasets as a form of QA using crowdsourcing: a PAS-QA dataset and a reading comprehension QA (RC-QA) dataset. We train a machine comprehension (MC) model based on these datasets to perform PAS analysis. Our experiments show that a stepwise training method is the most effective, which pre-trains an MC model based on the RC-QA dataset to acquire domain knowledge and then fine-tunes based on the PAS-QA dataset.

show abstract

CliCR: a Dataset of Clinical Case Reports for Machine Reading Comprehension

Cited by 70 publications

References 43 publications

A Gated Dilated Convolution with Attention Model for Clinical Cloze-Style Reading Comprehension

A Gated Dilated Convolution with Attention Model for Clinical Cloze-Style Reading Comprehension

Neural Machine Reading Comprehension: Methods and Trends

Machine Comprehension Improves Domain-Specific Japanese Predicate-Argument Structure Analysis

Contact Info

Product

Resources

About