2021
DOI: 10.1101/2021.04.26.21256038
Preprint

Ensemble of deep masked language models for effective named entity recognition in multi-domain corpora

Abstract: The health and life science domains are well known for their wealth of entities. These entities are presented as free text in large corpora, such as biomedical scientific and electronic health records. To enable the secondary use of these corpora and unlock their value, named entity recognition (NER) methods are proposed. Inspired by the success of deep masked language models, we present an ensemble approach for NER using these models. Results show statistically significant improvement of the ensemble models o…

Citations: cited by 3 publications (3 citation statements)
References: 43 publications (51 reference statements)

“…The majority of research work cited above was proposed for text written in English or Chinese. Few studies were proposed on French corpora [47,48,22,3,49]. [47] proposed a rule-based system for medication.…”
Section: Statistical Clinical NER Methods Have Been Widely Used
type: mentioning
confidence: 99%
“…These three research works used private clinical annotated dataset. [22] and [49] used a publicly available dataset, provided in the context of DEFT 2020 [50] and that consists of a collection of French clinical cases. [22] proposed two models: a layered Bi-LSTM-CRF model combined with the language model CamemBERT [51], a French version of BERT and a Greedy NER model.…”
Section: Statistical Clinical NER Methods Have Been Widely Used
type: mentioning
confidence: 99%