Ensemble of Deep Masked Language Models for Effective Named Entity Recognition in Health and Life Science Corpora

Naderi, Nona; Knafou, Julien; Copara, Jenny; Ruch, Patrick; Teodoro, Douglas

doi:10.3389/frma.2021.689803

Cited by 7 publications

(5 citation statements)

References 60 publications

(75 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Thus, it is unclear how the proposed methodology will generalize to corpora and categories used in other reviews and living evidence knowledge bases. That said, given the strong performance obtained in other corpus types by a similar methodology (31), we believe that it shall generalize well. Second, in our experiments, we fail to explore the full contents of the articles.…”

Section: Discussionsupporting

confidence: 53%

“…Then, at inference time, the classifiers were applied to individual records to predict the publication category as output. Two ensemble strategies were created using these predictions (29,31). The first strategy uses a voting system that takes each classifier output as a vote for a class, while the second considers the sum of the class probabilities attributed by the individual classifiers.…”

Section: Methodsmentioning

confidence: 99%

“…In this article, we investigated the use of automatic text classifiers supported by deep learning-based language models to enhance literature triage and annotation in COVID-19 living systematic review systems. Our analysis assessed the effectiveness of different individual deep learning-based language classifiers against two ensemble strategies, in which individual models are combined using either the probability sum of the predictions or a voting strategy where each classifier has a voting right and the classification decision is given to the class obtaining a majority of votes (29–31).…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature

Knafou

Haas²,

Borissov

et al. 2023

Preprint

Self Cite

View full text Add to dashboard Cite

Background: The COVID-19 pandemic has led to an unprecedented amount of scientific publications, growing at a pace never seen before. Multiple living systematic reviews have been developed to assist professionals with up-to-date and trustworthy health information, but it is increasingly challenging for systematic reviewers to keep up with the evidence in electronic databases. We aimed to investigate deep learning-based machine learning algorithms to classify COVID-19 related publications to help scale-up the epidemiological curation process. Methods: In this retrospective study, five different pre-trained deep learning-based language models were fine-tuned on a dataset of 6,365 publications manually classified into two classes, three subclasses and 22 sub-subclasses relevant for epidemiological triage purposes. In a k-fold cross-validation setting, each standalone model was assessed on a classification task and compared against an ensemble, which takes the standalone model predictions as input and uses different strategies to infer the optimal article class. A ranking task was also considered, in which the model outputs a ranked list of sub-subclasses associated with the article. Results: The ensemble model significantly outperformed the standalone classifiers, achieving a F1-score of 89.2 at the class level of the classification task. The difference between the standalone and ensemble models increases at the sub-subclass level, where the ensemble reaches a micro F1-score of 70% against 67% for the best performing standalone model. For the ranking task, the ensemble obtained the highest recall@3, with a performance of 89%. Using an unanimity voting rule, the ensemble can provide predictions with higher confidence on a subset of the data, achieving detection of original papers with a F1-score up to 97% on a subset of 80% of the collection instead of 93% on the whole dataset. Conclusion: This study shows the potential of using deep learning language models to perform triage of COVID-19 references efficiently and support epidemiological curation and review. The ensemble consistently and significantly outperforms any standalone model. Fine-tuning the voting strategy thresholds is an interesting alternative to annotate a subset with higher predictive confidence.

show abstract

Section: Discussionsupporting

confidence: 53%

Section: Methodsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature

Knafou

Haas²,

Borissov

et al. 2023

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…Furthermore, IberLEF 2022 [17] and BioASQ 2022 [116] released their datasets translated into seven other languages, encouraging future contributions to multilingual medical NLP. For the French language, the CAS corpus [27] was used in DEFT [53,[120][121][122][123][124], an annual French-speaking text-mining challenge. The 2020 edition of DEFT involved the automatic annotation of 13 different medical entity types, while the 2021 edition proposed to identify the patient's clinical profile through multilabel classification of diseases using the Medical Subject Headings (MeSH) thesaurus.…”

Section: Shared Tasksmentioning

confidence: 99%

Exploring the Latest Highlights in Medical Natural Language Processing across Multiple Languages: A Survey

Shaitarova,

Zaghir,

Lavelli

et al. 2023

Yearb Med Inform

View full text Add to dashboard Cite

Objectives: This survey aims to provide an overview of the current state of biomedical and clinical Natural Language Processing (NLP) research and practice in Languages other than English (LoE). We pay special attention to data resources, language models, and popular NLP downstream tasks. Methods: We explore the literature on clinical and biomedical NLP from the years 2020-2022, focusing on the challenges of multilinguality and LoE. We query online databases and manually select relevant publications. We also use recent NLP review papers to identify the possible information lacunae. Results: Our work confirms the recent trend towards the use of transformer-based language models for a variety of NLP tasks in medical domains. In addition, there has been an increase in the availability of annotated datasets for clinical NLP in LoE, particularly in European languages such as Spanish, German and French. Common NLP tasks addressed in medical NLP research in LoE include information extraction, named entity recognition, normalization, linking, and negation detection. However, there is still a need for the development of annotated datasets and models specifically tailored to the unique characteristics and challenges of medical text in some of these languages, especially low-resources ones. Lastly, this survey highlights the progress of medical NLP in LoE, and helps at identifying opportunities for future research and development in this field.

show abstract

“…Having trained multiple NER models, we use an ensemble strategy based on a majority vote to assign the predictions (Copara et al, 2020b,a;Knafou et al, 2020;Naderi et al, 2021). More in detail, for a given sentence S, three NER models infer their predictions independently.…”

Section: Ensemble Of the Ner Modelsmentioning

confidence: 99%

DS4DH at SemEval-2022 Task 11: Multilingual Named Entity Recognition Using an Ensemble of Transformer-based Language Models

Rouhizadeh¹,

Teodoro²

2022

Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)

Self Cite

View full text Add to dashboard Cite

In this paper, we describe our proposed method for the SemEval 2022 Task 11: Multilingual Complex Named Entity Recognition (Multi-CoNER). The goal of this task is to locate and classify named entities in unstructured short complex texts in 11 different languages. After training a variety of contextual language models on the NER dataset, we used an ensemble strategy based on a majority vote to finalize our model. We evaluated our proposed approach on the multilingual NER dataset at SemEval-2022. The ensemble model provided consistent improvements against the individual models on the multilingual track, achieving a macro F1 performance of 65.2%. However, our results were significantly outperformed by the top ranking systems, achieving thus a baseline performance.

show abstract

Ensemble of Deep Masked Language Models for Effective Named Entity Recognition in Health and Life Science Corpora

Cited by 7 publications

References 60 publications

Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature

Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature

Exploring the Latest Highlights in Medical Natural Language Processing across Multiple Languages: A Survey

DS4DH at SemEval-2022 Task 11: Multilingual Named Entity Recognition Using an Ensemble of Transformer-based Language Models

Contact Info

Product

Resources

About