Terminologies augmented recurrent neural network model for clinical named entity recognition

Lerner, Ivan; Paris, Nicolas; Tannier, Xavier

doi:10.1016/j.jbi.2019.103356

Cited by 29 publications

(22 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Moreover, we took the condition of the intake, and not the reason for the intake, into consideration (which is more specific), and we added a tag regarding the class name; therefore, overall F-measures cannot be compared. Compared with results from a study [33] using a different French-language corpus that obtained a token-level F-measure of 90.4, our system's raw results were higher. Comparisons should be made with caution because the corpus used in [33], though in the same language, was from a different source and contained only 147 documents.…”

Section: Related Workcontrasting

confidence: 62%

“…Compared with results from a study [33] using a different French-language corpus that obtained a token-level F-measure of 90.4, our system's raw results were higher. Comparisons should be made with caution because the corpus used in [33], though in the same language, was from a different source and contained only 147 documents.…”

Section: Related Workcontrasting

confidence: 62%

“…These approaches were designed for text written in English. To the best of our knowledge, there are only a few studies [32,33] on French corpora: Deleger et al [32] used a rule-based system, and Lerner et al [33] developed a hybrid system that associated expert rules using terminology and bidirectional gated recurrent units with a conditional random field.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Hybrid Deep Learning for Medication-Related Information Extraction From Clinical Texts in French: MedExt Algorithm Development Study

Jouffroy¹,

Feldman²,

Lerner³

et al. 2021

JMIR Med Inform

Self Cite

View full text Add to dashboard Cite

Background Information related to patient medication is crucial for health care; however, up to 80% of the information resides solely in unstructured text. Manual extraction is difficult and time-consuming, and there is not a lot of research on natural language processing extracting medical information from unstructured text from French corpora. Objective We aimed to develop a system to extract medication-related information from clinical text written in French. Methods We developed a hybrid system combining an expert rule–based system, contextual word embedding (embedding for language model) trained on clinical notes, and a deep recurrent neural network (bidirectional long short term memory–conditional random field). The task consisted of extracting drug mentions and their related information (eg, dosage, frequency, duration, route, condition). We manually annotated 320 clinical notes from a French clinical data warehouse to train and evaluate the model. We compared the performance of our approach to those of standard approaches: rule-based or machine learning only and classic word embeddings. We evaluated the models using token-level recall, precision, and F-measure. Results The overall F-measure was 89.9% (precision 90.8; recall: 89.2) when combining expert rules and contextualized embeddings, compared to 88.1% (precision 89.5; recall 87.2) without expert rules or contextualized embeddings. The F-measures for each category were 95.3% for medication name, 64.4% for drug class mentions, 95.3% for dosage, 92.2% for frequency, 78.8% for duration, and 62.2% for condition of the intake. Conclusions Associating expert rules, deep contextualized embedding, and deep neural networks improved medication information extraction. Our results revealed a synergy when associating expert knowledge and latent knowledge.

show abstract

Section: Related Workcontrasting

confidence: 62%

Section: Related Workcontrasting

confidence: 62%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Hybrid Deep Learning for Medication-Related Information Extraction From Clinical Texts in French: MedExt Algorithm Development Study

Jouffroy¹,

Feldman²,

Lerner³

et al. 2021

JMIR Med Inform

Self Cite

View full text Add to dashboard Cite

show abstract

“…French was the third language (seven mentions) as in the work by Lerner et al 6 , followed by three other European languages with less than five mentions: German, Italian 7 , and Spanish 8 ;…”

Section: Principal Findingsmentioning

confidence: 84%

“…Yet, we can consider that the papers that did not explicitly indicate the language should also be dedicated to the processing of data in English ; • Chinese became the second language processed in medical NLP papers with 17 mentions. Among the papers published in 2019, we can mention Guan et al [3] working on the generation of synthetic medical record texts, Chen et al [4] aiming at identifying named entities, and Zheng et al [5] interested by the detection of medical text similarity ; • French was the third language (seven mentions) as in the work by Lerner et al [6], followed by three other European languages with less than five mentions: German, Italian [7], and Spanish [8] ; • Other languages identified in the abstracts accounted for one or two papers and included both languages spoken by millions of people (Arabic, Portuguese, Russian) and languages spoken by small communities (Basque, Danish, Japanese, Korean, Lithuanian, Persian, Romanian, Turkish, and Urdu).…”

Section: The Languages Addressedmentioning

confidence: 99%

A Year of Papers Using Biomedical Texts:

Grouin

Grabar

2020

Yearb Med Inform

View full text Add to dashboard Cite

Objectives: Analyze papers published in 2019 within the medical natural language processing (NLP) domain in order to select the best works of the field. Methods: We performed an automatic and manual pre-selection of papers to be reviewed and finally selected the best NLP papers of the year. We also propose an analysis of the content of NLP publications in 2019. Results: Three best papers have been selected this year including the generation of synthetic record texts in Chinese, a method to identify contradictions in the literature, and the BioBERT word representation. Conclusions: The year 2019 was very rich and various NLP issues and topics were addressed by research teams. This shows the will and capacity of researchers to move towards robust and reproducible results. Researchers also prove to be creative in addressing original issues with relevant approaches.

show abstract

Enhanced conditional random field‐long short‐term memory for name entity recognition in English texts

Bhumireddypalli¹,

Koppula²,

Koppula³

2023

Concurrency and Computation

View full text Add to dashboard Cite

Named Entity recognition (NER) is the essential topic in the real world during the advanced development of technologies. Hence, in this paper, to develop Enhanced Conditional Random Field-Long Short-Term Memory (ECRF-LSTM) for NER in English language. The proposed ECRF-LSTM is combination of Conditional Random Field-Long Short-Term Memory (ECRF-LSTM) and Arithmetic Optimization Algorithm (AOA). This proposed method is utilizing to NER from the English texts. The proposed method is working with three phases such as preprocessing phase, feature extraction phase, and NER phase. Initially, the datasets are collected from the online system. In the pre-processing phase, removal of URL, removal of special symbol, username removal, tokenization and stop word removal are done. After that, the essential features such as domain weight, event weight, textual similarity, spatial similarity, temporal similarity, and Relative Document-Term Frequency Difference (RDTFD) are extracted and then applied for training the proposed model. To empower the training phase of CRF-LSTM method, AOA is utilized to select optimal weight parameter coefficients of CRF-LSTM for training the model parameters. The proposed method is validated by statistical measurements and compared with the conventional methods such as Convolutional Neural Network-Particle Swarm Optimization (CNN-PSO) and Convolutional Neural Network (CNN) respectively.

show abstract

Terminologies augmented recurrent neural network model for clinical named entity recognition

Cited by 29 publications

References 17 publications

Hybrid Deep Learning for Medication-Related Information Extraction From Clinical Texts in French: MedExt Algorithm Development Study

Hybrid Deep Learning for Medication-Related Information Extraction From Clinical Texts in French: MedExt Algorithm Development Study

A Year of Papers Using Biomedical Texts:

Enhanced conditional random field‐long short‐term memory for name entity recognition in English texts

Contact Info

Product

Resources

About