Abstract: Motivation: A perennial challenge for biomedical researchers and clinical practitioners is to stay abreast of the rapid growth of publications and medical notes. Natural language processing (NLP) has emerged as a promising direction for taming information overload. In particular, large neural language models facilitate transfer learning by pretraining on unlabeled text, as exemplified by the successes of BERT models in various NLP applications. However, fine-tuning such models for an end task remains challenging…
“…We therefore introduce a tokenizer training stage that generates a new vocabulary file by combining the biomedical and clinical corpora used to train and fine-tune the final model. The main difference between our approach and most BERT variants is that other LMs technically follow a continued-training approach, whereby the source model is fine-tuned on a specific domain corpus [13], whereas our model adopts an approach similar to that of Tinn et al. [8] by including a dedicated tokenizer in the process.…”
Section: • We Propose a Modeling Optimization Technique Using A…
confidence: 99%
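The dedicated tokenizer stage quoted above amounts to learning a fresh vocabulary from the combined biomedical and clinical corpora rather than reusing a general-domain one. A minimal sketch in plain Python follows; the corpus strings, vocabulary size, and frequency-based word selection are illustrative assumptions standing in for full WordPiece training:

```python
from collections import Counter

def build_vocab(corpora, vocab_size=30522,
                specials=("[PAD]", "[UNK]", "[CLS]", "[SEP]", "[MASK]")):
    """Build one vocabulary file from combined biomedical + clinical corpora.

    Whitespace word counting is a deliberate simplification of WordPiece
    subword training; the BERT special tokens are pinned to the front.
    """
    counts = Counter()
    for text in corpora:
        counts.update(text.lower().split())
    # Keep the most frequent words after reserving slots for special tokens.
    keep = vocab_size - len(specials)
    return list(specials) + [w for w, _ in counts.most_common(keep)]

# Toy stand-ins for the biomedical (PubMed-style) and clinical (MIMIC-style) corpora.
biomedical = ["metformin reduces hepatic glucose production"]
clinical = ["patient denies chest pain and dyspnea"]
vocab = build_vocab(biomedical + clinical, vocab_size=16)
print(vocab)
```

In the combined-corpus setting the point is simply that domain terms ("metformin", "dyspnea") enter the vocabulary directly instead of being fragmented into general-domain subwords.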
“…• training epochs i = [5 : 100]
• per-device training batch size β = [4, 8, 12, 16, 32, 64]
By running 200 trials for each search, the resulting customized hyperparameters boosted our performance by an average of 3.11% in F1 score across all downstream tasks.…”
Section: A. Fine-tuning Hyperparameters
confidence: 99%
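The search quoted above, with epochs drawn from [5, 100] and per-device batch sizes from {4, 8, 12, 16, 32, 64} over 200 trials, can be sketched as a simple random search. The objective below is a toy stand-in (an assumption for illustration); in the actual setup each trial would fine-tune the model and report dev-set F1:

```python
import random

def random_search(objective, n_trials=200, seed=0):
    """Random search over the fine-tuning grid quoted above:
    epochs i in [5, 100], per-device batch size beta in {4, 8, 12, 16, 32, 64}."""
    rng = random.Random(seed)
    best = {"f1": -1.0, "epochs": None, "batch_size": None}
    for _ in range(n_trials):
        epochs = rng.randint(5, 100)
        batch_size = rng.choice([4, 8, 12, 16, 32, 64])
        f1 = objective(epochs, batch_size)
        if f1 > best["f1"]:
            best = {"f1": f1, "epochs": epochs, "batch_size": batch_size}
    return best

def toy_objective(epochs, batch_size):
    # Hypothetical smooth surrogate: F1 improves with training up to ~40 epochs
    # and degrades mildly away from a batch size of 16. Not real results.
    return 0.80 + 0.001 * min(epochs, 40) - 0.0005 * abs(batch_size - 16)

best = random_search(toy_objective)
print(best)
```

A dedicated tuner (e.g., Optuna-style successive trials with pruning) would typically replace the plain loop, but the trial budget and search space match the quoted setup.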
“…EHRs contain vast amounts of structured and unstructured data, which can be used not only to fine-tune predictive algorithms and drug compatibility tests but also to help understand the course of diseases and patient histories. Researchers have proposed a variety of NLP-based models to better analyze biomedical documents [4], [5], [7], [8]. With medical texts representing 80% of the EHR data [1], it is imperative to develop more robust and efficient language models (LMs), which can then be used to better understand and extract the relevant information contained in those texts.…”
<p>For this research, we propose a biomedical language model trained on publicly available biomedical datasets from Kaggle, the PubMed abstracts baseline 2019, and MIMIC-III.</p>
“…EHRs contain a tremendous amount of structured and unstructured data, which can be used to fine-tune predictive algorithms, test drug compatibilities, and help to understand the course of diseases and patient histories. Various researchers have proposed adapted NLP models to better process biomedical documents [4], [5], [7], [8]. With medical texts representing 80% of EHR data [1], it is imperative to develop more robust and efficient language models that can be used to understand and extract the relevant information contained in those texts.…”
Section: Introduction
confidence: 99%