Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP 2020
DOI: 10.18653/v1/2020.blackboxnlp-1.31

This is a BERT. Now there are several of them. Can they generalize to novel words?

Abstract: Recently, large-scale pre-trained neural network models such as BERT have achieved many state-of-the-art results in natural language processing. Recent work has explored the linguistic capacities of these models. However, no work has focused on the ability of these models to generalize these capacities to novel words. This type of generalization is exhibited by humans (Berko, 1958), and is intimately related to morphology: humans are in many cases able to identify inflections of novel words in the appropriate c…
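
As an illustration of the kind of novel-word generalization the paper probes, here is a minimal sketch of a Berko-style wug test posed to a masked language model through the Hugging Face transformers fill-mask pipeline. The checkpoint, the nonce words, and the prompt frame are illustrative assumptions rather than the paper's actual setup; note also that a single [MASK] slot can only be filled with tokens already in the model's vocabulary, so a faithful replication would additionally have to handle how novel inflected forms are tokenized.

```python
# Minimal sketch of a Berko-style "wug test" probe with a masked language
# model (illustrative only; not the paper's experimental code).
from transformers import pipeline

# bert-base-uncased is used purely as an example checkpoint.
fill = pipeline("fill-mask", model="bert-base-uncased")

# Hypothetical nonce words in the spirit of Berko (1958).
novel_words = ["wug", "dax", "blick"]

for word in novel_words:
    # Frame the novel noun so the [MASK] slot calls for its plural form.
    prompt = f"This is a {word}. Now there are two of them. There are two [MASK]."
    for candidate in fill(prompt, top_k=5):
        # Each candidate is a dict holding the predicted token and its score.
        print(word, candidate["token_str"], round(candidate["score"], 4))
```

In a full experiment one would compare the probability assigned to the correctly inflected form against distractor forms, rather than only inspecting the top-k predictions.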

Cited by 8 publications (7 citation statements) | References: 19 publications

“…While BERT was created for tasks with an English corpus, mBERT was created from all Wikipedia pages in 104 different languages, making it able to handle tasks in multiple languages [8]. However, this resulted in performance worse than or comparable to that of monolingual models using BERT’s architecture settings [14]. It has been speculated that this is because Wikipedia is not representative of general language use [15].…”
Section: Related Work
confidence: 99%
“…Multilingual models have also demonstrated that syntactic transfer can occur between languages (Guarasci et al., 2022). Meanwhile, Haley (2020) shows that BERT can perform the Wug test, a standard grammatical generalisation test (Berko Gleason, 1958), significantly better than chance in English, French, Spanish and Dutch. However, there are still many gaps in this research.…”
Section: Grammatical Embedding and Indeclinable Nouns
confidence: 99%
“…Pre-trained language models, such as BERT or GPT-3, are either fine-tuned on a new task or are primed on prompts that are either crafted by humans [Haley, 2020, Misra et al., 2020, Petroni et al., 2019] or found automatically [Shin et al., 2020, Jiang et al., 2020]. These prompts (also called stimuli, constraints, or demonstrations) are then dependent only on the specific task at hand and nothing else.…”
Section: Specificity
confidence: 99%
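
To make the contrast between fine-tuning and priming concrete, the sketch below probes a frozen pre-trained model with a single human-crafted prompt in the spirit of Petroni et al. (2019). The checkpoint and the prompt string are illustrative assumptions, not examples taken from the cited works.

```python
# Minimal sketch of priming a frozen masked language model with a
# human-crafted prompt (illustrative; not code from the cited papers).
from transformers import pipeline

# No fine-tuning takes place: the pre-trained weights are used as-is.
fill = pipeline("fill-mask", model="bert-base-uncased")

# The prompt depends only on the task being probed, not on any training data.
prompt = "The capital of France is [MASK]."
for candidate in fill(prompt, top_k=3):
    print(candidate["token_str"], round(candidate["score"], 4))
```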