Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
DOI: 10.18653/v1/2021.findings-acl.266
Probing Pre-Trained Language Models for Disease Knowledge

Abstract: Pre-trained language models such as ClinicalBERT have achieved impressive results on tasks such as medical Natural Language Inference. At first glance, this may suggest that these models are able to perform medical reasoning tasks, such as mapping symptoms to diseases. However, we find that standard benchmarks such as MedNLI contain relatively few examples that require such forms of reasoning. To better understand the medical reasoning capabilities of existing language models, in this paper we introduce DisKn…

Cited by 6 publications (4 citation statements)
References 32 publications (21 reference statements)
“…In NLI, given a hypothesis and a premise, a system must determine whether the premise entails, contradicts or is neutral with respect to the hypothesis. Previous works have shown that supervised models could rely on superficial factors, e.g., in the SNLI dataset (Bowman et al., 2015), hypothesis-only models are surprisingly competitive (Poliak et al., 2018; Gururangan et al., 2018), a trend also observed in medical NLI datasets (Alghanmi et al., 2021).…”
Section: Introduction
confidence: 61%
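The hypothesis-only shortcut described in this snippet can be illustrated with a toy classifier that never sees the premise and still predicts labels from surface cues in the hypothesis alone. This is a minimal sketch with invented example sentences, not the model or data used in the cited work:

```python
# Toy illustration of a hypothesis-only NLI baseline: the classifier sees
# only the hypothesis, never the premise, yet can exploit surface cues
# (e.g., negation words correlating with "contradiction").
# All training examples below are invented for illustration.
from collections import Counter, defaultdict

train = [
    ("The patient is not febrile.", "contradiction"),
    ("The patient never smoked.", "contradiction"),
    ("The patient has a fever.", "entailment"),
    ("The patient is stable.", "entailment"),
    ("The patient may improve.", "neutral"),
    ("The patient might respond.", "neutral"),
]

# Count word-label co-occurrences (a bag-of-words, Naive-Bayes-style score).
word_label = defaultdict(Counter)
label_counts = Counter()
for hypothesis, label in train:
    label_counts[label] += 1
    for word in hypothesis.lower().rstrip(".").split():
        word_label[word][label] += 1

def predict(hypothesis):
    """Score each label by add-one-smoothed word evidence from the hypothesis alone."""
    scores = {}
    for label in label_counts:
        score = label_counts[label]
        for word in hypothesis.lower().rstrip(".").split():
            score *= (word_label[word][label] + 1)
        scores[label] = score
    return max(scores, key=scores.get)

print(predict("The patient is not improving."))  # negation cue -> "contradiction"
```

A model like this achieves non-trivial accuracy without ever reading the premise, which is exactly why hypothesis-only baselines are used as an annotation-artifact diagnostic.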
“…Probing factual knowledge in PLMs. Since it was first proposed in LAMA (Petroni et al., 2019), prompt-based probing has become the main technique for assessing factual knowledge in PLMs (Davison et al., 2019; Bouraoui et al., 2020; Shin et al., 2020; Brown et al., 2020; Alghanmi et al., 2021). Given the knowledge represented in a tuple (subject, relation, object), a query q is formed by filling the subject into a relation-specific template, which is fed into the PLM.…”
Section: Related Work
confidence: 99%
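The template-filling step this snippet describes can be sketched in a few lines: a (subject, relation, object) tuple becomes a cloze query, and the probe then checks how highly the model ranks the gold object for the masked slot. The templates and tuples below are invented for illustration and are not the ones from LAMA or DisKnE:

```python
# Sketch of LAMA-style cloze query construction: the subject is filled into a
# relation-specific template, and the PLM is asked to predict the [MASK] token.
# Relation templates here are hypothetical examples.
TEMPLATES = {
    "has_symptom": "A patient with [X] commonly presents with [MASK].",
    "treated_by":  "[X] is typically treated with [MASK].",
}

def build_query(subject, relation):
    """Fill the subject slot of the relation-specific template."""
    return TEMPLATES[relation].replace("[X]", subject)

query = build_query("influenza", "has_symptom")
print(query)  # "A patient with influenza commonly presents with [MASK]."

# A probe would feed `query` to a masked LM and score whether the gold object
# (e.g., "fever") appears among the top predictions for the [MASK] position.
def rank_of_gold(predictions, gold):
    """1-based position of the gold object in the ranked predictions, or None."""
    return predictions.index(gold) + 1 if gold in predictions else None

print(rank_of_gold(["cough", "fever", "fatigue"], "fever"))  # 2
```

In practice the ranked predictions would come from a masked language model's output distribution over the vocabulary at the [MASK] position; the sketch only shows the query construction and scoring scaffold around it.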
“…Large-scale Pre-trained Language Models (PLMs) have demonstrated powerful capabilities in tasks where factual knowledge plays an important role (Roberts et al., 2020). While most previous work on probing factual knowledge in PLMs has focused on English (Davison et al., 2019; Bouraoui et al., 2020; Shin et al., 2020; Brown et al., 2020; Alghanmi et al., 2021), a few notable studies have extended the evaluation to a number of other languages (Jiang et al., 2020; Kassner et al., 2021; Yin et al., 2022). The results of these studies show a large variation in the extent to which factual knowledge generalizes across languages, revealing yet another facet of language inequality in modern NLP technologies (Hupkes et al., 2022).…”
[Footnote 1: All code and data released at https://github.com/Betswish/Cross-Lingual-Consistency]
Section: Introduction
confidence: 99%
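One simple way to quantify the cross-lingual variation this snippet mentions is to query the same fact in two languages and measure the overlap of the model's top-k predictions. The metric below (plain set overlap over top-k) is an illustrative choice, not necessarily the measure used in the cited study, and the prediction lists are invented:

```python
# Sketch of a simple cross-lingual consistency measure: for the same fact
# queried in two languages, compare the model's top-k predictions (mapped to
# a shared vocabulary) and report their overlap |A ∩ B| / k.
def topk_overlap(preds_a, preds_b, k=3):
    """Fraction of shared answers among each language's top-k predictions."""
    return len(set(preds_a[:k]) & set(preds_b[:k])) / k

en = ["fever", "cough", "fatigue"]     # hypothetical top-3 for an English query
es = ["fever", "headache", "cough"]    # hypothetical top-3 for the Spanish query
print(topk_overlap(en, es))  # 2 shared answers out of 3
```

A score near 1 means the model retrieves the same fact regardless of query language; scores well below 1 indicate the kind of cross-lingual inconsistency the snippet describes.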
“…Prior work, however, has shown that existing biomedical LMs often struggle with such tasks. For instance, Alghanmi et al. [2] found that the standard BERT model was remarkably competitive with specialised biomedical LMs for inferring diagnoses from patient descriptions. Meng et al. [44] furthermore introduced a probing task for evaluating the knowledge captured by biomedical LMs, which also revealed significant issues.…”
Section: Introduction
confidence: 99%