“…And, as several psycholinguistic studies have demonstrated that the IC bias is not only highly reliable but also robust across different languages (Ferstl et al., 2011; Goikoetxea et al., 2008; Hartshorne et al., 2013; Bott and Solstad, 2014), it has become an intriguing domain for testing language models. Earlier studies, including those by Upadhye et al. (2020), Davis and van Schijndel (2020), Kementchedjhieva et al. (2021), and Zarrieß et al. (2022), have examined how well LLMs capture the IC coreference bias. That is, they concentrated on single-word prediction tasks and evaluated the models' ability to generate continuations of classic prompts such as examples (1) and (2), and predominantly found that LLMs display limited ability to systematically incorporate the IC coreference bias in their generations.…”