2022
DOI: 10.1038/s41593-022-01026-4

Shared computational principles for language processing in humans and deep language models

Abstract: Departing from traditional linguistic models, advances in deep learning have resulted in a new type of predictive (autoregressive) deep language models (DLMs). Using a self-supervised next-word prediction task, these models generate appropriate linguistic responses in a given context. In the current study, nine participants listened to a 30-min podcast while their brain responses were recorded using electrocorticography (ECoG). We provide empirical evidence that the human brain and autoregressive DLMs share th…
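The self-supervised next-word prediction task mentioned in the abstract can be illustrated with the minimal sketch below: given a context, an autoregressive DLM outputs a probability distribution over possible next tokens. The use of the public GPT-2 checkpoint via the Hugging Face transformers library, and the example context sentence, are illustrative assumptions and are not details taken from the paper.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Load a small, publicly available autoregressive language model (assumption:
# GPT-2 stands in for the DLMs discussed in the paper).
tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

# An arbitrary context; the model assigns a probability to every possible
# next token given the words seen so far.
context = "The participants listened to a thirty-minute"
inputs = tokenizer(context, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits  # shape: (batch, sequence_length, vocab_size)

# Distribution over the vocabulary for the upcoming token.
next_token_probs = torch.softmax(logits[0, -1], dim=-1)
top_probs, top_ids = next_token_probs.topk(5)
for p, i in zip(top_probs, top_ids):
    print(f"{tokenizer.decode([i.item()])!r:>12}  p = {p.item():.3f}")
```

In the paper's setup, the same kind of next-word probabilities (and the model's internal activations) are compared against neural responses recorded while participants listen to the podcast.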

Cited by 209 publications (295 citation statements). References 63 publications.
“…Given that we were inspired by work in cognitive psychology on human use of explanations (e.g., Ahn et al., 1992; Lombrozo and Carey, 2006), and given the accumulating evidence that language models predict language processing in the human brain to a surprising degree (Goldstein et al., 2022; Schrimpf et al., 2021), it is natural to ask whether there are cognitive implications of our experiments. However, as we noted above, the fact that both language models and humans benefit from explanations does not imply that they necessarily benefit through the same mechanisms.…”
Section: How Do Explanations Relate To Task Instructions? (mentioning)
confidence: 99%
“…Unlike convolutional neural networks, whose architectural design principles are roughly inspired by biological vision [Lindsay, 2021], the design of current neural network language models is largely uninformed by psycholinguistics and neuroscience. And yet, there is an ongoing effort to adopt and adapt neural network language models to serve as computational hypotheses of how humans process language, making use of a variety of different architectures, training corpora, and training tasks [e.g., Wehbe et al., 2014, Toneva and Wehbe, 2019, Heilbron et al., 2020, Jain et al., 2020, Lyu et al., 2021, Schrimpf et al., 2021, Wilcox et al., 2021, Goldstein et al., 2022, Caucheteux and King, 2022]. We found that recurrent neural networks make markedly human-inconsistent predictions once pitted against transformer-based neural networks.…”
Section: Implications For Artificial Neural Network Language Models A... (mentioning)
confidence: 89%
“…Theory of Mind is a central facet of human intelligence [9][10][11][18][19][20]. Inspired by the success of DL in understanding biological vision [1][2][3][4][5] and language processing [6][7][8], a challenge has emerged in recent years to develop DL agents that can mimic aspects of ToM.…”
Section: Discussion (mentioning)
confidence: 99%
“…Rapid advances in deep learning (DL) have led to human-level performance on certain visual recognition and natural language processing tasks. Moreover, research has revealed shared computational principles in humans and DL models for vision [1][2][3][4][5] and language processing [6][7][8]. These findings do not imply that DL has fully captured how these processes operate in the human brain, but DL has definitely contributed to better characterizing the computational principles underlying them.…”
Section: Introduction (mentioning)
confidence: 99%