2020
DOI: 10.31234/osf.io/hs4ra
Preprint

Transformer Networks of Human Conceptual Knowledge

Abstract: We present a computational model capable of simulating aspects of human knowledge for thousands of real-world concepts. Our approach involves fine-tuning a transformer network for natural language understanding on participant-generated feature norms. We show that such a model can successfully extrapolate from its training dataset, and predict human knowledge for novel concepts and features. We also apply our model to stimuli from twenty-three previous experiments in semantic cognition research, and show that i…
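The core idea in the abstract, fine-tuning a pretrained transformer to judge whether a feature applies to a concept, can be illustrated with a toy sketch. The model name, example norms, and hyperparameters below are placeholders rather than the paper's actual setup.

```python
# Hypothetical sketch: fine-tune a transformer to verify concept-feature pairs.
# Toy data and settings are illustrative, not the published feature norms.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

# Toy feature norms: (concept, feature, does the feature apply?)
norms = [
    ("apple", "is a fruit", 1),
    ("apple", "has wings", 0),
    ("robin", "can fly", 1),
    ("robin", "is made of metal", 0),
]

enc = tokenizer(
    [c for c, f, y in norms],
    [f for c, f, y in norms],
    padding=True,
    return_tensors="pt",
)
labels = torch.tensor([y for c, f, y in norms])

optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
model.train()
for _ in range(3):  # a few passes over the toy data
    out = model(**enc, labels=labels)
    out.loss.backward()
    optimizer.step()
    optimizer.zero_grad()

# Query a held-out concept-feature pair.
model.eval()
with torch.no_grad():
    probe = tokenizer("penguin", "can fly", return_tensors="pt")
    probs = torch.softmax(model(**probe).logits, dim=-1)
print(probs)  # [P(feature does not apply), P(feature applies)]
```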

Cited by 9 publications (7 citation statements)
References 70 publications
“…As Figure 2C shows, the GPT-3 similarity ratings are correlated to some extent with human ratings. This is consistent with previous work suggesting that the internal representations of TLMs can be used to make reasonable predictions about human similarity judgments (Bhatia & Richie, 2021). GPT-3 accounts for some superordinate categories better than others, with correlations ranging between 0.16 (fish) and 0.58 (professions).…”
Section: Distinguishing Representation From Reasoning (supporting)
confidence: 91%
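A minimal sketch of the kind of analysis this citation statement describes: deriving word similarities from a transformer's internal representations and correlating them with human ratings. The model choice, word pairs, and human rating values below are illustrative assumptions, not data from the cited studies.

```python
# Illustrative sketch, assuming BERT via the transformers library and a few
# made-up word pairs with placeholder human similarity ratings.
import torch
from scipy.stats import pearsonr
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

def embed(word: str) -> torch.Tensor:
    # Mean-pool the final hidden states over the word's tokens.
    inputs = tokenizer(word, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # (1, seq_len, 768)
    return hidden.mean(dim=1).squeeze(0)

pairs = [("salmon", "trout"), ("salmon", "lawyer"), ("doctor", "nurse")]
human_ratings = [0.90, 0.05, 0.80]  # placeholder values, not real norms

model_sims = [
    torch.cosine_similarity(embed(a), embed(b), dim=0).item() for a, b in pairs
]
r, _ = pearsonr(model_sims, human_ratings)
print(f"Pearson r between model and human similarity: {r:.2f}")
```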
“…For psychologists, property induction is relevant to a literature that assesses TLMs and predecessors such as LSA (Landauer & Dumais, 1997) as computational accounts of the acquisition, use, and representation of semantic knowledge. Recent work has evaluated the extent to which TLMs account for human similarity ratings, typicality ratings, and response times (Bhatia & Richie, 2021;Lake & Murphy, 2021), but there has been relatively little work on inductive reasoning. A notable exception is the work of Misra, Ettinger, and Taylor Rayz (2021), who focus on typicality and include property induction as one of the tasks that they consider.…”
Section: Introduction (mentioning)
confidence: 99%
“…As the first step toward automating analogical mapping, we adopt semantic representations of individual words generated by a machine-learning model, Word2vec (Mikolov et al, 2013). Word2vec and similar models based on distributional semantics, such as Global Vectors (GloVe; Pennington et al, 2014) and Bidirectional Encoder Representations from Transformers (BERT; Devlin et al, 2019), have proved successful in predicting behavioral judgments of lexical similarity or association (Hill et al, 2015; Hofmann et al, 2018; Pereira et al, 2016; Richie & Bhatia, 2021), neural responses to word and relation meanings (Huth et al, 2016; Pereira et al, 2018; Zhang et al, 2020), and high-level inferences including assessments of probability (Bhatia, 2017; Bhatia et al, 2019) and semantic verification (Bhatia & Richie, in press). In the simulations reported here, the semantic meanings of individual concepts are represented by 300-dimensional embeddings created by Word2vec after training on a corpus of articles drawn from Google News.…”
Section: Forming Representations Of Word Meanings and Semantic Relations (mentioning)
confidence: 99%
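For concreteness, here is a small sketch of loading the 300-dimensional Google News Word2vec embeddings mentioned in the quote, using gensim's downloader; the word pair is arbitrary and the analogical-mapping pipeline itself is not reproduced.

```python
# Sketch only: fetch pretrained Word2vec vectors (300-d, Google News corpus)
# and inspect one embedding plus a cosine similarity between two words.
import gensim.downloader as api

wv = api.load("word2vec-google-news-300")  # large download on first use

vec = wv["doctor"]                       # 300-dimensional concept embedding
sim = wv.similarity("doctor", "nurse")   # cosine similarity between word vectors
print(vec.shape, round(float(sim), 3))
```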
“…Common optimization tasks for pretraining transformers, such as the masked LM task (Devlin et al., 2018), are quite similar to the word prediction tasks that are known to predict children's performance on other linguistic skills (Borovsky et al., 2012; Neuman et al., 2011; Gambi et al., 2020). Finally, TLMs tend to outperform other LMs in recent work modeling human reading times, eye-tracking data, and other psychological and psycholinguistic phenomena (Merkx and Frank, 2021; Schrimpf et al., 2020b,a; Hao et al., 2020; Bhatia and Richie, 2020).…”
Section: Related Work (mentioning)
confidence: 98%
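The masked-LM objective referenced in this statement can be illustrated with a fill-mask query; the model and probe sentence below are arbitrary choices, not those used in the cited work.

```python
# Quick illustration of masked word prediction with a pretrained masked LM.
from transformers import pipeline

fill = pipeline("fill-mask", model="bert-base-uncased")
for pred in fill("A robin is a kind of [MASK].", top_k=3):
    print(pred["token_str"], round(pred["score"], 3))
```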