2024
DOI: 10.31234/osf.io/vf3se
Preprint

Pseudo Factor Analysis of Language Embedding Similarity Matrices: New Ways to Model Latent Constructs

Nigel Guenole,
E. Damiano D'Urso,
Andrew Samo
et al.

Abstract: This article builds on recent work using Large Language Models (LLMs) in psychometrics and, in particular, the use of sentence transformer models to generate pseudo-discrimination parameters. Pseudo-discrimination parameters are discrimination estimates that correlate with empirical discrimination parameters without needing empirical data collection. While earlier work looked at pseudo-discrimination on an item-by-construct basis, we introduce and evaluate the use of pseudo-factor analysis. Pseudo-factor analysis…
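The workflow the abstract describes can be illustrated with a short sketch: embed item text with a sentence transformer, build an item-by-item cosine similarity matrix, and factor-analyse that matrix in place of an empirical correlation matrix. This is an assumed illustration rather than the authors' implementation; the model name, example items, factor settings, and the sentence-transformers and factor_analyzer packages are choices made here, not taken from the preprint.

# Minimal sketch of pseudo-factor analysis of a language embedding similarity
# matrix. Assumptions: the all-MiniLM-L6-v2 model, the example items, and the
# 2-factor oblimin solution are illustrative, not taken from the preprint.
import numpy as np
from sentence_transformers import SentenceTransformer
from factor_analyzer import FactorAnalyzer

# Hypothetical items written to tap two constructs (sociability, orderliness).
items = [
    "I enjoy meeting new people.",
    "I feel energised at large social gatherings.",
    "I start conversations with strangers easily.",
    "I keep my belongings neat and organised.",
    "I follow a schedule.",
    "I finish tasks well before their deadlines.",
]

# Embed the item text and build an item-by-item cosine similarity matrix.
model = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = model.encode(items)
unit = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
similarity = unit @ unit.T  # treated here as a pseudo-correlation matrix

# Factor-analyse the similarity matrix instead of an empirical correlation matrix.
fa = FactorAnalyzer(n_factors=2, rotation="oblimin", is_corr_matrix=True)
fa.fit(similarity)
print(fa.loadings_)  # pseudo-loadings, analogous to pseudo-discrimination parameters

The printed loadings play the role of the pseudo-discrimination parameters the abstract refers to; with a real item pool the number of factors and rotation would be chosen to match the intended measurement model.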

Cited by 1 publication (1 citation statement). References 21 publications (28 reference statements).
“…Pre-transformer era attempts to use semantic features of items to predict associations between measurement scales using latent semantic analysis have demonstrated moderate utility (Arnulf et al., 2014; Larsen & Bong, 2016; Rosenbusch et al., 2020). As the ability of computerised language models to capture meaning has grown, researchers have sought to directly quantify relationships between adjectives from textual data (Cutler & Condon, 2022), to assign items to constructs (Fyffe et al., 2024; Guenole et al., 2024), to directly predict item responses (Abdurahman et al., 2024; Argyle et al., 2023), and to quantify open-ended answers to questions (Kjell et al., 2019, 2024). … used large language models (LLMs) to map survey items to vector space and predict empirical item correlations.…”
Section: Introduction (mentioning)
Confidence: 99%