A natural language fMRI dataset for voxelwise encoding models

LeBel, Amanda; Wagner, Lauren; Jain, Shailee; Adhikari-Desai, Aneesh; Gupta, Bhavin; Morgenthal, Allyson; Tang, Jerry; Xu, Lixiang; Huth, Alexander G.

doi:10.1101/2022.09.22.509104

Cited by 8 publications

(8 citation statements)

References 44 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Moreover, unlike a non-human primate with chronically implanted electrodes, where it is possible to obtain neural responses to thousands of stimuli (over the course of days or weeks; e.g., 140 ), for humans, we are generally more limited in how much data is feasible to collect, both because of general constraints on participants' time and boredom/cognitive fatigue. One recent approach to combat the latter has been to turn to rich naturalistic stimuli, like stories, podcasts, or movies and to collect massive amounts of data (sometimes, many hours' worth) from a small number of individuals (e.g., 119,141,142 )-what is often referred to as the 'deep data' approach (e.g., [143][144][145][146][147][148] ). However, such stimuli do not sample the space of linguistic and/or semantic variation well, and consequently, do not allow for testing the model on stimuli that differ substantially from those used for training.…”

Section: Discussionmentioning

confidence: 99%

“…And second, language processing requires attentional engagement 141 , and such engagement is difficult to sustain for an extended period of time, especially if stimuli are repeated. One recent approach to combat fatigue/boredom has been to turn to rich naturalistic stimuli, like stories, podcasts, or movies and to collect massive amounts of data (sometimes, many hours' worth) from a small number of individuals (e.g., 118,142,143 )-what is often referred to as the 'deep data' approach (e.g., [144][145][146][147][148][149] ). However, such stimuli plausibly do not sample the space of linguistic and/or semantic variation well (see SI 10 for evidence), and consequently, do not allow for testing models on stimuli that differ substantially from those used during training.…”

Section: Discussionmentioning

confidence: 99%

See 1 more Smart Citation

Driving and suppressing the human language network using large language models

Tuckute

Sathe

Srikant

et al. 2023

Preprint

View full text Add to dashboard Cite

Transformer language models are today's most accurate models of language processing in the brain. Here, using fMRI-measured brain responses to 1,000 diverse sentences, we develop a GPT-based encoding model to identify new sentences that are predicted to drive or suppress responses in the human language network. We demonstrate that these model-selected 'out-of distribution' sentences indeed drive and suppress activity of human language areas in new individuals (85.7% increase and 97.5% decrease relative to the diverse naturalistic sentences). A systematic analysis of the model-selected sentences reveals that surprisal and well-formedness of linguistic input are key determinants of response strength in the language network. These results establish the ability of accurate models of the brain to noninvasively control neural activity in higher-level cortical areas, like the language network.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Discussionmentioning

confidence: 99%

Driving and suppressing the human language network using large language models

Tuckute

Sathe

Srikant

et al. 2023

Preprint

View full text Add to dashboard Cite

show abstract

“…Naturalistic stimulus data sets are easier to construct and often larger than controlled stimuli. For example, J. Chen et al (2017) publicly released a data set collected on a 50 min movie, Wehbe, Murphy, et al (2014) released data collected on an entire chapter from the Harry Potter books, comprising more than 5,000 words, and LeBel et al (2022) released data collected on over 5 hr of English podcasts per participant. These stimuli also provide a diverse test bed of linguistic phenomena—from a broad array of semantic concepts to rich temporal structure capturing discourse-level information.…”

Section: Experimental Designs In Language Neurosciencementioning

confidence: 99%

Computational Language Modeling and the Promise of In Silico Experimentation

Jain

Wehbe

et al. 2024

Neurobiology of Language

Self Cite

View full text Add to dashboard Cite

Language neuroscience currently relies on two major experimental paradigms: controlled experiments using carefully hand-designed stimuli, and natural stimulus experiments. These approaches have complementary advantages which allow them to address distinct aspects of the neurobiology of language, but each approach also comes with drawbacks. Here we discuss a third paradigm—in silico experimentation using deep learning-based encoding models—that has been enabled by recent advances in cognitive computational neuroscience. This paradigm promises to combine the interpretability of controlled experiments with the generalizability and broad scope of natural stimulus experiments. We show four examples of simulating language neuroscience experiments in silico and then discuss both the advantages and caveats of this approach.

show abstract

“…For this representation to be useful in an encoding analysis, it is important to sample a large set of different stimuli. To this avail, several efforts have been devoted towards the creation of large datasets to be used for testing the encoding of computational models in fMRI [Allen et al, 2022, LeBel et al, 2023]. Next, a link between brain activity (e.g.…”

Section: Introductionmentioning

confidence: 99%

Unbiased estimation of the coefficient of determination in linear models: an application to fMRI encoding model comparison

Castellanos,

De Martino,

Valente

2024

Preprint

View full text Add to dashboard Cite

Neuroscientific investigation has greatly benefited from the combination of functional Magnetic Resonance Imaging (fMRI) with linearized encoding, which allows to validate and compare computational models of neural activity based on neuroimaging data. In linearized encoding, a multidimensional feature space, usually obtained from a computational model applied to the stimuli, is related to the measured brain activity. This is often done by mapping such space to a dataset (training data, or in-sample), and validating the mapping on a separate dataset (test data, or out-of-sample), to avoid overfitting. When comparing models, the one with the highest explained variance on the test data, as indicated by the coefficient of determination (R2 ), is the one that better reflects the neural computations performed by the brain. An implicit assumption underlying this procedure is that the out-of-sample R2 is an unbiased estimator of the explanatory power of a computational model in the population of stimuli, and can therefore be safely used to compare models. In this work, we show that this is not the case, as the out-of-sample R2 has a negative bias, related to the amount of overfitting in the training data. This phenomenon has dramatic implications for model comparison when models of different dimensionalities are compared. To this aim, we develop an analytical framework that allows us to evaluate and correct biases in both in- and out-of-sample R2, with and without L2 regularization. Our proposed approach yields unbiased estimators of the population R2, thus enabling a valid model comparison. We validate it through illustrative simulations and with an application to a large public fMRI dataset.

show abstract

A natural language fMRI dataset for voxelwise encoding models

Cited by 8 publications

References 44 publications

Driving and suppressing the human language network using large language models

Driving and suppressing the human language network using large language models

Computational Language Modeling and the Promise of In Silico Experimentation

Unbiased estimation of the coefficient of determination in linear models: an application to fMRI encoding model comparison

Contact Info

Product

Resources

About