Neural surprise in somatosensory Bayesian learning

Gijsen, Sam; Grundei, Miro; Lange, R. T.; Ostwald, Dirk; Blankenburg, Felix

doi:10.1371/journal.pcbi.1008068

Cited by 34 publications

(74 citation statements)

References 110 publications

(166 reference statements)


“…To do so, we consider a generative model of a volatile environment (Fig. 1A) that captures a few key features of daily life and unifies many existing model environments in neuroscience and psychology (Behrens et al., 2007; Daw et al., 2011; Findling et al., 2021; Gijsen et al., 2021; Gläscher et al., 2010; Glaze et al., 2015; Heilbron & Meyniel, 2019; Horvath et al., 2021; Huys et al., 2015; Liakoni et al., 2021; Mars et al., 2008; Meyniel et al., 2016; Nassar et al., 2012; Nassar et al., 2010; Ostwald et al., 2012; Wilson et al., 2013; Xu et al., 2021). The generative model describes the subjective interpretation of the environment from the point of view of an agent (e.g., a human participant or an animal).…”

Section: Subjective World-model: a Unifying Generative Model (mentioning)

confidence: 99%

“…Specifically: B. Standard generative model for studying passive learning in volatile environments (Adams & MacKay, 2007; Fearnhead & Liu, 2007; Liakoni et al., 2021; Nassar et al., 2012; Nassar et al., 2010; Wilson et al., 2013), C. Generative model corresponding to variants of bandit and reversal bandit tasks (Behrens et al., 2007; Findling et al., 2021; Horvath et al., 2021), where the cue variable X_t = A_t is a participant's action, D. Generative model for modeling human inferences about binary sequences (Gijsen et al., 2021; Maheu et al., 2019; Meyniel et al., 2016; Modirshanechi et al., 2019; Mousavi et al., 2020), and E. classic Markov Decision Processes (MDPs) (Daw et al., 2011; Gläscher et al., 2010; Huys et al., 2015; Lehmann et al., 2019; Schultz et al., 1997; Sutton & Barto, 2018), where the cue variable X_t = (A_{t−1}, Y_{t−1}) consists of previous action and observation. See Appendix A: Special cases and links to related works for details.…”

Section: Additional Notation Belief and Marginal Probability (mentioning)

confidence: 99%

“…variable Z (e.g., the amplitude of the EEG P300 component (Kolossa et al., 2015; Meyniel et al., 2016)) is sensitive to or representative of surprise. Given two measures of surprise S and S′, a typical experimental question is which one of them (if any) more accurately explains the variations of the variable Z (Gijsen et al., 2021; Kolossa et al., 2015; Mousavi et al., 2020; Ostwald et al., 2012; Visalli et al., 2021); see Fig. 2A1.…”

Section: Theories Of Surprise: a Technical Review (mentioning)

confidence: 99%


Surprise: a unified theory and experimental predictions

Modirshanechi, Brea, Gerstner (2021). Preprint.


Surprising events trigger measurable brain activity and influence human behavior by affecting learning, memory, and decision-making. Currently there is, however, no consensus on the definition of surprise. Here we identify 16 mathematical definitions of surprise in a unifying framework, show how these definitions relate to each other, and prove under what conditions they are indistinguishable. We classify these surprise measures into four main categories: (i) change-point detection surprise, (ii) information gain surprise, (iii) prediction surprise, and (iv) confidence-correction surprise. We design experimental paradigms where different categories make different predictions: we show that surprise-modulation of the speed of learning leads to sensible adaptive behavior only for change-point detection surprise whereas surprise-seeking leads to sensible exploration strategies only for information gain surprise. However, since neither change-point detection surprise nor information gain surprise perfectly reflect the definition of ‘surprise’ in natural language, a combination of prediction surprise and confidence-correction surprise is needed to capture intuitive aspects of surprise perception. We formalize this combination in a new definition of surprise with testable experimental predictions. We conclude that there cannot be a single surprise measure with all functions and properties previously attributed to surprise. Consequently, we postulate that multiple neural mechanisms exist to detect and signal different aspects of surprise.

Author note: AM is grateful to Vasiliki Liakoni, Martin Barry, and Valentin Schmutz for many useful discussions in the course of the last few years, and to Andrew Barto for insightful discussions through and after EPFL Neuro Symposium 2021 on “Surprise, Curiosity and Reward: from Neuroscience to AI”. We thank K. Robbins and collaborators for their publicly available experimental data (Robbins et al., 2018). All code needed to reproduce the results reported here will be made publicly available after publication acceptance. This research was supported by Swiss National Science Foundation (no. 200020_184615). Correspondence concerning this article should be addressed to Alireza Modirshanechi, School of Computer and Communication Sciences and School of Life Sciences, EPFL, Lausanne, Switzerland. E-mail: alireza.modirshanechi@epfl.ch.


“…Expected Bayesian surprise is one of many quantities that have been proposed to formally capture immediate information gain and thus myopic exploration (Schwartenbeck et al., 2019). As alluded to in the Introduction, we here opted for Bayesian surprise due to its putative representation in the human neurocognitive system (Itti & Baldi, 2009; Ostwald et al., 2012; Gijsen et al., 2020).…”

Section: Discussion (mentioning)

confidence: 99%

Human Belief State-Based Exploration and Exploitation in an Information-Selective Symmetric Reversal Bandit Task

et al. 2021

Self Cite


Humans often face sequential decision-making problems, in which information about the environmental reward structure is detached from rewards for a subset of actions. In the current exploratory study, we introduce an information-selective symmetric reversal bandit task to model such situations and obtained choice data on this task from 24 participants. To arbitrate between different decision-making strategies that participants may use on this task, we developed a set of probabilistic agent-based behavioral models, including exploitative and explorative Bayesian agents, as well as heuristic control agents. Upon validating the model and parameter recovery properties of our model set and summarizing the participants’ choice data in a descriptive way, we used a maximum likelihood approach to evaluate the participants’ choice data from the perspective of our model set. In brief, we provide quantitative evidence that participants employ a belief state-based hybrid explorative-exploitative strategy on the information-selective symmetric reversal bandit task, lending further support to the finding that humans are guided by their subjective uncertainty when solving exploration-exploitation dilemmas.


“…We also compared them to two types of heuristics which perform very well in this environment: the classic 'delta-rule' heuristic (Rescorla & Wagner, 1972; R. S. Sutton & Barto, 1998) and the more accurate 'leaky' heuristic (Gijsen et al., 2021; Heilbron & Meyniel, 2019; Meyniel et al., 2016; Yu & Cohen, 2008) (see Methods for details). To test the statistical reliability of our conclusions, we trained separately 20 agents of each type (each type of network and each type of heuristic).…”

Section: Performance In the Face Of Changes In Latent Probabilities (mentioning)

confidence: 99%

Gated recurrence enables simple and accurate sequence prediction in stochastic, changing, and structured environments

Foucault, Meyniel (2021). Preprint.


From decision making to perception to language, predicting what is coming next is crucial. It is also challenging in stochastic, changing, and structured environments; yet the brain makes accurate predictions in many situations. What computational architecture could enable this feat? Bayesian inference makes optimal predictions but is prohibitively difficult to compute. Here, we show that a specific recurrent neural network architecture enables simple and accurate solutions in several environments. To this end, a set of three mechanisms suffices: gating, lateral connections, and recurrent weight tuning. Like the human brain, such networks develop internal representations of their changing environment (including estimates of the environment's latent variables and the precision of these estimates), leverage multiple levels of latent structure, and adapt their effective learning rate to changes without changing their connection weights. Being ubiquitous in the brain, gated recurrence could therefore serve as a generic building block to predict in real-life environments.
