Michael J. Tobia scite author profile

We derive a family of risk-sensitive reinforcement learning methods for agents who face sequential decision-making tasks in uncertain environments. Applying a utility function to the temporal difference (TD) error effectively applies nonlinear transformations not only to the received rewards but also to the true transition probabilities of the underlying Markov decision process. When appropriate utility functions are chosen, the agents' behaviors express key features of human behavior as predicted by prospect theory (Kahneman & Tversky, 1979), for example, different risk preferences for gains and losses, as well as the shape of subjective probability curves. We derive a risk-sensitive Q-learning algorithm, which is necessary for modeling human behavior when transition probabilities are unknown, and prove its convergence. As a proof of principle for the applicability of the new framework, we apply it to quantify human behavior in a sequential investment task. We find that the risk-sensitive variant provides a significantly better fit to the behavioral data and that it leads to an interpretation of the subject's responses that is indeed consistent with prospect theory. The analysis of simultaneously measured fMRI signals shows a significant correlation of the risk-sensitive TD error with BOLD signal change in the ventral striatum. In addition, we find a significant correlation of the risk-sensitive Q-values with neural activity in the striatum, cingulate cortex, and insula that is not present if standard Q-values are used.
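The central idea in the abstract above, applying a utility function to the TD error inside a Q-learning update, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the piecewise-linear utility and its slopes (`k_gain`, `k_loss`), the learning rate and discount factor, and the toy task with one safe action and one fair gamble.

```python
import random


def utility(td_error, k_gain=0.5, k_loss=1.5):
    """Hypothetical piecewise-linear utility with u(0) = 0.

    k_loss > k_gain weights negative TD errors more heavily than
    positive ones, which yields risk-averse value estimates.
    """
    return k_gain * td_error if td_error >= 0 else k_loss * td_error


def identity(td_error):
    """Identity utility: recovers the standard Q-learning update."""
    return td_error


def q_update(q, state, action, reward, next_state, u, alpha=0.01, gamma=0.9):
    """One tabular update in which the utility u is applied to the TD error."""
    td_error = reward + gamma * max(q[next_state]) - q[state][action]
    q[state][action] += alpha * u(td_error)
    return td_error


def simulate(u, n_steps=20000, seed=0):
    """Toy task (assumed): in state 0, action 0 is safe (reward 0) and
    action 1 is a gamble (reward +1 or -1 with equal probability).
    State 1 is terminal, so its Q-values stay at zero."""
    rng = random.Random(seed)
    q = [[0.0, 0.0], [0.0, 0.0]]  # q[state][action]
    for _ in range(n_steps):
        action = rng.randrange(2)  # uniform random exploration
        reward = 0.0 if action == 0 else rng.choice([1.0, -1.0])
        q_update(q, 0, action, reward, 1, u)
    return q[0]


if __name__ == "__main__":
    print("standard Q-values:      ", simulate(identity))
    print("risk-sensitive Q-values:", simulate(utility))
```

With these assumed slopes, the expected update for the gamble vanishes at a Q-value of -0.5, whereas the identity utility gives an expected value of 0. The learned estimates fluctuate around those points, so the risk-sensitive agent in this toy task comes to prefer the safe action even though the gamble is fair.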

Multiple sensitivity profiles to diversity and transition structure in non-stationary input

Tobia, Iacovella, Hasson. 2012. NeuroImage

Dynamic functional connectivity and individual differences in emotions during social stress

Tobia, Hayashi, Ballard, et al. 2017. Human Brain Mapping

Exposure to acute stress induces multiple emotional responses, each with their own unique temporal dynamics. Dynamic functional connectivity (dFC) measures the temporal variability of network synchrony and captures individual differences in network neurodynamics. This study investigated the relationship between dFC and individual differences in emotions induced by an acute psychosocial stressor. Sixteen healthy adult women underwent fMRI scanning during a social evaluative threat (SET) task, and retrospectively completed questionnaires that assessed individual differences in subjectively experienced positive and negative emotions about stress and stress relief during the task. Group dFC was decomposed with parallel factor analysis (PARAFAC) into 10 components, each with a temporal signature, spatial network of functionally connected regions, and vector of participant loadings that captures individual differences in dFC. Participant loadings of two networks were positively correlated with stress-related emotions, indicating the existence of networks for positive and negative emotions. The emotion-related networks involved the ventromedial prefrontal cortex, cingulate cortex, anterior insula, and amygdala, among other distributed brain regions, and time signatures for these emotion-related networks were uncorrelated. These findings demonstrate that individual differences in stress-induced positive and negative emotions are each uniquely associated with large-scale brain networks, and suggest that dFC is a mechanism that generates individual differences in the emotional components of the stress response. Hum Brain Mapp 38:6185-6205, 2017. © 2017 Wiley Periodicals, Inc.
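As a rough illustration of what "temporal variability of network synchrony" can mean in practice, the sketch below estimates dFC with sliding-window correlations. This is an assumption: the abstract does not say how dFC was estimated, and the sliding-window estimator, the function names, and the window and step sizes are all chosen here for illustration. The PARAFAC decomposition described in the abstract is not reproduced.

```python
import numpy as np


def sliding_window_dfc(ts, window, step=1):
    """Sliding-window correlation matrices (assumed estimator).

    ts: array of shape (n_timepoints, n_regions).
    Returns an array of shape (n_windows, n_regions, n_regions).
    """
    n_timepoints = ts.shape[0]
    starts = range(0, n_timepoints - window + 1, step)
    # np.corrcoef treats rows as variables, so each window is transposed.
    return np.stack([np.corrcoef(ts[s:s + window].T) for s in starts])


def edge_variability(dfc):
    """Standard deviation over windows of each region pair's correlation."""
    return dfc.std(axis=0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    ts = rng.standard_normal((120, 4))  # synthetic data, not real fMRI
    dfc = sliding_window_dfc(ts, window=30, step=10)
    print(dfc.shape)
    print(edge_variability(dfc).round(2))
```

Stacking such window-by-region-by-region arrays across participants would give the kind of multi-way array that a tensor decomposition operates on, though the exact construction used in the study is not specified in the abstract.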

The effect of estrogen synthesis inhibition on hippocampal memory

Bayer, Rune, Schultz, et al. 2015. Psychoneuroendocrinology

Neural systems mediating recognition of changes in statistical regularities

et al. 2012

Rapidly acquired multisensory association in the olfactory cortex

et al. 2015

Background: The formation of an odor percept in humans is strongly associated with visual information. However, much less is known about the roles of learning and memory in shaping the multisensory nature of odor representations in the brain. Method: The dynamics of odor and visual association in olfaction was investigated using three functional magnetic resonance imaging (fMRI) paradigms. In two paradigms, a visual cue was paired with an odor. In the third, the same visual cue was never paired with an odor. In this experimental design, if the visual cue was not influenced by odor–visual pairing, then the blood‐oxygen‐level‐dependent (BOLD) signal elicited by subsequent visual cues should be similar across all three paradigms. Additionally, intensity, a major dimension of odor perception, was used as a modulator of associative learning, which was characterized in terms of the spatiotemporal behavior of the BOLD signal in olfactory structures. Results: A single odor–visual pairing cue could subsequently induce primary olfactory cortex activity when only the visual cue was presented. This activity was intensity dependent and was also detected in secondary olfactory structures and hippocampus. Conclusion: This study provides evidence for a rapid learning response in the olfactory system by a visual cue following odor and visual cue pairing. The novel data and paradigms suggest new avenues to explore the dynamics of odor learning and multisensory representations that contribute to the construction of a unified odor percept in the human brain.

Reward Processing in Children With Disruptive Behavior Disorders and Callous-Unemotional Traits in the ABCD Study

Hawes, Waller, Byrd, et al. 2021. AJP

Neural systems for choice and valuation with counterfactual learning signals

et al. 2014


Risk-Sensitive Reinforcement Learning
