Behavioral strategies are often classified based on whether reinforcement is controlled by the value of the reinforcer. Value-sensitive behaviors, in which animals update their actions when reinforcer value is changed, are classified as goal-directed; conversely, value-insensitive actions, where behavior remains consistent when the reinforcer is removed or devalued, are considered habitual. Understanding the features of operant training that bias behavioral control toward either strategy is essential to understanding the cognitive and neuronal processes on which they rely. Using basic reinforcement principles, behavior can be biased toward relying on either process: random ratio (RR) schedules are thought to promote the formation of goal-directed behaviors while random intervals (RI) promote habitual control. However, how the schedule-specific features of these task structures relate to external factors to influence behavior is not well understood. Using male and female mice on distinct food restriction levels, we trained each group on RR schedules with responses-per-reinforcer rates matched to their RI counterparts to control for differences in reinforcement rate. We determined that food restriction level has a stronger effect on the behavior of mice following RR schedules than mice following RI schedules and that food restriction better predicted sensitivity to outcome devaluation than training schedule. Our results support the idea that relationships between RR or RI schedules with goal-directed or habitual behaviors, respectively, are more nuanced than previously appreciated and suggest that an animal's engagement in a task must be accounted for, together with the structure of reinforcement schedules, to appropriately interpret the cognitive underpinnings of behavior.
Concatenating actions into automatic routines is evolutionarily advantageous as it allows organisms to efficiently use time and energy under predictable conditions. However, over reliance on inflexible behaviors can be life-threatening in a changing environment and can become pathological in disease states such as obsessive compulsive disorder (OCD) and substance use disorder (SUD). Understanding the conditions under which stereotypical sequences of actions are produced is crucial to studying how these behaviors can become maladaptive. Here, we investigated the ability of operant conditioning schedules and contingencies to promote reproducible sequences of five lever presses. We found that signaling reinforcer delivery with a visual cue was effective at increasing learning rates but resulted in mice pressing the lever in fast succession until the cue turned on, rather than pressing it five times. We also found that requiring mice to collect their reinforcer between sequences had little effect on both rate of behavior and on quantitative metrics of reproducibility such as inter-response interval (IRI) variance, and that a training strategy that directly reinforced sequences with low variance IRIs was not more effective than a traditional fixed ratio schedule at promoting reproducible action execution. Together, our findings provide insights into the parameters of behavioral training that promote reproducible sequences and serve as a roadmap to investigating the neural substrates of automatic behaviors.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.