In this study, the authors tested the hypothesis that the basolateral amygdala (BLA), orbitofrontal cortex (OFC), nucleus accumbens core (NA-core), and the extended hippocampus mediate different aspects of the development-maintenance of unique reward expectancies produced by the differential outcomes procedure (DOP). Rats were trained with either DOP or a nondifferential outcomes procedure (NOP) on a simple discrimination task. Fornix lesions did not affect either version of the task, demonstrating that the extended hippocampal system has no role in stimulus-outcome (S-O) associations. In contrast, in the DOP condition, BLA lesions impaired performance throughout training, OFC lesions impaired choice accuracy only in the later maintenance phase, and NA-core lesions resulted in enhanced learning. These results suggest that BLA and OFC are important for establishment (BLA) and behavioral maintenance (OFC) of S-O associations, whereas the NA-core is not needed and can in fact impede using multiple S-O associations. No impairments were observed in the NOP condition, demonstrating that these structures are not critical to stimulus-response learning.Keywords reward expectancy; amygdala; orbitofrontal cortex; nucleus accumbens; rat Learned expectations are the product of various associations (stimulus-outcome [S-O], stimulus-response [S-R], response-outcome [R-O]) that provide accurate predictive information for behavioral adaptation. The differential outcomes procedure (DOP) is a manipulation of reward contingencies that modifies the cognitive strategy-brain regions used on conditional discrimination tasks (Savage, 2001;Trapold, 1970;Trapold & Overmier, 1972;Urcuioli, 2005). During the generic conditional discrimination task, a common or randomized reward procedure is used (nondifferential outcomes procedure [NOP]). Under the NOP condition, an S-R association driven by memory of the sample is used to guide behavior. In contrast, with the DOP condition, each sample stimulus and the subsequent correct response are consistently paired with a specific reinforcer that results in S-O and/or R-O associations. As a result, the subject develops an expectation of the specific reward before the actual reinforcer (Demarse & Urcuioli, 1993;Overmier & Linwick, 2001;Savage, 2001;Savage & Langlais, 1995;Trapold, 1970;Trapold & Overmier, 1972;Urcuioli, 2005).The DOP produces a more rapid learning curve and higher terminal accuracy than the NOP, presumably through the use of unique reward expectations called the differential outcomes effect (DOE;Overmier & Linwick, 2001;Savage, 2001;Savage & Parsons, 1997 1970;Urcuioli, 1990Urcuioli, , 2005. We propose that the functional difference produced by the DOP changes the neural systems used and their associated learning and memory strategies, as compared with the NOP (see Ramirez, Buzzetti, & Savage, 2005;Savage, 2001;Savage, Buzzetti, & Ramirez, 2004;Savage & Parsons, 1997;Savage, Pitkin, & Careri, 1999).Previous findings suggest that several structures, including the basolateral amy...