“…Even considering only model-free (as opposed to model-based (Daw and Dayan, 2014)) reinforcement learning, there exists a variety of learning rules (e.g., Palminteri et al, 2015; Rescorla and Wagner, 1972; Rummery and Niranjan, 1994; Sutton, Richard, 1988), as well as the possibility of multiple learning rates for positive and negative prediction errors (Christakou et al, 2013; Daw et al, 2002; Frank et al, 2009; Gershman, 2015; Haughey et al, 2007; Niv et al, 2012), and many additional concepts, such as eligibility traces to allow for updating of previously visited states (Barto et al, 1981; Bogacz et al, 2007). Similarly, in the decision-making literature, there exists a wide range of evidence-accumulation models, including most prominently the diffusion decision model (DDM; Ratcliff, 1978; Ratcliff et al, 2016) and race models such as the linear ballistic accumulator model (LBA; Brown and Heathcote, 2008) and racing diffusion (RD) models (Boucher et al, 2007; Hawkins and Heathcote, 2020; Leite and Ratcliff, 2010; Logan et al, 2014; Purcell et al, 2010; Ratcliff et al, 2011; Tillman et al, 2020).…”