“…A variety of models have been developed for this tradeoff in many different contexts. They range from the idea of satisficing (Simon, 1990) and derived Bayesian satisficing models (Fu & Gray, 2006) to sequential heuristic models of optimal stopping (Seale & Rapoport, 2000), mutual mate choice (Todd & Miller, 1999), Bayesian optional stopping models (Edwards, 1965), Bayesian observer models (Vul et al, 2009), the accumulation of evidence to a threshold criterion (Busemeyer & Townsend, 1993;Ratcliff, 1978;Vickers, 1979), and models based on cognitive architectures for dynamic decision making (Gonzalez & Dutt, 2011;Gonzalez, Lerch, & Lebiere, 2003) During an exploitation to exploration tradeoff, the agent faces the question of how long to continue exploiting a current option and thereby obtain its rewards, and when to switch to exploring alternatives and thereby increasing the chance to find potentially better options elsewhere. Perhaps the most prominent model of this tradeoff, the marginal value theorem (MVT; Charnov, 1976), comes from the foraging literature where research has long attempted to formalize optimal behavior (Stephens & Krebs, 1987).…”