“…Specifically: B. Standard generative model for studying passive learning in volatile environments (Adams & MacKay, 2007;Fearnhead & Liu, 2007;Liakoni et al, 2021;Nassar et al, 2012;Nassar et al, 2010;Wilson et al, 2013), C. Generative model corresponding to variants of bandit and reversal bandit tasks (Behrens et al, 2007;Findling et al, 2021;Horvath et al, 2021), where the cue variable X t = A t is a participant's action, D. Generative model for modeling human inferences about binary sequences (Gijsen et al, 2021;Maheu et al, 2019;Meyniel et al, 2016;Modirshanechi et al, 2019;Mousavi et al, 2020), and E. classic Markov Decision Processes (MDPs) (Daw et al, 2011;Gläscher et al, 2010;Huys et al, 2015;Lehmann et al, 2019;Schultz et al, 1997;Sutton & Barto, 2018), where the cue variable X t = (A t−1 , Y t−1 ) consists of previous action and observation. See Appendix A: Special cases and links to related works for details.…”