“…Contrary to that, early studies on the ΔRewP effect indicated that it corresponds to a simple dichotomous classification into good and bad outcomes (e.g., Gehring & Willoughby, 2002; Hajcak et al., 2006; Yeung & Sanfey, 2004). However, with an increase in complexity of experimental tasks later research repeatedly demonstrated that RewP amplitudes can mirror a more finely graded scaling of outcome values (e.g., Bellebaum et al., 2010; Frömer et al., 2016; Kreussel et al., 2012; Meadows et al., 2016; Osinsky et al., 2018; Osinsky et al., 2012; also see Sambrook & Goslin, 2015). For instance, hierarchical reinforcement learning has primarily been investigated with pseudo‐reward tasks where an overall goal depends on the success of various subgoals, and hence, involves multiple decision steps (e.g., Diuk et al., 2013; Ribas‐Fernandes et al., 2011, 2019).…”