“…One traditional goal in reinforcement learning is for agents to continually improve their performance as they obtain more data (Hutter, 2005;Ring, 1997;Singh, Barto, & Chentanez, 2004;Sutton et al, 2011;Thrun & Mitchell, 1993;Wilson, 1985). Measuring the extent to which this is the case for a given agent can be a challenge, and this challenge is exacerbated in the Arcade Learning Environment, where the agent is evaluated across 60 games.…”