“…Benchmarks have played a significant role in other areas such as computer vision and speech recognition. Examples include MNIST (LeCun et al., 1998), Caltech101 (Fei-Fei et al., 2006), CIFAR (Krizhevsky & Hinton, 2009), ImageNet (Deng et al., 2009), PASCAL VOC (Everingham et al., 2010), BSDS500 (Martin et al., 2001), SWITCHBOARD (Godfrey et al., 1992), TIMIT (Garofolo et al., 1993), Aurora (Hirsch & Pearce, 2000), and VoiceSearch (Yu et al., 2007). The lack of a standardized and challenging testbed for reinforcement learning and continuous control makes it difficult to quantify scientific progress.…”