“…While Central Processing Unit (CPU) simulation techniques have reached maturity, GPU simulation often suffers from the following problems: (a) instruction sets are not accurately modeled, but approximated by an artificial, low-level intermediate representation [7], [8], (b) GPU simulators do not model existing commercial GPUs, but only simplified GPU architectures [9], (c) instead of using vendor provided driver stacks and compilers, GPU simulators often rely on simplified system software, which may behave entirely differently to original tools [10], [11], and (d) GPUs are treated as standalone devices, not modeling any CPU-GPU transactions [12]. This has led researchers using GPU simulation to rely on tools providing questionable accuracy [13]. For example, many existing GPU simulators including gem5-GPU [14], GpuTejas [15], Multi2Sim [10], GPGPU-Sim [16], and Multi2Sim-Kepler [11] claim cycle-accuracy.…”