“…There has been an incredible amount of interest in DNN hardware acceleration. Broadly speaking, the architecture community has focused on designing efficient dataflows to maximize local reuse of data and functional unit utilization [4,10,11,15,28,34,37,39], explore the space of possible dataflows and mappings [26,45,74], exploit model sparsity and data quantization [17,21,29,38,46,53,71,73,78], map DNN accelerators to FPGAs [20,66,69], and explore alternative compute, memory, and packaging technologies [35,58,59,67]. All of these works are highly relevant to this field.…”