“…Optimizations on Sunway architecture. Several works exploit architectural features of Sunway, such as heterogeneous computing cores, SIMD, register-level communication, and SPM. These are either hand-tuned, application-specific implementations [3,8,19,61] or domain-specific frameworks [18,36,75]. In particular, [38,62,76] perform hand-tuned tiling for parallelism.…”