“…To address this problem, the quantization technique has emerged as a promising network compression solution and achieved substantial progress in recent years. It can largely reduce the network storage and meanwhile accelerate the inference speed using different types of quantizers, mainly including binary/ternary [8,9,16,23,45,25,33], uniform [46,28,6,26,43,18,36,42,20,7,2,39,21] and nonuniform [44,35,13,5,41,30,38,37,3].…”