“…Hence, it is of special importance to develop an efficient quantization scheme, because it can significantly contribute to the performance of quantized NN. Most of the available quantization schemes are based on fixed-length coding, where different codewords length are employed including > 8 bits [23], 8 bits [1], 4 bits [3], 2 bits [4,5,20,21,26] or even 1 bit [8,22,27,28]. It was reported that quantized NN provides a negligible decreasing of performance with respect to full precision NN when high bit lengths (≥ 8 bits) have been employed, where compression ratio ≤ 4 times was achieved.…”