Research Article

A Novel Low-Bit Quantization Strategy for Compressing Deep Neural Networks

Figure 1

Convolution operation pipeline. (a) General convolution operation without quantization of weight and activation. (b) Description of proposed method with weight and activation quantized by low-bit.
(a)
(b)