Research Article
A Novel Low-Bit Quantization Strategy for Compressing Deep Neural Networks
Figure 1
Convolution operation pipeline. (a) Standard convolution without quantization of weights or activations. (b) The proposed method, in which both weights and activations are quantized to low bit widths.