Research Article

A Novel Low-Bit Quantization Strategy for Compressing Deep Neural Networks

Figure 4

Comparison of accuracy with different combinations of quantized weights and activations. The horizontal axis shows the activation approximation bits and the vertical axis represents the quantization bits of network weight.