Research Article

Custom Network Quantization Method for Lightweight CNN Acceleration on FPGAs

Algorithm 1

Framework of custom network quantization.
Input: Data, quantizers, pre-trained FP network with convolutional layers
Output: The quantized network inference model
1: Add quantizers before convolution operators;
2: for do
3:   Forward propagation by to weights of the network and by to activations of the network ;
4:   Backward propagation by STE to update network parameters;
5: end for
6: Add quantizers before non-convolution operators;
7: Re-train the network and subgraph fusion;
8:return quantized network inference model;