Research Article
Custom Network Quantization Method for Lightweight CNN Acceleration on FPGAs
Algorithm 1
Framework of custom network quantization.
Input: Data, quantizers, pre-trained FP network with convolutional layers | Output: The quantized network inference model | 1: Add quantizers before convolution operators; | 2: for do | 3: Forward propagation by to weights of the network and by to activations of the network ; | 4: Backward propagation by STE to update network parameters; | 5: end for | 6: Add quantizers before non-convolution operators; | 7: Re-train the network and subgraph fusion; | 8:return quantized network inference model; |
|