Research Article
Differentiable Network Pruning via Polarization of Probabilistic Channelwise Soft Masks
Figure 2
Effect of batch size on learning the probabilistic masks. Batch sizes of 64, 128, and 256 were tested. At every batch size, the soft masks separated clearly into two groups.