Research Article

Differentiable Network Pruning via Polarization of Probabilistic Channelwise Soft Masks

Figure 2

Analysis of the effect of batch size on learning the probabilistic masks. Batch sizes of 64, 128, and 256 were explored. The results show the soft masks were clearly separated into two parts across different batch sizes.