VOVU: A Method for Predicting Generalization in Deep Neural Networks

<div>The dynamic trend of the test loss and VOVU trained on Fashion-MNIST in fully connected networks.The point when VOVU begins to rise is nearly the same point that the test loss starts to rise (the variance of truncated Gaussian distribution of the initialized weight matrix in each nets is 0.05 and 0.1 and 0.15 from left to right, respectively). (a) Variance = 0.05. (b) Variance = 0.1. (c) Variance = 0.15.</div>

Mathematical Problems in Engineering

VOVU: A Method for Predicting Generalization in Deep Neural Networks

Figure 3