Research Article
[Retracted] Gradient Descent Optimization in Deep Learning Model Training Based on Multistage and Method Combination Strategy
Figure 6
Performance of decay alone and of cosine decay combined with decay over 100 epochs of training on MNIST. (a, b) The resulting accuracy and loss (init-lr = 0.01).