Research Article

[Retracted] Gradient Descent Optimization in Deep Learning Model Training Based on Multistage and Method Combination Strategy

Figure 6

Performance of decay and combination of cosine decay and decay during 100 epochs of training on MNIST. (a, b) The accuracy and loss of the result (init-lr = 0.01).
(a)
(b)