Research Article

[Retracted] Gradient Descent Optimization in Deep Learning Model Training Based on Multistage and Method Combination Strategy

Table 15

Performance of the proposed method, Adam with decay.

ResNet-20 on Cafri-10LSTM on IMDB
Val-lossVal-accVal-lossVal-acc

(Adam + d) + SGD0.59800.85130.93230.8143
(Adam + d) + (SGD + d)0.59590.85280.92910.8150
(Adam + d) + (SGD + M)0.67450.82171.09510.8172
(Adam + d) + (SGD + M + d)0.69600.82321.11280.8137
(Adam + d) + RMSprop0.76570.80781.30520.7917
(Adam + d) + (RMSprop + d)0.77600.81001.08370.8137
(Adam + d) + Adam1.01360.73991.17200.8048
(Adam + d) + (Adam + d)0.96410.75091.24290.8060

The bold values represent the best results.