Research Article

An Improved EfficientNetV2 Model Based on Visual Attention Mechanism: Application to Identification of Cassava Disease

Table 9

Different network performance results on augmentation dataset. Infer time is measured on 1080ti GPU with batch size 16 using the same codebase; train time is the total training time. All models are trained with transfer learning.

ModelTop1 Acc (%)Param (M)FLOPs (G)Infer time (s)Train time (h)

AlexNet89.857.00.82.014.8
VGG1696.310215.51.5822.4
GoogLeNet94.910.21.521.027.8
ResNet3497.822.33.681.3068.7
RegNetX95.65.50.431.2125
RegNetY98.45.10.421.3227
EfficientNetV298.5321.42.9081.0135
PDRNet99.5621.52.9091.0135