Review Article
Automatic Image Caption Generation Based on Some Machine Learning Algorithms
Table 1
BLEU and CIDEr results for the models using the InceptionV3, ResNet-50, MobileNet, and EfficientNet-B1 pretrained networks.
| Model | B1 | B2 | B3 | B4 | CIDEr |
| --- | --- | --- | --- | --- | --- |
| Up-down [24] | 0.802 | 0.641 | 0.491 | 0.369 | 1.179 |
| Attention based [23] | 0.748 | 0.525 | 0.365 | 0.235 | 1.041 |
| Our method (InceptionV3) | 0.821 | 0.693 | 0.452 | 0.441 | 1.092 |
| Our method (MobNet) | 0.707 | 0.563 | 0.516 | 0.366 | 0.797 |
| Our method (ResNet-50) | 0.784 | 0.732 | 0.458 | 0.38 | 0.090 |
| Our method (EffNet-B1) | 0.802 | 0.756 | 0.501 | 0.396 | 0.812 |