Research Article

Medical Image Description Based on Multimodal Auxiliary Signals and Transformer

Table 3

Performance of different feature extraction networks.

BLEU1BLEU2BLEU3BLEU4METEORL

ResNet-1010.5050.3180.2190.1590.1950.383
ResNet-1520.4890.3100.2190.1570.2100.375
ResNet_101_32 × 8d0.4930.3060.2030.1370.1980.366
wide_ResNet-101_20.4990.3090.2060.1430.1980.346

The bold values indicate that the model performance of the algorithm is optimal in a certain type of dataset.