Research Article

Medical Image Description Based on Multimodal Auxiliary Signals and Transformer

Table 2

Ablation experiments of each module.

DatasetMethodsBLEU1BLEU2BLEU3BLEU4METEORROUGE_LCIDEr

IU-X-rayR2Gen (base)0.4700.3040.2190.1650.1870.3710.398
MDAK0.4800.3280.2310.1720.2010.3690.424
MDAKF (add)0.4940.3180.2290.1740.1940.3890.371
MDAKF (ATT)0.5050.3180.2190.1590.1950.3830.344
MDAKF (cat)0.4840.3070.2210.1670.1920.3910.334
MDAKF (mul)0.4570.2910.2090.1590.1790.3710.372

COV-CTRR2Gen (base)0.7250.6410.5800.5280.3990.6771.358
MDAK0.7230.6520.5860.5450.4030.6761.452
MDAKF (add)0.7260.6510.5830.5390.4010.6831.354
MDAKF (ATT)0.7270.6490.5880.5370.4000.6741.243
MDAKF (cat)0.7220.6400.5760.5240.4050.6831.305
MDAKF (mul)0.7180.6370.5740.5210.4010.6810.302

The bold values indicate that the model performance of the algorithm is optimal in a certain type of dataset.