Computational Intelligence and Neuroscience

Research Article

Visual-Text Reference Pretraining Model for Image Captioning

Training loss curves for the different input methods. The horizontal axis shows the epoch, and the vertical axis shows the loss value.