Computational Intelligence and Neuroscience

Research Article

Visual-Text Reference Pretraining Model for Image Captioning

A visual display of the generated captions and the corresponding visual regions on Visual Genome.