Research Article

CCAH: A CLIP-Based Cycle Alignment Hashing Method for Unsupervised Vision-Text Retrieval

Figure 5

t-SNE visualization of the data on the Flickr-25K. (a) Original image features. (b) Image encoded feature distribution. (c) Original text features. (d) Text encoded feature distribution. In the figure, the circle (○) and star () denote the representation of text and image samples, respectively, and different colors denote the representation with different semantic categories.
(a)
(b)
(c)
(d)