Research Article
CCAH: A CLIP-Based Cycle Alignment Hashing Method for Unsupervised Vision-Text Retrieval
Figure 5
t-SNE visualization of the data on the Flickr-25K dataset. (a) Original image features. (b) Distribution of encoded image features. (c) Original text features. (d) Distribution of encoded text features. Circles (○) and stars denote text and image samples, respectively, and different colors denote different semantic categories.