Research Article
CCAH: A CLIP-Based Cycle Alignment Hashing Method for Unsupervised Vision-Text Retrieval
Algorithm 1
CLIP-based cycle alignment hashing for unsupervised vision-text retrieval.
| Require: Image set ; text set ; | | Batch size set , hash code length , Max epoch . | | Ensure: Deep Feature extract functions , and ; | | encoder function set , and ; | | Hash coding functions , and . | (1) | Initialize the pretrained extractor parameters: . | (2) | Whiledo | (3) | ; | (4) | Extract the depth characteristics of each mode: , ; | (5) | Encode the features to get the hidden states, by ; | (6) | Using the hidden states to generate truth matrix and hash codes; | (7) | Decode the hidden states to generate heterogeneous features and | (8) | Calculate the objective function; | (9) | Back propagate the gradient with the chain rule; | (10) | Update the whole parameters; | (11) | end while | | |
|