International Journal of Intelligent Systems

Research Article

CCAH: A CLIP-Based Cycle Alignment Hashing Method for Unsupervised Vision-Text Retrieval

CLIP-based cycle alignment hashing for unsupervised vision-text retrieval.

	Require: Image set ; text set ;
	Batch size set , hash code length , Max epoch .
	Ensure: Deep Feature extract functions , and ;
	encoder function set , and ;
	Hash coding functions , and .
(1)	Initialize the pretrained extractor parameters: .
(2)	Whiledo
(3)	;
(4)	Extract the depth characteristics of each mode: , ;
(5)	Encode the features to get the hidden states, by ;
(6)	Using the hidden states to generate truth matrix and hash codes;
(7)	Decode the hidden states to generate heterogeneous features and
(8)	Calculate the objective function;
(9)	Back propagate the gradient with the chain rule;
(10)	Update the whole parameters;
(11)	end while