Research Article

CCAH: A CLIP-Based Cycle Alignment Hashing Method for Unsupervised Vision-Text Retrieval

Table 1

Comparison of mean average precision (MAP@50) at different code lengths on the Flickr-25K and NUS-WIDE datasets. I->T denotes image-to-text retrieval and T->I denotes text-to-image retrieval.

Dataset | Flickr-25K                                                                | NUS-WIDE
Task    | I->T                                | T->I                                | I->T                                | T->I
Method  | 16 bits  32 bits  64 bits  128 bits | 16 bits  32 bits  64 bits  128 bits | 16 bits  32 bits  64 bits  128 bits | 16 bits  32 bits  64 bits  128 bits
CVH     | 0.606    0.599    0.596    0.598    | 0.591    0.583    0.576    0.576    | 0.372    0.362    0.406    0.39     | 0.401    0.384    0.442    0.432
IMH     | 0.612    0.601    0.592    0.579    | 0.603    0.595    0.589    0.58     | 0.47     0.473    0.476    0.459    | 0.478    0.483    0.472    0.462
LCMH    | 0.559    0.569    0.585    0.593    | 0.561    0.569    0.582    0.582    | 0.354    0.361    0.389    0.383    | 0.376    0.387    0.408    0.419
CMFH    | 0.621    0.624    0.625    0.627    | 0.642    0.662    0.676    0.685    | 0.455    0.459    0.465    0.467    | 0.529    0.577    0.614    0.645
LSSH    | 0.584    0.599    0.602    0.614    | 0.618    0.626    0.626    0.628    | 0.481    0.489    0.507    0.507    | 0.455    0.459    0.468    0.473
RFDH    | 0.632    0.636    0.641    0.652    | 0.681    0.693    0.698    0.702    | 0.488    0.492    0.494    0.508    | 0.612    0.641    0.658    0.68
DBRC    | 0.617    0.619    0.62     0.621    | 0.618    0.626    0.626    0.628    | 0.424    0.459    0.447    0.447    | 0.455    0.459    0.468    0.473
UDCMH   | 0.689    0.698    0.714    0.717    | 0.692    0.704    0.718    0.733    | 0.511    0.519    0.524    0.558    | 0.637    0.653    0.695    0.716
DJSRH   | 0.810    0.843    0.862    0.876    | 0.786    0.822    0.835    0.847    | 0.724    0.773    0.798    0.817    | 0.712    0.744    0.771    0.789
DSAH    | 0.863    0.877    0.895    0.903    | 0.846    0.860    0.881    0.882    | 0.775    0.805    0.818    0.827    | 0.770    0.790    0.804    0.815
HNH     | 0.853    0.883    0.895    0.902    | 0.833    0.854    0.868    0.878    | 0.582    0.747    0.800    0.816    | 0.423    0.743    0.781    0.780
CCAH    | 0.863    0.879    0.899    0.910    | 0.891    0.908    0.914    0.913    | 0.715    0.754    0.775    0.787    | 0.834    0.851    0.864    0.874
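
For readers unfamiliar with the metric reported in Table 1, the following is a minimal sketch of how MAP@50 is commonly computed for hashing-based retrieval. It is not taken from the article. It assumes binary codes stored as +1/-1 vectors, multi-hot label vectors, and the common convention that a retrieved item is relevant when it shares at least one label with the query. The function names `hamming_distances` and `map_at_k` are hypothetical, and the choice to normalize by the number of relevant items among the top k is one of several conventions in use.

```python
import numpy as np


def hamming_distances(query_code, db_codes):
    """Hamming distance between one +1/-1 code and each row of db_codes."""
    n_bits = db_codes.shape[1]
    # For +1/-1 codes, the dot product equals n_bits - 2 * hamming_distance.
    return (n_bits - db_codes @ query_code) / 2


def map_at_k(query_codes, db_codes, query_labels, db_labels, k=50):
    """Mean average precision over the top-k items of a Hamming ranking.

    Assumption: an item is relevant if it shares at least one label
    with the query (labels are multi-hot 0/1 vectors).
    """
    average_precisions = []
    for code, labels in zip(query_codes, query_labels):
        dist = hamming_distances(code, db_codes)
        top = np.argsort(dist, kind="stable")[:k]
        relevant = (db_labels[top] @ labels) > 0
        n_relevant = relevant.sum()
        if n_relevant == 0:
            # No relevant item retrieved: this query contributes zero.
            average_precisions.append(0.0)
            continue
        ranks = np.arange(1, len(top) + 1)
        precision_at_rank = np.cumsum(relevant) / ranks
        # Average the precision values at the ranks where a relevant item occurs.
        ap = (precision_at_rank * relevant).sum() / n_relevant
        average_precisions.append(float(ap))
    return float(np.mean(average_precisions))
```

In this sketch, an I->T entry would correspond to passing image codes as `query_codes` and text codes as `db_codes`, and a T->I entry to the reverse. The article's exact evaluation protocol may differ.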