Download and extract the data folder, which contains the annotations, the splits by Karpathy et al., and the precomputed ROUGE-L and SPICE relevances for both the COCO and Flickr30K datasets:
wget https://datino.isti.cnr.it/teran/data.tar
tar -xvf data.tar
Download the bottom-up features for both COCO and Flickr30K. We use the code by Anderson et al. for extracting them.
The following commands extract them under data/coco/ and data/f30k/. If you prefer another location, adjust the configuration file accordingly.
# for MS-COCO
wget https://datino.isti.cnr.it/teran/features_36_coco.tar
tar -xvf features_36_coco.tar -C data/coco
# for Flickr30K
wget https://datino.isti.cnr.it/teran/features_36_f30k.tar
tar -xvf features_36_f30k.tar -C data/f30k
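If you extract the features somewhere other than data/, note that tar's -C option expects the target directory to already exist. The following is a minimal sketch of a helper that creates the directory first; the function name extract_to and the FEATURES_ROOT path are hypothetical and not part of this repository, and the configuration file still has to be pointed at whichever location you choose.

```shell
# Hypothetical helper (not part of the repository): extract an archive into a
# target directory, creating that directory first because `tar -C` requires
# it to exist.
extract_to() {
    archive="$1"
    dest="$2"
    mkdir -p "$dest" && tar -xf "$archive" -C "$dest"
}

# Example usage. FEATURES_ROOT is an assumed location; substitute your own.
# FEATURES_ROOT=/mnt/features
# extract_to features_36_coco.tar "$FEATURES_ROOT/coco"
# extract_to features_36_f30k.tar "$FEATURES_ROOT/f30k"
```

The usage lines are left commented out so that sourcing the snippet only defines the function and does not touch the filesystem.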
@article{liu2023efficient,
title={Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training},
author={Liu, Chong and Zhang, Yuqi and Wang, Hongsong and Chen, Weihua and Wang, Fan and Huang, Yan and Shen, Yi-Dong and Wang, Liang},
journal={IEEE Transactions on Image Processing},
year={2023}
}