Dataset Entries from this Author
This dataset contains precomputed MS-COCO and Flickr30K Faster R-CNN image features, which are all the data needed for reproducing the experiments in "Stacked Cross Attention for Image-Text Matching", our ECCV 2018 paper. We use splits produced by Andrej Karpathy. The raw images can be downloaded from their original sources http://nlp.cs.illinois.edu/HockenmaierGroup/Framing_Image_Description/KCCA.html, http://shannon.cs.illinois.edu/DenotationGraph/ and http://mscoco.org/.
- Categories:
-