Cross-modal Retrieval; Image-text Matching; Multi-modal; Deep Learning

LeafNet: A large-scale dataset for training image-text models in leaf disease identification

The PlantVillage dataset, with over 54,000 images spanning 14 plant species and 26 disease types, has been widely used for leaf disease classification. However, it is limited in both scale and diversity. To address these limitations, we developed LeafNet, a large-scale dataset designed to support foundation models for leaf disease diagnosis. LeafNet comprises over 186,000 images from 22 crop species, covering 43 fungal diseases, 8 bacterial diseases, 2 mould (oomycete) diseases, 6 viral diseases, and 3 mite-induced diseases, categorized into 97 classes.

Categories:

SCAN Faster R-CNN Image Features

This dataset contains precomputed MS-COCO and Flickr30K Faster R-CNN image features, which are all the data needed for reproducing the experiments in "Stacked Cross Attention for Image-Text Matching", our ECCV 2018 paper. We use splits produced by Andrej Karpathy. The raw images can be downloaded from their original sources http://nlp.cs.illinois.edu/HockenmaierGroup/Framing_Image_Description/KCCA.html, http://shannon.cs.illinois.edu/DenotationGraph/ and http://mscoco.org/.

Categories:

Subscribe to Cross-modal Retrieval; Image-text Matching; Multi-modal; Deep Learning