Datasets
Standard Dataset
Four Public Datasets for Explainable Medical Image Classifications
- Citation Author(s):
- Submitted by:
- xiangyu xiong
- Last updated:
- Fri, 08/16/2024 - 04:58
- DOI:
- 10.21227/440a-dp26
- License:
- Categories:
- Keywords:
Abstract
Modern deep neural networks are overparameterized and thus require data augmentation techniques to prevent over-fitting and improve generalization ability. Generative adversarial networks (GANs) are famous for generating visually realistic images. However, the generated images lack diversity and have uncertain class labels. On the other hand, recent methods mix labels proportionally to the salient region. As the main diagnostic information is not always contained within the salient region, we argue that methods along this line can cause label mismatch issues in medical image classifications. Therefore, we propose VariMix, which exploits an absolute difference map (ADM) to address the label mismatching of mixed medical images. The VariMix generates ADM using the image-to-image (I2I) GAN across multiple classes and allows for bidirectional mixing operations between the training samples. We collect four public medical image datasets for automatic medical image classifications: Breast Ultrasound datset, Chest X-Ray Images (CXR) dataset, Eye Disease Retinal Images (Retinal) dataset, Maternal-fetal ultrasound dataset. Extensive experiments prove the superiority of VariMix compared with the existing GAN-based and Mixupbased augmentation methods on four public datasets using Swin Transformer V2 and ConvNeXt architectures. Furthermore, by projecting the source image to the hyperplane of the support vector machine, the proposed I2I GAN can generate hyperplane difference maps (HDM) between the source image and the hyperplane image, demonstrating its ability to interpret medical image classifications.
In each dataset, a three-way split was performed to create training, validation, and test sets with no subject overlap.
1) Breast US Dataset: This dataset consists of the Breast Ultrasound Images (BUSI) dataset [53], and the UDIAT Diagnostic Centre (UDIAT) dataset [54]. The BUSI dataset contains 780 US images: 437 benign, 210 malignant, and 133 normal cases. The UDIAT dataset contains 163 US images:
109 benign and 54 malignant cases, with only one lesion per image. The Breast US dataset total contains 943 US images, which are divided into three classes: benign (546), malignant (264), and normal (133).
2) CXR Dataset: The Chest X-Ray Images (CXR) dataset contains 5228 images, which are divided into three classes: covid-19 (1626), pneumonia (1800), and normal (1802).
3) Retinal Dataset: The Eye Disease Retinal Images (Retinal) dataset contains 4217 retinal images, which are divided into four classes: cataract (1038), diabetic retinopathy (DR) (1098), glaucoma (1007), and normal (1074).
4) Maternal-fetal US Dataset: The Maternal-fetal ultrasound (US) dataset contains 12400 maternal-fetal anatomical planes, which are divided into six classes: fetal-abdomen (711), fetalbrain (3092), fetal-femur (1626), fetal-thorax (1040), maternalcervix (4213) and others (1718).