Datasets
Standard Dataset
Brain Tumor MRI Image Dataset
- Citation Author(s):
- Submitted by:
- Sanasultana Alagur
- Last updated:
- Thu, 02/20/2025 - 10:53
- DOI:
- 10.21227/c62y-r669
- Data Format:
- License:
- Categories:
- Keywords:
Abstract
Abstract
Brain tumors pose a significant challenge in medical diagnostics, necessitating advanced computational approaches for accurate detection and classification. This dataset comprises a curated collection of Magnetic Resonance Imaging (MRI) scans categorized into four distinct classes: No Tumor, Glioma Tumor, Meningioma Tumor, and Pituitary Tumor. To ensure data integrity and reliability, an extensive preprocessing pipeline was implemented, including duplicate image removal using perceptual hashing and correction of mislabeled samples through expert verification. All images were standardized to a resolution of 224×224 pixels to optimize computational efficiency. Furthermore, data augmentation techniques, such as salt-and-pepper noise addition, histogram equalization, rotation, brightness adjustment, and horizontal/vertical flipping, were employed to enhance dataset diversity and robustness. This dataset serves as a valuable benchmark for developing and evaluating deep learning models in medical imaging applications, including automated tumor classification, radiomics analysis, and treatment planning. Researchers are encouraged to utilize this dataset for advancing machine learning-driven medical diagnostics, ensuring appropriate validation through collaboration with domain experts.
This dataset is designed for research purposes in medical imaging and deep learning applications. It consists of MRI scans classified into four categories: No Tumor, Glioma Tumor, Meningioma Tumor, and Pituitary Tumor. The dataset has been preprocessed by removing duplicates, correcting mislabeled samples, and standardizing image resolution to 224×224 pixels. Augmentation techniques, including histogram equalization, rotation, brightness adjustment, and flipping, have been applied to enhance dataset diversity. Users are encouraged to apply machine learning algorithms for classification tasks and validate results with medical experts before clinical application.
Documentation