Machine Learning
The dataset provides crop-type surveys for Canada's prairie provinces (Manitoba, Saskatchewan and Alberta) in 2020 and 2021. The data were collected via windshield survey(driving through the countryside with GPS-enabled data collection software and satellite imagery). Crop-type points and their geographic coordinates on the ground were gathered using data collection software. Field boundaries were identified on satellite imagery. A single observation point is dropped in a homogeneous area within the field.
- Categories:
Endemic fish species are key components in seafood culinary excursions. Despite the increasing interest in leveraging technology to enhance various seafood culinary activities, there is a shortage of comprehensive datasets containing images of seafood used in artificial intelligence research, mainly those showcasing endemic fish. This research endeavors to bridge this gap by increasing the accuracy of fish recognition and introducing a new dataset comprising images of native fish for application in various machine-learning investigations.
- Categories:
This dataset consists of MRI images of brain tumors, specifically curated for tasks such as brain tumor classification and detection. The dataset includes a variety of tumor types, including gliomas, meningiomas, and glioblastomas, enabling multi-class classification. Each MRI scan is labeled with the corresponding tumor type, providing a comprehensive resource for developing and evaluating machine learning models for medical image analysis. The data can be used to train deep learning algorithms for brain tumor detection, aiding in early diagnosis and treatment planning.
- Categories:
To download this dataset without purchasing an IEEE Dataport subscription, please visit: https://zenodo.org/records/13896353
Please cite the following paper when using this dataset:
- Categories:
you can download these datasets from OpenML: https://www.openml.org/search?type=data&status=active&tags.tag=2019_mult...
- Categories:
you can download these datasets from OpenML: https://www.openml.org/search?type=data&status=active&tags.tag=2019_mult...
- Categories:
Liver cancer treatment, especially for metastatic cases, poses significant challenges in accurately targeting tumours while sparing healthy tissue. Radioembolisation with yttrium-90 (Y-90) microspheres is a promising technique, but precise imaging of microsphere distribution is crucial. This study utilises T-PEPT, a novel Positron Emission Particle Tracking (PEPT) algorithm that combines topological data analysis with machine learning to identify Y-90 microsphere clusters in a digital twin of a patient's liver.
- Categories:
The Unified Multimodal Network Intrusion Detection System (UM-NIDS) dataset is a comprehensive, standardized dataset that integrates network flow data, packet payload information, and contextual features, making it highly suitable for machine learning-based intrusion detection models. This dataset addresses key limitations in existing NIDS datasets, such as inconsistent feature sets and the lack of payload or time-window-based contextual features.
- Categories:
We collected programming problems and their solutions from previous studies. After applying some pre-processing steps, we queried advanced LLMs, such as GPT4, with the collected problems to produce machine-generated codes, while the original solutions were labeled as human-written codes. Finally, the entire collected dataset was divided into training, validation, and test sets, ensuring that there is no overlap among these sets, meaning no solutions in two different sets that solve the same programming problem.
- Categories:
The dataset consists of uplink channel gains, downlink channel gains and uplink to downlink channel gains along with corresponding power allocations for uplink users and downlink users across all subcarriers. Additionally, it consists of NOMA decoding order for successful implementation of SIC at NOMA receiver. The number of UL users and DL users are considered as N=M=6, and subcarriers are S=9. Each column in the dataset is a sample for fading channel realization and it should be converted back to the matrix to compute sumrate.
- Categories: