Machine Learning
This compressed package contains nine multi-label text classification datasets: AAPD, CitySearch, Heritage, Laptop, Ohsumed, RCV1, Restaurant, Reuters, and Sentihood.
- Categories:
Nasal Cytology, or Rhinology, is the subfield of otolaryngology focused on the microscopic observation of samples of the nasal mucosa, with the aim of recognizing cells of different types in order to spot and diagnose ongoing pathologies. The method achieves good accuracy in diagnosing rhinitis and infections, and it is cheap and accessible: it requires no instrument more complex than a microscope, and an optical one is sufficient.
- Categories:
This database contains Synthetic High-Voltage Power Line Insulator Images.
There are two sets of images: one for image segmentation and another for image classification.
The first set contains images with different insulator materials and landscapes. The landscape types are Mountains, Forest, Desert, City, Stream, and Plantation. Each landscape type has 2,627 images per insulator type (Ceramic, Polymeric, or Glass), for a total of 47,286 distinct images.
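The stated total can be checked from the counts given above. The following is a minimal sketch of that arithmetic only; the variable names are illustrative and do not reflect the dataset's actual directory layout.

```python
# Counts taken from the dataset description; names are illustrative only.
landscape_types = ["Mountains", "Forest", "Desert", "City", "Stream", "Plantation"]
insulator_types = ["Ceramic", "Polymeric", "Glass"]
images_per_landscape_and_insulator = 2627

# One block of 2,627 images for every (landscape, insulator) pair.
total_images = (
    len(landscape_types)
    * len(insulator_types)
    * images_per_landscape_and_insulator
)

print(total_images)
```

Six landscape types times three insulator types gives 18 combinations, and 18 × 2,627 matches the 47,286 images reported.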
- Categories:
Brain-Computer Interface (BCI) solutions are being developed to address the challenges faced by patients with neurodegenerative disorders. However, many current datasets do not cover the languages spoken by these patients, such as Telugu, which is spoken by over 90 million people in India. To bridge this gap, we created a dataset of electroencephalography (EEG) signal samples of commonly used Telugu words. The EEG samples were captured with the OpenBCI Cyton device from volunteers as they pronounced these words.
- Categories:
The Landsat 8 imagery, sourced from USGS Earth Explorer, covers diverse regions: the snow region of the northeastern USA, Brazilian forests, UAE deserts, and northern, central, and southern zones of India. It spans 2018 to 2023, capturing multi-year trends and seasonal changes. The dataset includes bands B4, B5, and B10 at 30-meter resolution from the LANDSAT/LC08/C02/T1_TOA imagery and is intended for land surface temperature (LST) and emissivity prediction models. These bands capture land surface properties such as vegetation health, moisture, and thermal characteristics.
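To illustrate how bands B4 (red), B5 (near infrared), and B10 (thermal) can feed an LST and emissivity estimate, here is a minimal sketch of the widely used NDVI-threshold approach. It is not necessarily the method used with this dataset. The NDVI thresholds, the emissivity coefficients, the 10.895 µm effective wavelength for B10, and the assumption that B10 is already a brightness temperature in kelvin are all assumptions made for illustration.

```python
import math

def ndvi(red, nir):
    """Normalized Difference Vegetation Index from B4 (red) and B5 (NIR) reflectance."""
    return (nir - red) / (nir + red)

def proportion_of_vegetation(ndvi_value, ndvi_soil=0.2, ndvi_veg=0.5):
    """Fractional vegetation cover; the soil/vegetation thresholds are assumed values."""
    clipped = min(max(ndvi_value, ndvi_soil), ndvi_veg)
    return ((clipped - ndvi_soil) / (ndvi_veg - ndvi_soil)) ** 2

def emissivity(pv):
    """Surface emissivity from vegetation proportion (assumed empirical coefficients)."""
    return 0.004 * pv + 0.986

def land_surface_temperature(bt_kelvin, eps, wavelength_um=10.895):
    """Emissivity-corrected LST in kelvin from B10 brightness temperature."""
    rho = 14388.0  # h * c / k_B expressed in micrometre-kelvin
    return bt_kelvin / (1.0 + (wavelength_um * bt_kelvin / rho) * math.log(eps))

# Example pixel: hypothetical reflectance and brightness temperature values.
pixel_ndvi = ndvi(red=0.1, nir=0.5)
pixel_eps = emissivity(proportion_of_vegetation(pixel_ndvi))
pixel_lst = land_surface_temperature(bt_kelvin=300.0, eps=pixel_eps)
print(pixel_ndvi, pixel_eps, pixel_lst)
```

Because emissivity is below 1, the corrected LST comes out slightly above the brightness temperature. In practice the same functions would be applied per pixel over the raster bands.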
- Categories:
In this work, we downloaded the circRNA-drug sensitivity associations from the circRic database, whose drug sensitivity data come from the GDSC database. The data comprise 80,076 associations involving 404 circRNAs and 250 drugs.
- Categories:
This dataset is a collection of eye movement recordings captured during sleep, comprising more than 100 distinct episodes. The recordings are categorized into Rapid Eye Movement (REM), Slow Eye Movement (SEM), and non-movement phases, providing a resource for sleep research. Each video is recorded in high-definition .mp4 format so that subtle ocular dynamics are captured clearly.
- Categories:
The dataset consists of 4-channel EOG data recorded in two environments. The first category of data was recorded from 21 people using a driving simulator (1976 samples). The second category of data was recorded from 30 people in real-road conditions (390 samples).
All signals were acquired with JINS MEME ES_R smart glasses equipped with a 3-point EOG sensor. The sampling frequency is 200 Hz.
- Categories:
The dataset involves two groups of participants: twenty skilled drivers aged between 40 and 68, each with at least ten years of driving experience (class 1), and ten novice drivers aged between 18 and 46, who were taking driving lessons at a driving school at the time of recording (class 2).
The data was recorded using JINS MEME ES_R smart glasses by JINS, Inc. (Tokyo, Japan).
Each file contains the signals from a single ride.
- Categories: