Machine Learning

MBTI Augmented Dataset

This dataset extends the standard Myers-Briggs Type Indicator (MBTI) dataset, widely available on Kaggle, by incorporating advanced data augmentation techniques leveraging GPT-based Transformers. The augmentation addresses inherent class imbalance and data sparsity issues in the original dataset, significantly enriching the volume and diversity of textual samples while maintaining linguistic and contextual fidelity to the MBTI personality types.

Categories:: Artificial Intelligence
Machine Learning

387 Views

Autonomous Vehicle dataset for Anomaly detection

This dataset was developed using the MOBATSim simulator in MATLAB 2020b, designed to mimic real-world autonomous vehicle (AV) environments. It focuses on providing high-quality data for research in anomaly detection and cybersecurity, particularly addressing False Data Injection Attacks (FDIA). The dataset includes comprehensive sensor information, such as speed, rotational movements, positional coordinates, and labelled attack data, enabling supervised learning.

Categories:: Artificial Intelligence
Machine Learning
Security
Transportation

869 Views

Online Shoppers Purchasing Intention Dataset

The dataset consists of feature vectors belonging to 12,330 sessions. The dataset was formed so that each session would belong to a different user in a 1-year period to avoid any tendency to a specific campaign, special day, user profile, or period.
Of the 12,330 sessions in the dataset, 84.5% (10,422) were negative class samples that did not end with shopping, and the rest (1908) were positive class samples ending with shopping.
The dataset consists of 10 numerical and 8 categorical attributes. The 'Revenue' attribute can be used as the class label.

Categories:: Artificial Intelligence
Machine Learning
Financial

452 Views

Violence Detection in Campus Surveillance

The dataset was specifically created to address the need for violence detection in surveillance systems. It consists of self-recorded videos simulating different types of violent activities relevant to college environments. The dataset is organized into four distinct classes:

Slap

Punch

Kick

Group Violence

Others - Over Crowding, Loitering, Assault, Abuse

Each video is labeled according to its corresponding class to facilitate supervised learning for violence detection models.

Categories:: Artificial Intelligence
Education and Learning Technologies
Machine Learning

645 Views

BSAM DATASETS

All multimodal recommendation datasets used in the manuscript Enhancing Robustness and Generalization Capability for Multimodal Recommender Systems via Sharpness-Aware Minimization (BSAM), which includes five Amazon datasets. Each dataset includes both visual and textual modalities. Baby, Sports, Clothing, Pet, and Office from Amazon. All the datasets comprise textual and visual features in the form of item descriptions and images. Our data preprocessing methodology follows the approach outlined in the MMRec Framework.

Categories:: Machine Learning

16 Views

BSAM DATASETS

Categories:: Machine Learning

17 Views

Unified Comprehensive Freshness Classification Dataset (UC-FCD) for Diverse Food Categories

The proper evaluation of food freshness is critical to ensure safety, quality along with customer satisfaction in the food industry. While numerous datasets exists for individual food items,a unified and comprehensive dataset which encompass diversified food categories remained as a significant gap in research. This research presented UC-FCD, a novel dataset designed to address this gap.

Categories:: Agriculture
Artificial Intelligence
Machine Learning
Image Processing
Biomedical and Health Sciences
Computational Intelligence
Computer Vision

401 Views

Brain Tumor MRI Classification Dataset

Brain tumors are among the most severe and life-threatening conditions affecting both children and adults. They constitute approximately 85-90% of all primary Central Nervous System (CNS) tumors, with an estimated 11,700 new cases diagnosed annually. The 5-year survival rate for individuals with malignant brain or CNS tumors is alarmingly low, at 34% for men and 36% for women. Brain tumors are categorized into various types, including benign, malignant, and pituitary tumors.

Categories:: Machine Learning
Image Processing
Cancer Data
Medical Imaging
Brain
Computer Vision

595 Views

Repeated Route Naturalistic Driving Dataset (R2ND2)

Repeated Route Naturalistic Driving Dataset (R2ND2) is a dual-perspective dataset for driver behavior analysis constituent of vehicular data collected using task-specific CAN decoding sensors using OBD port and external sensors, and (b) gaze-measurements collected using industry-standard multi-camera gaze calibration and collection system. Our experiment is designed to consider the variability associated with driving experience that depends on the time of day and provides valuable insights into the correlation of these additional metrics on driver behavior.

Categories:: Artificial Intelligence
Signal Processing
Digital signal processing
Machine Learning
Sensors
Mechanical Sensing
Image Fusion
Standards Research Data
Transportation
Image Processing
Computer Vision
Transportation
Sensors

493 Views

Channel state information data of animal crossings on rural roads

This dataset is shared as part of the paper Towards scalable and low-cost WiFi sensing: preventing animal-vehicle collisions on rural roads, submitted to the IEEE Internet of Things Journal (IoT-J). It contains Wi-Fi Channel State Information (CSI) data from roadway crossings of small and large animals, persons and vehicles in rural environments.

Categories:: Artificial Intelligence
Wireless Networking
IoT
Machine Learning
Transportation
Ecology
Remote Sensing

224 Views

Machine Learning

Machine Learning

Pages