Machine Learning

The dataset is constructed using SUMO. It contains two road network datasets of different scales: a small-scale network (SR) and a larger regional network in Shenyang (SY). The dataset was constructed using the SUMO simulation platform, containing two road network datasets at different scales: a small-scale test network (SR) and a regional-level Shenyang network (SY). The SR network comprises 110 road segments, while the SY network contains 514 segments.

Categories:
5 Views

This dataset comprises high-resolution 3-axis accelerometer recordings collected from human participants performing distinct hand gestures, intended for training gesture-based assistive interfaces. Each participant’s raw motion signals are individually organized, enabling both user-specific and generalizable model development. The dataset includes time-series accelerometer data, along with a feature-augmented version containing extracted statistical and temporal descriptors such as RMS, Jerk, Entropy, and SMA. 

Categories:
7 Views

This data provides a comprehensive collection of air quality and meteorological data from several large cities in India. With 1,410 records, it includes key characteristics like the Air Quality Index (AQI) according to both U.S. and China standards, temperature, atmospheric pressure, humidity, wind speed, wind direction, and timestamps. By combining pollution concentrations with weather variables, the dataset facilitates better insight into the spatial and temporal patterns of urban air quality.

Categories:
96 Views

Fair Use for Academic Research: If you use this dataset, please cite the following paper to ensure proper attribution

M. A. Onsu, P. Lohan, B. Kantarci, A. Syed, M. Andrews, S. Kennedy, "Leveraging Multimodal-LLMs Assisted by Instance Segmentation for Intelligent Traffic Monitoring," 30th IEEE Symposium on Computers and Communications (ISCC), July 2025, Bologna, Italy.

 

 

Preprint available here: https://arxiv.org/pdf/2502.11304

 

Categories:
134 Views

This dataset includes conjunctival and retinal images collected from both diabetic and healthy individuals to support research on diabetes-related vascular changes. For each subject, eight conjunctival images (four per eye: looking left, right, up, and down) are provided. Subjects with diabetes additionally have corresponding left and right retinal fundus images. Metadata for diabetic participants includes classification into subgroups: diabetes only, diabetes with retinopathy, or diabetes with related complications such as hypertension.

Categories:
60 Views

<p><span style="font-family: 'Times New Roman'; font-size: medium;">This is the data set used for The Third International Knowledge Discovery and Data Mining Tools Competition, which was held in conjunction with KDD-99 The Fifth International Conference on Knowledge Discovery and Data Mining. The competition task was to build a network intrusion detector, a predictive model capable of distinguishing between ``bad'' connections, called intrusions or attacks, and ``good'' normal connections.

Categories:
18 Views

The use of technology in cricket has seen a significant increase in recent years, leading to overlapping computer vision-based research efforts. This study aims to extract front pitch view shots in cricket broadcasts by utilizing deep learning. The front pitch view (FPV) shots include ball delivery by the bowler and the stroke played by the batter. FPV shots are valuable for highlight generation, automatic commentary generation and bowling and batting techniques analysis. We classify each broadcast video frame as FPV and non-FPV using deep-learning models.

Categories:
92 Views

This dataset presents a curated collection of 9,000 English verbs annotated with normalized fuzzy values across four cognitive-behavioral quadrants of the BEET-M (Behavior Engagement Emotion Trigger Modes) model: Value & Credibility (NW)Relationship & Human Impact (NE)Process & Information (SE), and Time Urgency (SW).

Categories:
70 Views

Droidware is an Android malware dataset developed at the Cybersecurity Lab, GLA University, India. It comprises 253,527 applications, including 129,950 benign and 123,577 malicious samples. The dataset captures 68 features extracted from function call graphs, permissions, and Java source code, providing a comprehensive view of Android malware behavior. This latest and up-to-date dataset supports the training of AI-based malware detection models, aiding in the development of robust malware classification and threat mitigation strategies for cybersecurity research.

Categories:
46 Views

The Travel Recommendation Dataset is a comprehensive dataset designed for building and evaluating conversational recommendation systems in the travel domain. It includes detailed information about users, destinations, and ratings, enabling researchers and developers to create personalized travel recommendation models. The dataset supports use cases such as personalizing travel recommendations, analyzing user behavior, and training machine learning models for recommendation tasks.

Categories:
44 Views

Pages