Machine Learning

Wild-SHARD: Smartphone Sensor-based Human Activity Recognition Dataset in Wild

Wild-SHARD presents a novel Human Activity Recognition (HAR) dataset collected in an uncontrolled, real-world (wild) environment to address the limitations of existing datasets, which often need more non-simulated data. Our dataset comprises a time series of Activities of Daily Living (ADLs) captured using multiple smartphone models such as Samsung Galaxy F62, Samsung Galaxy A30s, Poco X2, One Plus 9 Pro and many more. These devices enhance data variability and robustness with their varied sensor manufacturers.

Categories:: Artificial Intelligence
Signal Processing
Machine Learning
Wearable Sensing
Health

551 Views

Spectral images - 8 varieties of maize seeds

This dataset consists of near-infrared spectral images of eight different varieties of corn seeds, classified as FH759, JL59，JY54，JY205， LH205，XX5， ZY2207， SY81. Each variety contains images of embryonic and endosperm surfaces, with 50 samples per image. The wavelength range is 881-1715 nm.

Categories:: Machine Learning
Image Processing

184 Views

Ti3D-contact, a high-resolution and whole-body dataset of hand-object contact area based on 3D scanning method

Hand contact data, reflecting the intricate behaviours of human hands during object operation, exhibits significant potential for analysing hand operation patterns to guide the design of hand-related sensors and robots, and predicting object properties. However, these potential applications are hindered by the constraints of low resolution and incomplete capture of the hand contact data.

Categories:: Artificial Intelligence
Signal Processing
Machine Learning

222 Views

Indian Road Driver’s View Dataset

Despite the existence of road image datasets, these datasets predominantly focus on European roads with less variability in traffic and road conditions. To address this limitation, we have developed an image dataset tailored to Indian road conditions, capturing the extensive variations in traffic and environment.

Categories:: Machine Learning
Transportation
Image Processing

314 Views

SinOCR and SinFUND - Sinhala OCR and Form Understanding Datasets

We present the SinOCR and SinFUND datasets, two comprehensive resources designed to advance Optical Character Recognition (OCR) and form understanding for the Sinhala language. SinOCR, the first publicly available and the most extensive dataset for Sinhala OCR to date, includes 100,000 images featuring printed text in 200 different Sinhala fonts and 1,135 images of handwritten text, capturing a wide spectrum of writing styles.

Categories:: Machine Learning
Computer Vision

487 Views

Binary Files for Analysis

The dataset is compiled from different versions of multiple projects across six architectures (ARM-32, ARM-64, MIPS-32, MIPS-64, X86-32, X86-64) and four compilation optimization levels (O0, O1, O2, O3), totaling 36,864 binary files. Each file corresponds to a specific combination of architecture and optimization level, providing a wide range of samples for analyzing and researching the properties and characteristics of binary files.

Categories:: Machine Learning

255 Views

Real-Time Adaptive Resource Allocation in 5G Network Slicing Using Hybrid Optimization and AI-driven Traffic Prediction

In the evolving landscape of 5G network, network slicing has been considered as a key technology for the realization of multiple virtual networks running on a shared physical infrastructure, each designed to fulfill a specific service or application. However, with such networks, the dynamic and real-time allocation of these resources remains a prime concern, particularly with respect to highly variable conditions of traffic.

Categories:: Artificial Intelligence
Wireless Networking
Machine Learning

46 Views

Rolling bearing fault diagnosis data set

The data set is from the Case Western Reserve University Rolling Bearing data set. SK6205 bearing located at the drive end is selected as the research object, and the acquisition frequency is 12KHz. The fault type is divided into three types, namely inner ring fault, rolling body fault and outer ring fault, and each fault type is divided into three fault sizes: 0.007, 0.014 and 0.021 inches.The length of each sample is 1024 and the repetition rate is 50%

Categories:: Artificial Intelligence
Machine Learning

449 Views

Reliability feature of Java programs

The dataset is based on the latent faults detected by the popular OSS static code analysis tool, sonarQube Community Edition. The dataset is populated using the latent faults found in popular Java software from the open source repository GitHub . This dataset was specifically developed to identify the significant latent faults that affect the reliability of Java programs. This dataset can be used in its current form to conduct experiments with machine learning algorithms and to infer new reliability characteristics of Java programs. Please refer to the documents associated with sona

Categories:: Machine Learning

7 Views

Database of Simulated Driver Behaviors Using the SUMO Simulator

Classifying the driving styles is of particular interest for enhancing road safety in smart cities. The vehicle can assist the driver by providing advice to increase awareness of potential dangers. Accordingly, dissuasive measures, such as adjusting insurance costs, can be implemented. The service is called Pay-As-You-Drive insurance (PAYD), and to address it, the paper introduces a method for constructing a database of simulated driver behaviors using the Simulation of Urban MObility Simulation of Urban MObility (SUMO) simulator.

Categories:: Machine Learning
Transportation

316 Views

Machine Learning

Machine Learning

Pages