Machine Learning
Hand contact data, reflecting the intricate behaviours of human hands during object operation, exhibits significant potential for analysing hand operation patterns to guide the design of hand-related sensors and robots, and predicting object properties. However, these potential applications are hindered by the constraints of low resolution and incomplete capture of the hand contact data.
- Categories:
Despite the existence of road image datasets, these datasets predominantly focus on European roads with less variability in traffic and road conditions. To address this limitation, we have developed an image dataset tailored to Indian road conditions, capturing the extensive variations in traffic and environment.
- Categories:
We present the SinOCR and SinFUND datasets, two comprehensive resources designed to advance Optical Character Recognition (OCR) and form understanding for the Sinhala language. SinOCR, the first publicly available and the most extensive dataset for Sinhala OCR to date, includes 100,000 images featuring printed text in 200 different Sinhala fonts and 1,135 images of handwritten text, capturing a wide spectrum of writing styles.
- Categories:
The dataset is compiled from different versions of multiple projects across six architectures (ARM-32, ARM-64, MIPS-32, MIPS-64, X86-32, X86-64) and four compilation optimization levels (O0, O1, O2, O3), totaling 36,864 binary files. Each file corresponds to a specific combination of architecture and optimization level, providing a wide range of samples for analyzing and researching the properties and characteristics of binary files.
- Categories:
In the evolving landscape of 5G network, network slicing has been considered as a key technology for the realization of multiple virtual networks running on a shared physical infrastructure, each designed to fulfill a specific service or application. However, with such networks, the dynamic and real-time allocation of these resources remains a prime concern, particularly with respect to highly variable conditions of traffic.
- Categories:
The data set is from the Case Western Reserve University Rolling Bearing data set. SK6205 bearing located at the drive end is selected as the research object, and the acquisition frequency is 12KHz. The fault type is divided into three types, namely inner ring fault, rolling body fault and outer ring fault, and each fault type is divided into three fault sizes: 0.007, 0.014 and 0.021 inches.The length of each sample is 1024 and the repetition rate is 50%
- Categories:
The dataset is based on the latent faults detected by the popular OSS static code analysis tool, sonarQube Community Edition. The dataset is populated using the latent faults found in popular Java software from the open source repository GitHub . This dataset was specifically developed to identify the significant latent faults that affect the reliability of Java programs. This dataset can be used in its current form to conduct experiments with machine learning algorithms and to infer new reliability characteristics of Java programs. Please refer to the documents associated with sona
- Categories:
Classifying the driving styles is of particular interest for enhancing road safety in smart cities. The vehicle can assist the driver by providing advice to increase awareness of potential dangers. Accordingly, dissuasive measures, such as adjusting insurance costs, can be implemented. The service is called Pay-As-You-Drive insurance (PAYD), and to address it, the paper introduces a method for constructing a database of simulated driver behaviors using the Simulation of Urban MObility Simulation of Urban MObility (SUMO) simulator.
- Categories:
This dataset consists of 462 field of views of Giemsa(dye)-stained and field(dye)-stained thin blood smear images acquired using an iPhone 10 mobile phone with a 12MP camera. The phone was attached to an Olympus microscope with 1000× objective lens. Half of the acquired images are red blood cells with a normal morphology and the other half have a Rouleaux formation morphology.
- Categories:
This dataset comprises 1718 annotated images extracted from 29 video clips recorded during Endoscopic Third Ventriculostomy (ETV) procedures, each captured at a frame rate of 25 FPS. Out of these images, 1645 are allocated for the training set, while the remainder is designated for the testing set. The images contain a total of 4013 anatomical or intracranial structures, annotated with bounding boxes and class names for each structure. Additionally, there are at least three language descriptions of varying technicality levels provided for each structure.
- Categories: