Machine Learning

Description

This data set contains 100,000 pcd files taken by LiDAR, a 3-D image sensor, of a vehicle orbiting an indoor field.

Data Acquisition

The indoor field was built as a 1/60 scale model of an intersection, where two vehicles kept moving along pre-fixed tracks independently of each other.

The size of the vehicles was 0.040 m  × 0.035 m × 0.240 m 

We captured the indoor field by two LiDAR sensor units, which was commercialized by Velodyne.

Categories:
1447 Views

Human Activity Recognition (HAR) is the process of handling information from sensors and/or video capture devices under certain circumstances to correctly determine human activities. Nowadays, several simple and automatic HAR methods based on sensors and Artificial Intelligence platforms can be easily implemented.

In this challenge, participants are required to determine the nurse care daily activities by utilizing the accelerometer data collected from the smartphone, which is the cheapest and easy-to-implement way in real life.

Last Updated On: 
Wed, 06/30/2021 - 21:50
Citation Author(s): 
Sayeda Shamma Alia, Kohei Adachi, Paula Lago, Le Nhat Tan, Haru Kaneko, Sozo Inoue

The dataset consists of 751 videos, each containing the performance one of the handball actions out of 7 categories (passing, shooting, jump-shot, dribbling, running, crossing, defence). The videos were manually extracted from longer videos recorded in handball practice sessions. 

Categories:
1301 Views

Of late, efforts are underway to build computer-assisted diagnostic tools for cancer diagnosis via image processing. Such computer-assisted tools require capturing of images, stain color normalization of images, segmentation of cells of interest, and classification to count malignant versus healthy cells. This dataset is positioned towards robust segmentation of cells which is the first stage to build such a tool for plasma cell cancer, namely, Multiple Myeloma (MM), which is a type of blood cancer. The images are provided after stain color normalization.

Categories:
6462 Views

We introduce a novel dataset specifically designed for the evaluation of “alarm flood classification” (AFC) methods within process plants. The growing complexity of industrial systems and the heightened demands for operational safety and efficiency underscore the critical need for advanced diagnostic tools capable of handling alarm floods—situations where numerous alarms are triggered simultaneously.

Categories:
221 Views

Passwords that were leaked or stolen from sites. The Rockyou Dataset is about 14 million passwords.

Categories:
1399 Views

Mother’s Significant Feature (MSF) Dataset has been designed to provide data to researchers working towards woman and child health betterment. MSF dataset records are collected from the Mumbai metropolitan region in Maharashtra, India. Women were interviewed just after childbirth between February 2018 to March 2021. MSF comprise of 450 records with a total of 130 attributes consisting of mother’s features, father’s features and health outcomes. A detailed dataset is created to understand the mother’s features spread across three phases of her reproductive age i.e.

Categories:
2228 Views

The dataset used in the paper "A Deep Learning Approach for Segmentation, Classification and Visualization of 3D High Frequency Ultrasound Images of Mouse Embryos" is provided here. It contains both the segmentation and classification images with manual labels. 

Categories:
673 Views

Dataset used in the article "The Reverse Problem of Keystroke Dynamics: Guessing Typed Text with Keystroke Timings". Source data contains CSV files with dataset results summaries, false positives lists, the evaluated sentences, and their keystroke timings. Results data contains training and evaluation ARFF files for each user and sentence with the calculated Manhattan and euclidean distance, R metric, and the directionality index for each challenge instance.

Categories:
653 Views

Expanding our knowledge of small molecules beyond what is known in nature or designed in wet laboratories promises to significantly advance drug discovery, biotechnology, and material science. Computing novel small molecules with specific structural and functional properties is non-trivial, primarily due to the size, dimensionality, and multi-modality of the corresponding search space. Deep generative models that learn directly from data without the need for domain insight are recently providing a way forward.

Categories:
379 Views

Pages