Machine Learning

CO2 dataset

We obtained 6 million instances to be used as an analysis for modelling CO2 behavior. The Data Logging and sensors nodes acquisition are every 1 second.

Categories:: Artificial Intelligence
IoT
Machine Learning
Standards Research Data

640 Views

Pedestrian detection has never been an easy task for computer vision and automotive industry. Systems like the advanced driver assistance system (ADAS) highly rely on far infrared (FIR) data captured to detect pedestrians at nighttime. The recent development of deep learning-based detectors has proven the excellent results of pedestrian detection in perfect weather conditions. However, it is still unknown what is the performance in adverse weather conditions.

Categories:: Computer Vision
Image Processing
Machine Learning
Transportation

2272 Views

COSINE: Cellular cOmmunication SIgNal datasEt

This dataset covers cellular communication signals in the SCF format. There is a total of 60000 signal instances, 36000 of them are reserved as training data and the rest is for the test. The SNR levels are between 1 dB and 15 dB.

Categories:: Machine Learning
Communications
Signal Processing

5022 Views

Endoscopy Disease Detection and Segmentation (EDD2020)

Endoscopy is a widely used clinical procedure for the early detection of cancers in hollow-organs such as oesophagus, stomach, and colon. Computer-assisted methods for accurate and temporally consistent localisation and segmentation of diseased region-of-interests enable precise quantification and mapping of lesions from clinical endoscopy videos which is critical for monitoring and surgical planning. Innovations have the potential to improve current medical practices and refine healthcare systems worldwide.

Artificial Intelligence

Image Processing

Computer Vision

Machine Learning

Biomedical and Health Sciences

Medical Imaging

Submitted On:

Wed, 01/15/2020 - 11:24

Last Updated On:

Sat, 02/27/2021 - 05:11

defect detection dataset

The data set includes three sub-data sets, namely the DAGM2007 data set, the ground crack data set, and the Yibao bottle cap defect data set, which are divided into a training set and a test set, in which the positive and negative samples are unbalanced.

Categories:: Artificial Intelligence
Computer Vision
Machine Learning

2215 Views

ExoNet: Egocentric Images of Walking Environments

Computer vision can be used for environment-adaptive control of robotic exoskeletons and prostheses. However, small-scale and private training datasets have impeded the development of image classification algorithms (e.g., convolutional neural networks) to recognize the walking environment. To address these limitations, we developed ExoNet, a large-scale dataset of wearable camera images (i.e., egocentric perception) of real-world walking environments.

Categories:: Artificial Intelligence
Machine Learning
Computer Vision

5887 Views

WIKIBIOCN

A Chinese dataset for table-to-text generation named WIKIBIOCN which inculeds 33,244 biography sentences with related tables from Chinese Wikipedia (July 2018).

The dataset is divided into training set (30,000), verification set (1000) and test set (2,244).

Categories:: Artificial Intelligence
Machine Learning

170 Views

A Time-Scale Modification Dataset with Subjective Quality Labels

Time Scale Modification (TSM) is a well-researched field; however, no effective objective measure of quality exists. This paper details the creation, subjective evaluation, and analysis of a dataset for use in the development of an objective measure of quality for TSM. Comprised of two parts, the training component contains 88 source files processed using six TSM methods at 10 time scales, while the testing component contains 20 source files processed using three additional methods at four time scales.

Categories:: Machine Learning
Signal Processing
Discrete-time signal processing
Digital signal processing

664 Views

Border Gateway Protocol (BGP) routing records from Reseaux IP Europeens (RIPE) and BCNET

Five well-known Border Gateway Anomalies (BGP) anomalies:
WannaCrypt, Moscow blackout, Slammer, Nimda, Code Red I, occurred in May 2017, May 2005, January 2003, September 2001, and July 2001, respectively.
The Reseaux IP Europeens (RIPE) BGP update messages are publicly available from the Network Coordination Centre (NCC) and contain:
WannaCrypt, Moscow blackout, Slammer, Nimda, Code Red I, and regular data: https://www.ripe.net/analyse/.

Categories:: Machine Learning
Communications
Security
Other

1338 Views

Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model

Since there is no image-based personality dataset, we used the ChaLearn dataset for creating a new dataset that met the characteristics we required for this work, i.e., selfie images where only one person appears and his face is visible, labeled with the person's apparent personality in the photo.

Categories:: Artificial Intelligence
Image Processing
Machine Learning

3559 Views