Machine Learning

WIKIBIOCN

WIKIBIOCN is a Chinese dataset for table-to-text generation. It includes 33,244 biography sentences with their related tables from Chinese Wikipedia (July 2018).

The dataset is divided into a training set (30,000), a verification set (1,000), and a test set (2,244).
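The split sizes above add up to the full 33,244 examples. As a minimal sketch, assuming a seeded random shuffle (the listing does not say how the examples were actually assigned to each set), a partition with these sizes could be produced as follows. The function name `split_indices` and the seed value are hypothetical, chosen for illustration only.

```python
import random

# Sizes taken from the dataset description above.
TOTAL_EXAMPLES = 33_244
SPLIT_SIZES = {"train": 30_000, "verification": 1_000, "test": 2_244}


def split_indices(total, sizes, seed=0):
    """Partition range(total) into disjoint named splits of the given sizes.

    Assumption: a seeded random shuffle. The real WIKIBIOCN assignment
    procedure is not described in the listing.
    """
    if sum(sizes.values()) != total:
        raise ValueError("split sizes must add up to the total")
    indices = list(range(total))
    random.Random(seed).shuffle(indices)
    splits = {}
    start = 0
    for name, size in sizes.items():
        splits[name] = indices[start:start + size]
        start += size
    return splits


splits = split_indices(TOTAL_EXAMPLES, SPLIT_SIZES)
for name, idx in splits.items():
    print(name, len(idx))
```

The size check at the top guards against a mismatch between the stated total and the stated split sizes.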

Categories:: Artificial Intelligence
Machine Learning

170 Views

A Time-Scale Modification Dataset with Subjective Quality Labels

Time Scale Modification (TSM) is a well-researched field; however, no effective objective measure of quality exists. This paper details the creation, subjective evaluation, and analysis of a dataset for use in the development of an objective measure of quality for TSM. The dataset comprises two parts: the training component contains 88 source files processed using six TSM methods at 10 time scales, while the testing component contains 20 source files processed using three additional methods at four time scales.
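The number of processed files implied by this description can be worked out directly. The sketch below assumes a full factorial design, meaning every source file is processed by every method at every time scale; the description suggests this but does not state it outright. The helper name `processed_file_count` is hypothetical.

```python
def processed_file_count(source_files, methods, time_scales):
    """Files produced if every source file is processed by every
    method at every time scale (full factorial assumption)."""
    return source_files * methods * time_scales


# Counts taken from the dataset description above.
train_files = processed_file_count(source_files=88, methods=6, time_scales=10)
test_files = processed_file_count(source_files=20, methods=3, time_scales=4)

print(train_files)  # 5280
print(test_files)   # 240
```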

Categories:: Machine Learning
Signal Processing
Discrete-time signal processing
Digital signal processing

669 Views

Border Gateway Protocol (BGP) routing records from Reseaux IP Europeens (RIPE) and BCNET

Five well-known Border Gateway Protocol (BGP) anomalies, WannaCrypt, Moscow blackout, Slammer, Nimda, and Code Red I, occurred in May 2017, May 2005, January 2003, September 2001, and July 2001, respectively.
The Reseaux IP Europeens (RIPE) BGP update messages are publicly available from the Network Coordination Centre (NCC) and contain WannaCrypt, Moscow blackout, Slammer, Nimda, Code Red I, and regular data: https://www.ripe.net/analyse/.

Categories:: Machine Learning
Communications
Security
Other

1346 Views

Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model

Since there is no image-based personality dataset, we used the ChaLearn dataset to create a new dataset that meets the characteristics required for this work: selfie images in which only one person appears, with the face visible, labeled with that person's apparent personality in the photo.

Categories:: Artificial Intelligence
Image Processing
Machine Learning

3580 Views

Performance of Congestion Control Algorithms on High-speed Railway Scenario

Measurement data and simulated data on the performance of Hd-TCP and the algorithms it is compared against in a real high-speed railway scenario.

Categories:: Machine Learning
Communications

533 Views

Intrusion Detection in CAN bus

These datasets are used to detect intrusions on the Controller Area Network (CAN) bus. Intrusions are detected using various machine learning and deep learning algorithms.

Categories:: Artificial Intelligence
Machine Learning

2607 Views

turbine data for IEEE Access 20191224

monitoring, processing and prediction data

Categories:: Machine Learning
Standards Research Data
Signal Processing

424 Views

ArPC: a corpus for paraphrase identification in Arabic text

ArPC is an Arabic paraphrase identification corpus. It consists of 1,331 sentence pairs, each with a binary score that indicates whether the pair is a paraphrase or not. The corpus was manually annotated by three native Arabic speakers.

Categories:: Artificial Intelligence
Machine Learning

761 Views

pmmw_data.rar

The passive millimeter wave (PMMW) real-time imager, SAIR-U, was developed by the Microwave Laboratory of Beihang University, China. It could be (or has been) used in non-contact, non-cooperative (i.e., no fixed posture required) security screening, especially in environments with large passenger flow. This is the dataset used in the experiment in the paper "Real-time Concealed Object Detection from Passive Millimeter Wave Images Based on YOLOv3 Algorithm".

Categories:: Machine Learning

262 Views

pmmw_data.rar

Categories:: Machine Learning

248 Views
