*.csv (zip)

We propose a driver pattern dataset consists of 51 features extracted from CAN (Controller Area Network) of Hyundai YF Sonata while four drivers drove city roads of Seoul, Republic of Korea. Under the belief that different driving patterns implicitly exist at CAN data, we collected CAN diagnosis data from four drivers in pursuit of research on driver identification, driver profiling, and abnormal driving behavior detection. Four drivers are named A, B, C, and D.

Categories:
2273 Views

This dataset represents the main different unique learning behaviors that may be found in any group of learners in e-learning/educational systems. It represents 20 learners through 17 OERs.

Categories:
1661 Views

This dataset includes 24,201,654 tweets related to the US Presidential Election on November 3, 2020, collected between July 1, 2020, and November 11, 2020. The related party name and sentiment scores of tweets, also the words that affect the score were added to the data set.

Categories:
6542 Views

Large p small n problem is a challenging problem in big data analytics. There are no de facto standard methods available to it. In this study, we propose a tensor decomposition (TD) based unsupervised feature extraction (FE) formalism applied to multiomics datasets, where the number of features is more than 100000 while the number of instances is as small as about 100.

Categories:
457 Views

We build an original dataset of thermal videos and images that simulate illegal movements around the border and in protected areas and are designed for training machines and deep learning models. The videos are recorded in areas around the forest, at night, in different weather conditions – in the clear weather, in the rain, and in the fog, and with people in different body positions (upright, hunched) and movement speeds (regu- lar walking, running) at different ranges from the camera.

Categories:
9587 Views

Coventry-2018 is a human activity recognition dataset captured by three Panasonic® Grid-EYE (AMG8833) infrared sensors in March 2018. The Grid-EYE sensors represent a 60 field of view scene by an 8 × 8 array named frame. The data streams are synchronized to 10 frames per second and saved as *.csv recordings using the LabVIEW® software. Two layouts are considered in this dataset with different geometry sizes: 1) small layout; and 2) large layout.

Categories:
1605 Views

Presented here is a dataset used for our SCADA cybersecurity research. The dataset was built using our SCADA system testbed described in our paper below [*]. The purpose of our testbed was to emulate real-world industrial systems closely. It allowed us to carry out realistic cyber-attacks.

 

Categories:
1944 Views

Message Queuing Telemetry Transport (MQTT) protocol is one of the most used standards used in Internet of Things (IoT) machine to machine communication. The increase in the number of available IoT devices and used protocols reinforce the need for new and robust Intrusion Detection Systems (IDS). However, building IoT IDS requires the availability of datasets to process, train and evaluate these models. The dataset presented in this paper is the first to simulate an MQTT-based network. The dataset is generated using a simulated MQTT network architecture.

Categories:
19834 Views

This dataset mainly consists 1) source codes of wide-attention and deep model (WADC); 2) datasets to evaluate the performance of the proposed model. Datasets are obtained from the Caltrans Performance Measurement System (CPeMS) http://pems.doc.ca.gov; and Fremont Bridge Bicycle Counter (FBBC), https://data.seattle.gov.

 

Categories:
1665 Views

Pages