The dataset has been developed in Smart Connected Vehicles Innovation Centre (SCVIC) of the University of Ottawa in Kanata North Technology Park.

In order to define a benchmark for Machine Learning (ML)-based Advanced Persistent Threat (APT) detection in the network traffic, we create a dataset named SCVIC-APT-2021, that can realistically represent the contemporary network architecture and APT characteristics.

The original article where this work was initially presented is as follows:


Healthcare systems are capable of collecting a significant number of patient health-related parameters. Analyzing them to find the reasons that cause a given disease is challenging. Feature Selection techniques have been used to address this issue---reducing these parameters to a smaller set with the most "determinant" information. However, existing proposals usually focus on classification problems---aimed to detect whether a person is or is not suffering from an illness or from a finite set of illnesses.


Bitcoin (BTC), ether (ETH), gridcoin (GRC), curecoin (CURE), and foldingcoin (FLDC) market capitalizations in USD.


This data set contains data collected from an overhead crane ( OPC UA server when driving an L-shaped path with different loads (0kg, 120kg, 500kg, and 1000kg). Each driving cycle was driven with an anti-sway system activated and deactivated. Each driving cycle consisted of repeating five times the process of lifting the weight, driving from point A to point B along with the path, lowering the weight, lifting the weight, driving back to point A, and lowering the weight.


The data we are providing this time is a part of the dataset which was used in our previous work, titled “Integrating Activity Recognition and Nursing Care Records: The System, Deployment, and a Verification Study”. The authors of this work proposed a theory that extending of start and end times of the activities can increase the prediction rate. The reason behind the theory is that many of the nurses provided the labels before or after completing an activity. In the paper, they verified and proved this theory.

Last Updated On: 
Wed, 06/01/2022 - 00:00

Anonymous network traffic is more pervasive than ever due to the accessibility of services such as virtual private networks (VPN) and The Onion Router (Tor). To address the need to identify and classify this traffic, machine and deep learning solutions have become the standard. However, high-performing classifiers often scale poorly when applied to real-world traffic classification due to the heavily skewed nature of network traffic data.


This dataset contains the raw data of the measurements/simulations presented in "Modulation Scheme Analysis for Low-Power Leadless Pacemaker Synchronization Based on Conductive Intracardiac Communication" by A. Ryser et al. This work analyzed the bit error rate (BER) performance of a prototype dual-chamber leadless pacemaker both in simulation and in-vitro experiments on porcine hearts.


This is the market data of Bitcoin in terms of price and volume from August 2015 to August 2021. The time interval of sampling is selected as four-hour, that is to say, we choose every kind of price and volume every of four-hour as the original data. The original market data of Bitcoin are obtained from Poloniex, one of the most active crypto-asset exchanges.
Download link on XBlock:


Our data set has 5136 records collected in 214 days. The sampling rate of the sensors is 1 hour. Each record includes the number of vehicles entering and leaving the parking lot in an hour, the CO2 concentration of every building floor at the recording time, and the power consumption of each floor in an hour.


This dataset complies 172 chipless RFID measurements that have been reported in the literature from 2005-2022. The dataset contain the year, reading distance, frequency range, tag type, reader setup, and reference for each entry.