*.csv
The dataset has been developed in Smart Connected Vehicles Innovation Centre (SCVIC) of the University of Ottawa in Kanata North Technology Park.
In order to define a benchmark for Machine Learning (ML)-based Advanced Persistent Threat (APT) detection in the network traffic, we create a dataset named SCVIC-APT-2021, that can realistically represent the contemporary network architecture and APT characteristics. Please cite the following original article where this work was initially presented:
- Categories:
Research in Natural Language Processing (NLP) and computational linguistics highly depends on a good quality representative corpus of any specific language. Bangla is one of the most spoken languages in the world but Bangla NLP research is in its early stage of development due to the lack of quality public corpus. This article describes the detailed compilation methodology of a comprehensive monolingual Bangla corpus, KUMono (Khulna University Monolingual corpus).
- Categories:
Healthcare systems are capable of collecting a significant number of patient health-related parameters. Analyzing them to find the reasons that cause a given disease is challenging. Feature Selection techniques have been used to address this issue---reducing these parameters to a smaller set with the most "determinant" information. However, existing proposals usually focus on classification problems---aimed to detect whether a person is or is not suffering from an illness or from a finite set of illnesses.
- Categories:
Bitcoin (BTC), ether (ETH), gridcoin (GRC), curecoin (CURE), and foldingcoin (FLDC) market capitalizations in USD.
- Categories:
We constructed datasets by extracting different features from Android Apk files, including permissions (official definition and customization), APIs and vulnerabilities. The datasets can be used for malware detection.
- Categories:
This data set contains data collected from an overhead crane (https://doi.org/10.1109/WF-IoT.2018.8355217) OPC UA server when driving an L-shaped path with different loads (0kg, 120kg, 500kg, and 1000kg). Each driving cycle was driven with an anti-sway system activated and deactivated. Each driving cycle consisted of repeating five times the process of lifting the weight, driving from point A to point B along with the path, lowering the weight, lifting the weight, driving back to point A, and lowering the weight.
- Categories:
Anonymous network traffic is more pervasive than ever due to the accessibility of services such as virtual private networks (VPN) and The Onion Router (Tor). To address the need to identify and classify this traffic, machine and deep learning solutions have become the standard. However, high-performing classifiers often scale poorly when applied to real-world traffic classification due to the heavily skewed nature of network traffic data.
- Categories:
This dataset contains the raw data of the measurements/simulations presented in "Modulation Scheme Analysis for Low-Power Leadless Pacemaker Synchronization Based on Conductive Intracardiac Communication" by A. Ryser et al. This work analyzed the bit error rate (BER) performance of a prototype dual-chamber leadless pacemaker both in simulation and in-vitro experiments on porcine hearts.
- Categories:
This is the market data of Bitcoin in terms of price and volume from August 2015 to August 2021. The time interval of sampling is selected as four-hour, that is to say, we choose every kind of price and volume every of four-hour as the original data. The original market data of Bitcoin are obtained from Poloniex, one of the most active crypto-asset exchanges.
Download link on XBlock: http://xblock.pro/#/dataset/5
- Categories: