Nowadays, with the rapid increase in the number of applications and networks, the number of cyber multi-step attacks has been increasing exponentially. Thus, the need for a reliable and acceptable Intrusion Detection System (IDS) solution is becoming urgent to protect the networks and devices. However, implementing a robust IDS needs a reliable and up-to-date dataset in order to capture the behaviors of the new types of attacks, especially multi-step attacks. In this work, a new benchmark Multi-Step Cyber-Attack Dataset (MSCAD) is introduced.

Categories:
206 Views

A very large Bangla monolingual corpus consisting of more than 350 million tokens

Categories:
14 Views

Question Answer Pair dataset for Sentiment Analysis Tasks.

Categories:
37 Views

Air travel is one of the most used ways of transit in our daily lives. So it's no wonder that more and more people are sharing their experiences with airlines and airports using web-based online surveys. This dataset aims to do topic modeling and sentiment analysis on Skytrax (airlinequality.com) and Tripadvisor (tripadvisor.com) postings where there is a lot of interest and engagement from people who have used it or want to use it for airlines.

Categories:
123 Views

This dataset is used to illustrate an application of the "klm-based profiling and preventing security attack (klm-PPSA)" system. The klm-PPSA system is developed to profile, detect, and then prevent known and/or unknown security attacks before a user access a cloud. This dataset was created based on “a.patrik” user logical attempts scenarios when accessing his cloud resources and/or services. You will find attached the CSV file associated with the resulted dataset. The dataset contains 460 records of 13 attributes (independent and dependent variables).

Categories:
126 Views

The C3I Thermal Automotive Dataset provides > 35,000 distinct frames along with annotated thermal frames for the development of smart thermal perception system/ object detection system that will enable the automotive industry and researchers to develop safer and more efficient ADAS and self-driving car systems. The overall dataset is acquired, processed, and open-sourced in challenging weather and environmental scenarios. The dataset is recorded from a lost-cost yet effective 640x480 uncooled LWIR thermal camera.

Categories:
483 Views

“DCA-IoMT Dataset” belongs to the research article entitled “DCA-IoMT: Knowledge Graph Embedding-enhanced Deep Collaborative Alerts-recommendation against COVID19 (DOI: 10.1109/TII.2022.3159710)” accepted for publication in the Journal of IEEE Transactions on Industrial Informatics.

Categories:
376 Views

Bangla is one of the most spoken languages in the world but Bangla NLP research is in its early stage of development due to the lack of quality public corpus. In this article, we describe the detailed compilation methodology of a comprehensive monolingual Bangla corpus, KUMono. Thiscorpus consists of more than 353 million word tokens in total as well as more than one million unique tokens from 18 major text categories of online Bangla websites.

Categories:
75 Views

Abstract (for details, see https://osf.io/e4rvz/)

Last Updated On: 
Thu, 01/27/2022 - 18:19
Citation Author(s): 
Ji-Ping Lin

Pages