Machine Learning
DALHOUSIE NIMS LAB BENIGN DATASET 2024-2 dataset comprises data captured from Consumer IoT devices, depicting three primary real-life states (Power-up, Idle, and Active) experienced by everyday users. Our setup focuses on capturing realistic data through these states, providing a comprehensive understanding of Consumer IoT devices.
The dataset comprises of nine popular IoT devices namely
Amcrest Camera
Smarter Coffeemaker
Ring Doorbell
Amazon Echodot
Google Nestcam
Google Nestmini
Kasa Powerstrip
- Categories:
To download the dataset without purchasing an IEEE Dataport subscription, please visit: https://zenodo.org/records/13738598
Please cite the following paper when using this dataset:
- Categories:
Resource usage fuzzing samples and related data. Contains samples from Pythoin, random data, GPT-3.5, GPT-4, Gemini-1.0, Claude Instant, and Claude Opus. These samples are generated for 50 Python functions. Also included are resource measures for CPU time, instruction count, function calls, peak RAM usage, final RAM allocated, and coverage. These values were collected on an isolated system and account for interference from other processes.
- Categories:
The limited availability of Guitar notes datasets hinders the training of any artificial intelligence model in this field. TaptoTab dataset aims to fill this gap by providing a collection of notes recordings. This dataset is collected as part of an honours project at the Faculty of Computer and Information Sciences, Ain Shams University. The dataset is composed of audio data that has been self-collected, focusing on capturing a comprehensive range of guitar notes. The dataset consists of recordings of guitar notes played on each of the six strings, covering up to the 12th fret.
- Categories:
The data in this dataset is the experimental data related to the article named Privacy-preserving approach to edge federated learning based on blockchain and fully homomorphic encryption , which contains data such as running time comparison, communication spend comparison, encryption and decryption time comparison, accuracy comparison, etc.
- Categories:
Brain-Computer Interface (BCI) technology facilitates a direct connection between the brain and external devices by interpreting neural signals. It is critical to have datasets that contain patient's native languages while developing BCI-based solutions for neurological disorders. However, present BCI research lacks appropriate language-specific datasets, particularly for languages such as Telugu, which is spoken by more than 90 million people in India.
- Categories:
This study investigates the integration of artificial intelligence (AI) to enhance endpoint management solutions. The research explores AI's impact on security, efficiency, and compliance within enterprise environments (R1). Through case studies and empirical analysis, the paper highlights the benefits and challenges of such integrations, offering insights into future developments.
- Categories:
The "Multilabel Extremism Classification Tweets Dataset" dataset contains user comments annotated with labels including toxic, severe toxic, obscene, threat, insult, and identity hate. Designed for multi-label classification, this dataset is valuable for researchers focused on detecting online extremism and toxicity across multiple languages. It enables the development of NLP models for content moderation, hate speech detection, and extremism identification.
- Categories:
The "Multi-Label Extremism and Jihadism Classification Tweets Dataset" dataset is a multilingual resource designed for multi-label classification of online extremism and toxic behavior, including extremism and jihadism. Each comment is annotated with labels indicating the presence of various extremism traits: toxic, severe toxic, obscenity, threats, insults, identity hate, and jihadi content.
- Categories: