Computational Intelligence

Mpox Narrative on Instagram: A Labeled Multilingual Dataset of Instagram Posts on Mpox for Sentiment, Hate Speech, and Anxiety Analysis

To download the dataset without purchasing an IEEE Dataport subscription, please visit: https://zenodo.org/records/13738598

Please cite the following paper when using this dataset:

N. Thakur, “Mpox narrative on Instagram: A labeled multilingual dataset of Instagram posts on mpox for sentiment, hate speech, and anxiety analysis,” arXiv [cs.LG], 2024, URL: https://arxiv.org/abs/2409.05292

Abstract

Categories:

Indonesian Toxic Speech Dataset (IndoToxSpeech)

This dataset contains audio recordings and transcriptions of toxic speech derived from Indonesian conversations during YouTube videos where scammers are confronted. The dataset captures two separate interactions that escalate into toxic exchanges. Each interaction has been verified by native Indonesian speakers and labeled into two classes: toxic and non-toxic. The dataset includes both the original and preprocessed versions of the speech and text data. The original speech files total 136MB, while the preprocessed speech files are 111,7MB.

Categories:

Transforming Workplace efficiency with Autonomous Endpoint Management and Enhancing the Digital Employee Experience

Abstract

Categories:

Dataset for Unveiling Viral Escape Mechanisms with Machine Learning: A Transformative Approach to Mutation Analysis for SARS-CoV-2 and Beyond

Persistent viruses like influenza, HIV, Coronavirus exemplify the challenge of viral escape, significantly hindering the development of long-lasting vaccines and effective treatments. This study leverages a Long Short-Term Memory (LSTM) based deep learning architecture to analyze an extensive dataset of over 3.1 million unique viral spike protein sequences, with SARS-CoV-2 serving as the primary example. Our model, Escape Elite Network(EEN) outperforms existing methods in detecting escape mutations across diverse datasets.

Categories:

Multi-target regression datasets

Mulan , a sourceforge net multi-target dataset available in www.openml.org. Despite the numerous interesting applications of MTR, there are only few publicly available datasets of this kind - perhaps because most applications are industrial - and most experimental evaluations of MTR methods are based on a limited amount of datasets. For this study, much effort was made for the composition of a large and diverse collection of benchmark MTR datasets.

Categories:

Responsible Medical Corpus for Maternal Sexual and Reproductive Health

Maternal, sexual and reproductive healthcare (MSRH) are sensitive urgent public health issues that require timely trustworthy authentic medical responses. Unfortunately, curative healthcare systems of Low Middle-Income Countries (LMICs) are insufficiently responsive to such healthcare needs. Such needs vary among social groups often founded on social inequalities like income, gender and education.

Categories:

CAN Communication LOG based on IEC62228-2019 with EFT Injection

Modern automotive embedded systems include a large number of electronic control units (ECU) responsible for managing sophisticated systems such as engine control, ABS brake systems, traction control, and power steering systems. To ensure the reliability and effectiveness of these functions, it is essential to apply rigorous test approaches and standards. The integration of diagnostic functions in automotive embedded systems demands consistent tests and a detailed analysis of data.

Categories:

The Bengaluru Mobility Challenge, 2024

Bengaluru has been ranked the most congested city in India in terms of traffic for several years now. This hackathon is aimed at creating innovative solutions to the traffic management problem in Bengaluru, and is being co-organised by the Bengaluru Traffic Police, the Centre for Data for Public Good, and the Indian Institute of Science (IISc). The prizes are being sponsored by the IEEE Foundation.

Categories:

Ship Routing Problem Dataset

This is a dataset about minimizing maritime passenger transfer in ship routing. Consists of data on the distance between ports, the number of passengers from the port of origin to the port of destination, ships speed, and the duration of berthing at ports.

Categories:

Data set for TcpQtOptimal - Towards Optimal End-to-end TCP Congestion Control Using Queuing-Based Dynamical Systems Theory

Existing end-to-end congestion control algorithms, in Transmission Control Protocol (TCP), use packet loss and queueing delay for congestion detection, and use static control laws to adjust the sending rate and to control the congestion. This approach presupposes that the network, and its interaction with the congestion control mechanism, is static or quasi-static. In practice, the state of the network continuously changes over time, resulting in suboptimal performance of existing algorithms.

Categories:

Computational Intelligence

Mpox Narrative on Instagram: A Labeled Multilingual Dataset of Instagram Posts on Mpox for Sentiment, Hate Speech, and Anxiety Analysis

Indonesian Toxic Speech Dataset (IndoToxSpeech)

Transforming Workplace efficiency with Autonomous Endpoint Management and Enhancing the Digital Employee Experience

Dataset for Unveiling Viral Escape Mechanisms with Machine Learning: A Transformative Approach to Mutation Analysis for SARS-CoV-2 and Beyond

Multi-target regression datasets

Responsible Medical Corpus for Maternal Sexual and Reproductive Health

Category

CAN Communication LOG based on IEC62228-2019 with EFT Injection

The Bengaluru Mobility Challenge, 2024

Category

Ship Routing Problem Dataset

Data set for TcpQtOptimal - Towards Optimal End-to-end TCP Congestion Control Using Queuing-Based Dynamical Systems Theory