Computational Intelligence

Please cite the following paper when using this dataset:

N. Thakur and C.Y. Han, “An Exploratory Study of Tweets about the SARS-CoV-2 Omicron Variant: Insights from Sentiment Analysis, Language Interpretation, Source Tracking, Type Classification, and Embedded URL Detection,” Journal of COVID, 2022, Volume 5, Issue 3, pp. 1026-1049

Abstract

Categories:
1708 Views

This dataset supports a review and an in-depth analysis on the environmental impacts of integrated circuits (ICs). The paper is currently under review.

We gathered data from foundry reports, industry roadmaps, scientific literature, and commercial state-of-the-art LCA databases. All assumptions are detailed. 

More information can be found on the GitHub repository : https://github.com/ThibaultPirson/environmental-footprint-IC.

 

Categories:
598 Views

Please cite the following paper when using this dataset:

N. Thakur, “A Large-Scale Dataset of Twitter Chatter about Online Learning during the Current COVID-19 Omicron Wave,” Journal of Data, vol. 7, no. 8, p. 109, Aug. 2022, doi: 10.3390/data7080109

Abstract

Categories:
1008 Views

In this study, a primary IMU-based gait dataset has been collected from 30 participants using a MOTI goniometer. This device collects movement data related to specific joints depending on the location of the device. The MOTI sensor contains an IMU consisting of an accelerometer, gyroscope, and magnetometer. Accelerometers measure acceleration the acceleration of the device, gyroscopes measure angular velocity of the device, and magnetometers measure the magnetic field of the Earth. In our study, the device was attached to arm and leg (thigh) positions.

Categories:
681 Views

Please cite the following paper when using this dataset:

N. Thakur, “MonkeyPox2022Tweets: A large-scale Twitter dataset on the 2022 Monkeypox outbreak, findings from analysis of Tweets, and open research questions,” Infect. Dis. Rep., vol. 14, no. 6, pp. 855–883, 2022, DOI: https://doi.org/10.3390/idr14060087.

Abstract

Categories:
3607 Views

Nowadays, with the rapid increase in the number of applications and networks, the number of cyber multi-step attacks has been increasing exponentially. Thus, the need for a reliable and acceptable Intrusion Detection System (IDS) solution is becoming urgent to protect the networks and devices. However, implementing a robust IDS needs a reliable and up-to-date dataset in order to capture the behaviors of the new types of attacks, especially multi-step attacks. In this work, a new benchmark Multi-Step Cyber-Attack Dataset (MSCAD) is introduced.

Categories:
3375 Views

Research in Natural Language Processing (NLP) and computational linguistics highly depends on a good quality representative corpus of any specific language. Bangla is one of the most spoken languages in the world but Bangla NLP research is in its early stage of development due to the lack of quality public corpus. This article describes the detailed compilation methodology of a comprehensive monolingual Bangla corpus, KUMono (Khulna University Monolingual corpus).

Categories:
480 Views

Question Answer Pair dataset for Sentiment Analysis Tasks.

Categories:
103 Views

Air travel is one of the most used ways of transit in our daily lives. So it's no wonder that more and more people are sharing their experiences with airlines and airports using web-based online surveys. This dataset aims to do topic modeling and sentiment analysis on Skytrax (airlinequality.com) and Tripadvisor (tripadvisor.com) postings where there is a lot of interest and engagement from people who have used it or want to use it for airlines.

Categories:
754 Views

Pages