Artificial Intelligence
The availability of labelled Cyber Bulling Types dataset has been exhibited for high profile Natural Language Processing (NLP), which constantly leads the advancement of constructing and model creation-based text. I aim at extracting diverse and efficient Cyber Bully Tweets from the Twitter Social Media Platform. This dataset contains 5 types of cyber bullying samples. They are
1. Sexual Harassment
2. Doxing
3. Cyberstalking
- Categories:
Aspect Sentiment Triplet Extraction (ASTE) is an Aspect-Based Sentiment Analysis subtask (ABSA). It aims to extract aspect-opinion pairs from a sentence and identify the sentiment polarity associated with them. For instance, given the sentence ``Large rooms and great breakfast", ASTE outputs the triplet T = {(rooms, large, positive), (breakfast, great, positive)}. Although several approaches to ASBA have recently been proposed, those for Portuguese have been mostly limited to extracting only aspects without addressing ASTE tasks.
- Categories:
The dataset contains the navigation measurements obtained in the indoor experiment field. The volunteers move on the whole 4th floor of the Building D of Dong Jiu Teaching classes at Huazhong University of Science and Technology. Meanwhile, the experimental area consists of a total area of 717 m 2. These datasets were used and can be used to test and validate the radio map database updating-based localization positioning algorithm through the RSSI signals space.
- Categories:
Most of Facial Expression Recognition (FER) systems rely on machine learning approaches that require large databases (DBs) for an effective training. As these are not easily available, a good solution is to augment the DBs with appropriate techniques, which are typically based on either geometric transformation or deep learning based technologies (e.g., Generative Adversarial Networks (GANs)). Whereas the first category of techniques have been fairly adopted in the past, studies that use GAN-based techniques are limited for FER systems.
- Categories:
The file contains the source code of the mission generator MG1 together with the materials and results of the experiments. This data is associated with a paper in which we focus on an experimental protocol for the comparison of fully-automatic design methods of control software for robot swarms.
- Categories:
数据集收集于 2009 年至 2010 年,通常用于预测学生的表现和个性化。在学生信息方面,此数据集包含 525,535 行学生答案,包括 4,217 名学生和 124 种技能。
- Categories:
(Work in progress)
This dataset contains the augmented images and the images & segmentation maps for seven handwashing steps, six of which are prescirbed WHO handwashing steps.
This work is based on a sample handwashing video dataset uploaded by Kaggle user real-timeAR.
- Categories:
This is a subset of the ASHRAE Global Comfort Database that we used in our study to prove that Deep learning methods performs better than shallow methods predicting the thermal sensation.
- Categories:
This paper describes the creation of the Massive Arabic Speech Corpus (MASC). MASC is a dataset that contains 1,000 hours of speech sampled at 16 kHz and crawled from over 700 YouTube channels. The dataset is multi-regional, multi-genre, and multi-dialect intended to advance the research and development of Arabic speech technology with a special emphasis on Arabic speech recognition. In addition to MASC, a pre-trained 3-gram language model and a pre-trained automatic speech recognition model are also developed and made available to interested researchers.
- Categories: