Machine Learning

Five Years of COVID-19 Discourse on Instagram: A Labeled Instagram Dataset of Over Half a Million Posts for Multilingual Sentiment Analysis

To download this dataset without purchasing an IEEE Dataport subscription, please visit: https://zenodo.org/records/13896353

Please cite the following paper when using this dataset:

Categories:: Artificial Intelligence
Education and Learning Technologies
Machine Learning
Other
Social Sciences
Standards Research Data
Biomedical and Health Sciences
Computational Intelligence
COVID-19
Demographic
Education
Age

665 Views

multi-output regression datasets

you can download these datasets from OpenML: https://www.openml.org/search?type=data&status=active&tags.tag=2019_mult...

Categories:: Machine Learning

106 Views

multi-output regression datasets

you can download these datasets from OpenML: https://www.openml.org/search?type=data&status=active&tags.tag=2019_mult...

Categories:: Machine Learning

57 Views

GATE data for simulating Y-90 Microsphere Clusters in Human Phantom in Vereos Ring Scanner.Copy

Liver cancer treatment, especially for metastatic cases, poses significant challenges in accurately targeting tumours while sparing healthy tissue. Radioembolisation with yttrium-90 (Y-90) microspheres is a promising technique, but precise imaging of microsphere distribution is crucial. This study utilises T-PEPT, a novel Positron Emission Particle Tracking (PEPT) algorithm that combines topological data analysis with machine learning to identify Y-90 microsphere clusters in a digital twin of a patient's liver.

Categories:: Machine Learning
Medical Imaging

94 Views

Unified Multimodal Network Intrusion Detection Systems Dataset

The Unified Multimodal Network Intrusion Detection System (UM-NIDS) dataset is a comprehensive, standardized dataset that integrates network flow data, packet payload information, and contextual features, making it highly suitable for machine learning-based intrusion detection models. This dataset addresses key limitations in existing NIDS datasets, such as inconsistent feature sets and the lack of payload or time-window-based contextual features.

Categories:: Artificial Intelligence
Wireless Networking
IoT
Machine Learning
Standards Research Data
Communications
Security

754 Views

MageCode

We collected programming problems and their solutions from previous studies. After applying some pre-processing steps, we queried advanced LLMs, such as GPT4, with the collected problems to produce machine-generated codes, while the original solutions were labeled as human-written codes. Finally, the entire collected dataset was divided into training, validation, and test sets, ensuring that there is no overlap among these sets, meaning no solutions in two different sets that solve the same programming problem.

Categories:: Artificial Intelligence
Machine Learning

27 Views

Resource allocation in Non-orthogonal multiple access - In-band full duplex cellular systems dataset

The dataset consists of uplink channel gains, downlink channel gains and uplink to downlink channel gains along with corresponding power allocations for uplink users and downlink users across all subcarriers. Additionally, it consists of NOMA decoding order for successful implementation of SIC at NOMA receiver. The number of UL users and DL users are considered as N=M=6, and subcarriers are S=9. Each column in the dataset is a sample for fading channel realization and it should be converted back to the matrix to compute sumrate.

Categories:: Wireless Networking
Machine Learning
Communications

120 Views

RMPS (Rupnagar Maize Paddy Sugarcane) dataset

The study focused on two regions in Rupnagar district, India, with an area of 216 km² as shown in Fig. 1a, using satellite data from June to November 2023. The upper region predominantly features paddy and maize, while the lower region includes paddy and sugarcane. Satellite images were obtained from PlanetScope’s 130-satellite constellation, with a spatial resolution of 3 meter. A total of 32 images, captured between late May and mid-November 2023, were used, all with less than 15% cloud cover.

Categories:: Agriculture
Machine Learning
Geoscience and Remote Sensing
Remote Sensing

133 Views

VOCAL TONE DATASET (SRI LANKAN)

This dataset addresses the challenge of limited vocal recordings available in secondary datasets, particularly those that predominantly feature foreign accents and contexts. To enhance the accuracy of our solution tailored for Sri Lankans, we employed primary data-gathering methods.

The dataset comprises vocal recordings from a sample population of youth. Participants were instructed to read three specific sentences designed to capture a range of vocal tones:

Categories:: Signal Processing
Machine Learning
Image Processing

162 Views

Facial Expression Dataset (Sri Lankan)

The Facial Expression Dataset (Sri Lankan) is a culturally specific dataset created to enhance the accuracy of emotion recognition models in Sri Lankan contexts. Existing datasets, often based on foreign samples, fail to account for cultural differences in facial expressions, affecting model performance. This dataset bridges that gap, using high-quality data sourced from over 100 video clips of professional Sri Lankan actors to ensure expressive and clear facial imagery.

Categories:: Machine Learning
Image Processing

350 Views

Machine Learning

Machine Learning

Pages