Machine Learning

This dataset offers a comprehensive mix of financial, demographic, temporal, and external factor data to help predict credit delinquency. It includes key information such as loan terms, credit balances, and effective interest rates, along with client details like salary, marital status, and profession.

In addition to tracking historical credit behavior and overdue days at different time points, the dataset incorporates critical external factors, including climate change, social unrest, and global crises like COVID-19, which may influence payment delays and financial behavior.

This dataset was created by recoloring three existing datasets: NeRF Synthetic, LLFF, and Mip 360. The recolored images provide ground truth for validating recoloring applications. NeRF Synthetic was recolored using Blender, while LLFF and Mip 360 were processed in Photoshop. For each scene in the datasets, 11 images were recolored, ensuring consistency across the datasets.

Intrusion detection in Unmanned Aerial Vehicle (UAV) networks is crucial for maintaining the security and integrity of autonomous operations. However, the effectiveness of intrusion detection systems (IDS) is often compromised by the scarcity and imbalance of available datasets, which limits the ability to train accurate and reliable machine learning models. To address these challenges, we present the "CTGAN-Enhanced Dataset for UAV Network Intrusion Detection", a meticulously curated and augmented dataset designed to improve the performance of IDS in UAV environments.

 

Well logs are interpreted and processed to estimate in-situ reservoir properties (petrophysical, geomechanical, and geochemical), which is essential for reservoir modeling, reserve estimation, and production forecasting. The modeling is often based on multi-mineral physics or empirical formulae. When a sufficient amount of training data is available, machine learning provides an alternative approach to estimating those reservoir properties from well log data, usually with shorter turnaround time and less human involvement.

This repository contains the datasets produced using different data generation strategies to train data-driven models (e.g., decision trees, gradient tree boosting, and deep neural networks) and to evaluate their performance. The data generation strategies and the results are presented in the conference paper: "Training Data Generation Strategies for Data-driven Security Assessment of Low Voltage Smart Grids," J. Cuenca, E. Aldea, E. Le Guern-Dall'o, R. Féraud, G. Camilleri, and A. Blavette, IEEE ISGT EU 2024, Dubrovnik, Croatia, Oct 2024.

Human pose estimation has applications in numerous fields, including action recognition, human-robot interaction, motion capture, augmented reality, sports analytics, and healthcare. Many datasets and deep learning models are available for human pose estimation within the visible domain. However, challenges such as poor lighting and privacy issues persist. These challenges can be addressed using thermal cameras; nonetheless, only a few annotated thermal human pose datasets are available for training deep learning-based human pose estimation models.

The dataset contains ground-based observations of crop growth stages for Canada's prairie provinces (Manitoba, Saskatchewan, and Alberta) from 2019 to 2020. Crop growth stages were visually observed from the side of the fields on a weekly cycle until the fields were harvested. The BBCH (Biologische Bundesanstalt, Bundessortenamt und CHemische Industrie) scale was used to stage growth.

Concept-1K is a novel dataset designed to facilitate research on incremental learning in large language models. It comprises 1,023 concepts represented as knowledge triplets, focusing on recently emerged topics to minimize data leakage. By providing a fine-grained approach to evaluating model performance, Concept-1K enhances the understanding of how these models learn and retain new information.

This paper presents an innovative Internet of Things (IoT) system that integrates gas sensors and a custom Convolutional Neural Network (CNN) to classify the freshness and species of beef and mutton in real time. The CNN, trained on 9,928 images, achieved 99% accuracy, outperforming models like ResNet-50, SVM, and KNN. The system uses three gas sensors (MQ135, MQ4, MQ136) to detect gases such as ammonia, methane, and hydrogen sulfide, which indicate meat spoilage.

The TiHAN-V2X Dataset was collected in Hyderabad, India, across various Vehicle-to-Everything (V2X) communication types, including Vehicle-to-Vehicle (V2V), Vehicle-to-Infrastructure (V2I), Infrastructure-to-Vehicle (I2V), and Vehicle-to-Cloud (V2C). The dataset offers comprehensive data for evaluating communication performance under different environmental and road conditions, including urban, rural, and highway scenarios.
