*.csv

This dataset provides turbidity measurements collected during a Moringa oleifera leaf water treatment process for compound extraction. The extraction process was conducted over a 15-minute duration, capturing key changes in turbidity to reflect the dynamics of the process. The raw data has been preprocessed, upsampled, and annotated for time series analysis, enabling detailed investigation of extraction patterns. Additionally, the dataset has been optimized using the ForGAN (Forecasting GAN) algorithm to enhance data granularity and support predictive modeling.

15 Views

In-vehicle networks carry safety-critical control applications that depend on data communication between electronic control units, and most are based on the CAN protocol. Today's automotive solutions need large amounts of data for reliability, safety, and cybersecurity analysis, especially to feed machine learning models. Comprehensive datasets covering CAN communication across different driving situations are therefore valuable, yet they remain a gap in recent research because most public datasets are very limited.

25 Views

This dataset contains rental prices for Kuala Lumpur and its surrounding neighborhoods, obtained from mudah.my in July 2024. The raw data is unprocessed and contains the original description of the house, the details in JSON format, the rent price, and the period.

This dataset is ideal for making rent price forecasts and exploring in depth what factors influence rent prices.
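Because the listing details are stored as JSON inside the CSV, a first preprocessing step is usually to flatten that column. The sketch below shows one way to do this with the Python standard library. The column names (`description`, `details`, `rent`, `period`) and the sample rows are assumptions for illustration only; the actual file may use different names and fields.

```python
import csv
import io
import json

# Hypothetical sample rows; the real column names and JSON keys may differ.
SAMPLE_CSV = '''description,details,rent,period
"Cozy studio near LRT","{""rooms"": 1, ""furnished"": true, ""size_sqft"": 450}",1500,monthly
"Family condo","{""rooms"": 3, ""furnished"": false, ""size_sqft"": 1100}",2800,monthly
'''


def load_listings(csv_text):
    """Parse CSV text and merge the JSON 'details' column into each row."""
    rows = []
    for record in csv.DictReader(io.StringIO(csv_text)):
        # Decode the embedded JSON string into a dict of listing attributes.
        details = json.loads(record.pop("details"))
        # Convert the rent price from text to a number for later modeling.
        record["rent"] = float(record["rent"])
        record.update(details)
        rows.append(record)
    return rows


listings = load_listings(SAMPLE_CSV)
```

Each resulting row is a flat dictionary, so attributes from the JSON column can be used directly as candidate features when exploring what influences rent.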


97 Views

This data addresses student achievement in secondary education at two Portuguese schools. The data attributes (student grades, demographic, social and school-related features) were collected using school reports and questionnaires. Two datasets are provided regarding the performance in two distinct subjects: Mathematics (mat) and Portuguese language (por). In [Cortez and Silva, 2008], the two datasets were modeled under binary/five-level classification and regression tasks. Important note: the target attribute G3 has a strong correlation with attributes G2 and G1.
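The note about G3 can be checked by computing the Pearson correlation between the grade columns. The sketch below implements the coefficient with the standard library; the grade values are made up for illustration and are not taken from the dataset.

```python
import math


def pearson(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (numerator) and sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Illustrative grades only (0-20 scale assumed), not real records.
g1 = [10, 12, 8, 15, 14]
g2 = [11, 12, 9, 15, 13]
g3 = [11, 13, 8, 16, 14]

r_g2_g3 = pearson(g2, g3)
r_g1_g3 = pearson(g1, g3)
```

A coefficient close to 1 for either pair would indicate that including G1 or G2 as a feature makes predicting G3 much easier, which is worth keeping in mind when comparing models.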

23 Views

This dataset integrates financial and macroeconomic indicators to support research on stock price prediction and financial forecasting. It includes daily stock data for Malayan Banking Berhad (MBB) (1155.KL) sourced from Yahoo Finance, alongside macroeconomic indicators such as GDP (constant 2015 MYR), GDP growth (YoY %), inflation rate (%), and the Overnight Policy Rate (OPR). The data spans a 20-year period from July 1, 2004, to August 1, 2024, and has been standardized to a daily frequency.
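Indicators such as GDP or the OPR are published far less often than daily, so aligning them with daily stock data requires some resampling rule. The description does not say which rule was used; the sketch below assumes a simple forward fill (each day carries the most recent published value) and uses illustrative numbers, not values from the dataset. The function name `forward_fill_daily` is hypothetical.

```python
from datetime import date, timedelta


def forward_fill_daily(observations, start, end):
    """Expand sparse (date, value) observations to one value per calendar day.

    Each day takes the most recent observation on or before it; days before
    the first observation get None.
    """
    points = sorted(observations)
    daily = {}
    idx = 0
    current = None
    day = start
    while day <= end:
        # Advance past every observation that is already in effect on this day.
        while idx < len(points) and points[idx][0] <= day:
            current = points[idx][1]
            idx += 1
        daily[day] = current
        day += timedelta(days=1)
    return daily


# Illustrative policy-rate values only, not taken from the dataset.
opr = [(date(2024, 1, 1), 3.0), (date(2024, 1, 4), 3.25)]
daily_opr = forward_fill_daily(opr, date(2024, 1, 1), date(2024, 1, 5))
```

Forward filling avoids using information from the future, which matters for forecasting experiments; interpolation between releases would be an alternative but leaks later values into earlier days.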

27 Views

The charging load dataset was collected by a smart energy measurement system over a one-year period, with data recorded hourly from six Electric Vehicle Charging Stations (EVCSs) located in a city center in China. The data covers a full annual cycle of charging behavior. The charging stations, labeled EVCS1 through EVCS6, report their loads in kilowatts (kW), with values recorded to four decimal places.

33 Views

The rapid development of electric vehicles has significantly increased the demand for efficient and reliable charging infrastructure, making the analysis of charging load data essential for urban energy planning. This dataset was compiled from charging load data collected by a smart energy measurement system deployed in a city center in China. The data covers a one-year period, recorded at hourly intervals, and includes measurements from six electric vehicle charging stations (EVCSs), designated EVCS1 to EVCS6, each with a distinct charging power capability.

34 Views

This collection includes multiple short text classification datasets designed for various natural language processing tasks. It contains several topic classification datasets, such as AG News, Snippets, and TMNNews, which cover a wide range of topics and domains to evaluate the effectiveness of classification models. Additionally, the collection includes a binary sentiment classification dataset, Twitter, aimed at determining positive or negative sentiment in text.

14 Views

This is a pump fillage time series dataset consisting of 8 time series. The data is sourced from actual production data recorded during the operation of an oil field. It covers 8 oil wells, with measurements collected every half hour between July 22, 2022, and August 16, 2022. The pump fillage data for each well is sorted in chronological order to obtain the pump fillage time series for that well. The dataset has varying numbers of cards due to potential communication issues, ranging ...

7 Views

A key challenge in cybersecurity is the absence of a large-scale network dataset that accurately captures modern traffic patterns, diverse intrusion types, and comprehensive network activity. Existing benchmark datasets such as KDDCup99, NSL-KDD, GureKDD, and UNSW-NB15 require updates to reflect contemporary cyberattack signatures effectively.

84 Views
