Artificial Intelligence
As the harmful effects of climate change on human society increase, the analysis of abnormal weather is becoming an important issue. Therefore, this work provides the Korean weather dataset, including the anomaly score measurements by using seven different methods. In this dataset, seven types of weather data for each day in 64 Korean cities from 2010 to 2020 are provided by Weather Radar Center in Korea Meteorological Administration.
- Categories:
The data included here within is the associated model training results from the correlated paper "Distribution-Driven Augmentation of Real-World Datasets for Improved Cancer Diagnostics With Machine Learning". This paper focuses on using kernel density estimators to curate datasets by balancing classes and filling missing null values though synthetically generated data. Additionally, this manuscript proposes a technique for joining distinct datasets to train a model with necessary features from multiple different datasets as a type of transfer-learning.
- Categories:
The data set has been prepared as 2 different versions. The data set was shared in two versions due to the fact that the researchers could easily reproduce the tests and hardware limitations. The first version (small_dataset) was prepared using a 10% sub-sample of all dataset. The other version (big_dataset) contains the entire data. In this study, the scenarios tested were run on the small_dataset. The most successful configuration that was selected as a result of the analysis on small_dataset was applied to big_dataset.
- Categories:
Since meteorological satellites can observe the Earth’s atmosphere from a spatial perspective at a large scale, in this paper, a dust storm database is constructed using multi-channel and dust label data from the Fengyun-4A (FY-4A) geosynchronous orbiting satellite, namely, the Large-Scale Dust Storm database based on Satellite Images and Meteorological Reanalysis data (LSDSSIMR), with a temporal resolution of 15 minutes and a spatial resolution of 4 km from March to May of each year during 2020–2022.
- Categories:
We collected relevant data of ultrasonic Doppler flowmeter in the laboratory to study the error of ultrasonic Doppler flowmeter. It contains four sets of data at different turbidities and four sets of data at different liquid levels. Each set of data under different turbidities contains 440 pieces of data, and each set of experiments under different liquid levels contains 220 pieces of data. The entire data set has a total of 2720 pieces of data. The training set test split is 8:2, which we have already split in the uploaded data set.
- Categories:
Acute myocardial infarction (AMI) is the main cause of death in developed and developing countries. AMI is a serious medical problem that necessitates hospitalization and sometimes results in death. Patients hospitalized in the emergency department (ED) should therefore receive an immediate diagnosis and treatment. Many studies have been conducted on the prognosis of AMI with hemogram parameters. However, no study has investigated potential hemogram parameters for the diagnosis of AMI using an interpretable artificial intelligence-based clinical approach.
- Categories:
Using this data, we conduct an extensive investigation into the phenomenon of homophily in the generation of hate speech on Twitter, shedding light on an essential aspect of understanding online hate speech dynamics. We introduce innovative methods to detect multiple forms of hate speech, including manifestations of racism and sexism. Furthermore, we propose and validate novel measures for quantifying familiarity and similarity on Twitter, providing a comprehensive framework for understanding the interactions among users.
- Categories:
The "Multi-modal Sentiment Analysis Dataset for Urdu Language Opinion Videos" is a valuable resource aimed at advancing research in sentiment analysis, natural language processing, and multimedia content understanding. This dataset is specifically curated to cater to the unique context of Urdu language opinion videos, a dynamic and influential content category in the digital landscape.
Dataset Description:
- Categories: