Machine Learning
Data on 2355 COVID-19 cases by date of July to December 2021 were extracted from a data set recorded by COVID-19 referral centers at Qazvin province in Iran. We recorded a wide range of clinical characteristics including age, sex, previous diseases, and hospitalization time. Moreover, we collected data about the different consumed medications including Atrovastatin 20 mg, Atrovastatin 40 mg, Ivermectin 3 mg, Ivermectin 40 mg, Dexamethasone, Kaletra, Favipiravir, Famotidine 40 mg, Interferon, Remdesivir, Hydroxychloroquine.
- Categories:
During our research in generating or optimizing molecules to be drug candidates by extending deep reinforcement learning and graph neural networks algorithms, we used GEOM data [1], and we had an idea to make a dataset obtained from molecules from GEOM to predit the activity towards COVID and the drug linkeness. We calculated over 200 descriptors for the molecules using RDKit [2]. We hope you enjoy using it.
References:
- Categories:
This dataset comprises gunshot audio and supporting data released as part of ShotSpotter Tech Note 098, "Precision and accuracy of acoustic gunshot location in an urban environment".
The data derive from a series of live fire tests of the ShotSpotter Respond gunshot location system conducted in Pittsburgh, PA on December 18th, 2018 by the Pittsburgh Bureau of Police. ShotSpotter uses live fire tests to validate that the deployed sensor density is appropriate for the community in question, and to ensure the system is ready for production use.
- Categories:
The pathology files of 194 colon cancer patients, 137 breast cancer patients, 124 gastric cancer patients, and 169 thyroid cancer patients who were referred to the healthcare facilities of Qazvin Province, Iran were examined for age, sex, surgery type, and pathological information. We collected information between 2010 and 2020.
- Categories:
JVNV is a Japanese emotional speech corpus with verbal content and nonverbal vocalizations whose scripts are generated by a large-scale language model.
Existing emotional speech corpora lack not only proper emotional scripts but also nonverbal vocalizations (NVs) that are essential expressions in spoken language to express emotions.
We propose an automatic script generation method to produce emotional scripts by providing seed words with sentiment polarity and phrases of nonverbal vocalizations to ChatGPT using prompt engineering.
- Categories:
Social Media Big Dataset for Research, Analytics, Prediction, and Understanding the Global Climate Change Trends is focused on understanding the climate science, trends, and public awareness of climate change. The use of dataset for analytics of climate change trends greatly helps in researching and comprehending global climate change trends.
- Categories:
The Numerical Latin Letters (DNLL) dataset consists of Latin numeric letters organized into 26 distinct letter classes, corresponding to the Latin alphabet. Each class within this dataset encompasses multiple letter forms, resulting in a diverse and extensive collection. These letters vary in color, size, writing style, thickness, background, orientation, luminosity, and other attributes, making the dataset highly comprehensive and rich.
- Categories:
A crime is a deliberate act that can cause physical or psychological harm, as well as property damage or loss, and can lead to punishment by a state or other authority according to the severity of the crime. The number and forms of criminal activities are increasing at an alarming rate, forcing agencies to develop efficient methods to take preventive measures. In the current scenario of rapidly increasing crime, traditional crime-solving techniques are unable to deliver results, being slow paced and less efficient.
- Categories:
Our large scale alpine land cover dataset consists of 229'535 very high-resolution aerial images (50cm) and digital elevation model (50cm) with land cover annotations produced by experts in photo-interpretration . The nine land cover types in our study area include bedrock, bedrock with grass, large blocks, large blocks with grass, scree, scree with grass, water area, forest and glacier. The distribution of pixels among classes presents a typical case of a long-tailed distribution with an imbalance factor, defined as the ratio of the most frequent to the rarest class, close to 1000.
- Categories: