Artificial Intelligence

In Tibetan culture, Thangka art holds a significant status and is known as the "Encyclopedia of Tibetan culture". This unique form of painting encompasses many aspects of Tibetan history, politics, culture, and social life, and serves as precious material for studying Tibetan culture. Given the lack of publicly available Thangka datasets, we constructed a Thangka image super-resolution dataset in our research.


Anomaly detection plays a crucial role in many domains, including cybersecurity, space science, finance, and healthcare. However, the lack of standardized benchmark datasets hinders the comparative evaluation of anomaly detection algorithms. In this work, we address this gap by presenting a curated collection of preprocessed spacecraft anomaly datasets drawn from multiple sources. These datasets cover a diverse range of anomalies and real-world spacecraft scenarios.


The Garbage Image Dataset consists of images of garbage items collected from nearby localities using smartphones. The dataset is categorized into five different classes. Each category represents a specific type of garbage item commonly found in everyday waste. The purpose of the Garbage Image Dataset is to provide a collection of labelled images of garbage items from different categories. The dataset can be used to train and evaluate deep learning models for garbage classification tasks.


The LIAR dataset has been widely used by fake news detection researchers since its release, and through a great deal of research the community has provided a variety of feedback for improving it. We incorporated this feedback and released the LIAR2 dataset, a new benchmark of ~23k statements manually labeled by professional fact-checkers for fake news detection tasks.


Image representation of a malware-benign dataset. The malware samples were compiled from various malware repositories: The Malware-Repo, TheZoo, Malware Bazar, Malware Database, and TekDefense. The benign samples were sourced from system applications of Microsoft Windows 10 and 11, as well as open-source software repositories such as Sourceforge, PortableFreeware, CNET, and FileForum. The samples were validated by scanning them with the VirusTotal malware scanning service. The samples were pre-processed by transforming each binary into a grayscale image following the rules from Nataraj (2011).
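The binary-to-image step can be illustrated with a minimal sketch. It is not the dataset authors' actual pipeline: it assumes the commonly cited scheme in which each byte becomes one 8-bit pixel and the image width is chosen from the file size. The width table, the zero padding of the last row, and the function names `image_width`, `bytes_to_grayscale`, and `to_pgm` are assumptions made for illustration.

```python
import math

# Assumed width lookup (file size in KiB -> image width in pixels), following
# the table commonly attributed to Nataraj et al. (2011). The dataset authors'
# exact rules may differ.
_WIDTH_TABLE = [(10, 32), (30, 64), (60, 128), (100, 256),
                (200, 384), (500, 512), (1000, 768)]


def image_width(num_bytes: int) -> int:
    """Pick an image width from the size of the binary."""
    size_kib = num_bytes / 1024
    for limit_kib, width in _WIDTH_TABLE:
        if size_kib < limit_kib:
            return width
    return 1024  # anything of 1000 KiB or more


def bytes_to_grayscale(data: bytes) -> list[bytes]:
    """Map each byte to one pixel (0-255) and split the stream into rows.

    The final row is zero-padded, which is an assumption of this sketch;
    truncating the incomplete row would be an equally plausible choice.
    """
    if not data:
        raise ValueError("empty binary")
    width = image_width(len(data))
    height = math.ceil(len(data) / width)
    padded = data.ljust(width * height, b"\x00")
    return [padded[r * width:(r + 1) * width] for r in range(height)]


def to_pgm(rows: list[bytes]) -> bytes:
    """Serialize the rows as a binary PGM (P5) grayscale image."""
    header = f"P5\n{len(rows[0])} {len(rows)}\n255\n".encode("ascii")
    return header + b"".join(rows)
```

To convert a file, read it in binary mode, pass the bytes to `bytes_to_grayscale`, and write the result of `to_pgm` to a `.pgm` file. PGM is used here only because it needs no third-party imaging library.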


The "MANUU: Handwritten Urdu OCR Dataset" is an extensive, carefully curated collection intended to advance optical character recognition (OCR) for handwritten Urdu letters, digits, and words. The dataset was compiled methodically to cover a wide variety of handwriting samples. It supports building and evaluating robust OCR models designed for the complexities of the Urdu script.


With authorization from the relevant hospital departments, we are uploading some research data. This dataset is intended for research on indoor navigation systems in hospitals. The data includes panoramic photos (jpg) of the hospital, 3D modeling maps (png), and surveillance videos (ts) of indoor staircases, intersections, and other areas within the hospital. All data has been anonymized, and everyone appearing in the surveillance videos is wearing a mask, which protects personal privacy.


This data was recorded for EMG-based force/torque estimation. EMG and torque signals were collected during simultaneous, isometric, but continuously varying contractions corresponding to two wrist degrees of freedom (DoF). The experiment was carried out in two trials with a 5-min rest in between. Each trial included six combinations of tasks, separated by 2 min of rest to minimize the effect of fatigue. The tasks were categorized into individual and combined (simultaneous) DoF to test the ability to estimate isolated torque and torque in two simultaneous DoF.


Any damage that affects the normal functioning of the lungs is termed a lung disease, which can prove fatal if not detected early. To address this challenge, two innovative techniques are proposed for lung disease classification, supporting medical professionals in diagnosing the disease and providing preventive measures at an early stage. The proposed Model 1 integrates a custom MobileNetV2L2 architecture that builds upon the MobileNetV2 framework through fine-tuning and customization. This model incorporates a

