Artificial Intelligence

Gramatika is a syntectic GEC dataset for Indonesian. The Gramatika dataset has a total of 1.5 million sentences with 4,666,185 errors. Of all sentences, only 30,000 (2%) are correct sentences with no mistakes. Each sentence has a maximum of 6 errors, and there can only be 2 of the same error type in each sentence.We also split the dataset into three splits: train, dev, and test splits, with the proportion of 8:1:1 (with the size of 1,199,705, 150,171, and 150,124 sentences, respectively).
- Categories:

Gramatika is a syntectic GEC dataset for Indonesian. The Gramatika dataset has a total of 1.5 million sentences with 4,666,185 errors. Of all sentences, only 30,000 (2%) are correct sentences with no mistakes. Each sentence has a maximum of 6 errors, and there can only be 2 of the same error type in each sentence.We also split the dataset into three splits: train, dev, and test splits, with the proportion of 8:1:1 (with the size of 1,199,705, 150,171, and 150,124 sentences, respectively).
- Categories:
This paper presents an enhanced methodology for network anomaly detection in Industrial IoT (IIoT) systems using advanced data aggregation and Mutual Information (MI)-based feature selection. The focus is on transforming raw network traffic into meaningful, aggregated forms that capture crucial temporal and statistical patterns. A refined set of 150 features including unique IP counts, TCP acknowledgment patterns, and ICMP sequence ratios was identified using MI to enhance detection accuracy.
- Categories:

This study introduces a high-resolution UAV (Unmanned Aerial Vehicle) remote sensing image dataset aimed at advancing the development of deep learning-based farmland boundary extraction techniques and supporting the optimal deployment of Solar Insect Lights (SILs). Agricultural pests pose a significant threat to crop health and yield, while traditional pest control methods often cause environmental pollution.
- Categories:
This paper explores the cryptanalysis of the ASCON algorithm, a lightweight cryptographic method designed for applications like the Internet of Things (IoT). We utilize deep learning techniques to identify potential vulnerabilities within ASCON's structure. First, we provide an overview of how ASCON operates, including key generation and encryption processes.
- Categories:
This paper explores the cryptanalysis of the ASCON algorithm, a lightweight cryptographic method designed for applications like the Internet of Things (IoT). We utilize deep learning techniques to identify potential vulnerabilities within ASCON's structure. First, we provide an overview of how ASCON operates, including key generation and encryption processes.
- Categories:

Solar-powered insecticidal lamps have been widely used in agricultural pest control systems, where stable 4G connectivity is critical for real-time transmission of multi-source field data (soil parameters, pest images, and environmental metrics). However, the lack of reliable 4G signal strength datasets in agricultural scenarios, especially under rainfall conditions that cause signal degradation, poses a great challenge to deployment planning and network reliability.
- Categories:
This dataset comprises 2 million synthetic samples generated using the Variational Autoencoder-Generative Adversarial Network (VAE-GAN) technique. The dataset is designed to facilitate cardiovascular disease prediction through various demographic, physical, and health-related attributes. It contains essential physiological and behavioral indicators that contribute to cardiovascular health.
Dataset Description The dataset consists of the following features:
- Categories:
This dataset comprises 2 million synthetic samples generated using the Variational Autoencoder-Generative Adversarial Network (VAE-GAN) technique. The dataset is designed to facilitate cardiovascular disease prediction through various demographic, physical, and health-related attributes. It contains essential physiological and behavioral indicators that contribute to cardiovascular health.
Dataset Description The dataset consists of the following features:
- Categories:
In recent years, the fusion of artificial intelligence and semantic web technologies has paved the way for innovative approaches to managing and utilizing information. With the growing demand for structured gastronomical data, there is a need for well-defined ontologies that facilitate recipe organization, ingredient classification, nutritional insights, and personalized diet recommendations. The dataset presents a multilingual recipe ontology and knowledge graph, capturing critical relationships between ingredients, nutrition, cooking actions, and recipe planning.
- Categories: