Machine Learning

The Paddy Doctor dataset contains 16,225 labeled paddy leaf images across 13 classes (12 different paddy diseases and healthy leaves). It is the largest expert-annotated visual image dataset to experiment with and benchmark computer vision algorithms. The paddy leaf images were collected from real paddy fields using a high-resolution (1,080 x 1,440 pixels) smartphone camera. The collected images were carefully cleaned and annotated with the help of an agronomist.

Categories:
12834 Views

The Research Paper "Detection of Bicep Form Using Myoware and Machine Learning" based on the novel dataset has been recently accepted in September 2022 and is being published in SCOPUS Indexed SPRINGER Book Series “Lecture Notes in Networks and Systems”

Categories:
1438 Views

Recently, unmanned aerial vehicles (UAVs) have been receiving significant attention due to the wide range of potential application areas. To support UAV use cases with beyond visual line of sight (BVLOS) and autonomous flights,  cellular networks can provide connectivity points to UAVs and provide remote control and payload communications. However, there are limited datasets to study the coverage of cellular technologies for UAV flights at different altitudes.

Categories:
900 Views

The problem of effective disposal of the trash generated by people has rightfully attracted major interest from various sections of society in recent times. Recently, deep learning solutions have been proposed to design automated mechanisms to segregate waste. However, most datasets used for this purpose are not adequate. In this paper, we introduce a new dataset, TrashBox, containing 17,785 images across seven different classes, including medical and e-waste classes which are not included in any other existing dataset.

Categories:
1130 Views

Guava fruit production is one of the main sources of economic growth in Asian countries, the world production of guava in 2019 was 55 million tons. Guava disease is an important factor in economic loss as well as quantity and quality of guava. The original guava fruit disease dataset consist of 38 images of phytophthora, 30 images of root and 34 images of scab guava disease with 650x650x3 pixel.

Categories:
1301 Views

A synthetic laser reliability dataset generated using generative  adversarial networks (GANs) is provided. The data includes normalized current measurements estimated at the following times: 2, 20, 40, 60, 80, 100, 150, 500, 1000, and 1500 hours. The data can be used to train machine learning models to solve different predictive maintenance tasks such as prediction of performance degradation, remainng useful prediction, and so on. 

Categories:
146 Views

Although asking and replying on social media platforms in mixed language is a very common phenomenon these days, there is lack of precise corpora to analyze such code mixed language. Datasets released by various CQA sites are monolingual i.e. only in English language. To perform our task, we needed annotated bilingual dataset which include Question pairs in mashed up language. In view of this scarcity we created a dataset by scraping pairs of questions from distinct social media networks, for-example Yahoo!

Categories:
127 Views

The dataset contains labeled sentences. The sentences having information related to (1) infections, (2) suffering from pneumonea, (3) deaths, and (4) health updates from government/WHO, are labeled with 1 and the rest are labeled with 0. Source of all the news articles: https://www.thehindu.com/archive/

Categories:
130 Views

This dataset is for doa estimation when an amplitude-phase error exists.

Categories:
262 Views

PROTEIN STRUCTURE AND SYNTHETIC MULTI-VIEW CLUSTERING DATASETS

Multi-View Clustering (MVC) datasets used in the following paper:

Evolutionary Multi-objective Clustering Over Multiple Conflicting Data Views. Authors: Mario Garza-Fabre, Julia Handl, and Adán José-García. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION. Accepted for publication, November 2022.

This entry contains all 420 datasets used in the paper, including:

Categories:
275 Views

Pages