Machine Learning

Three month Coffee Leaf Rust dataset generated by the Cyber Physical Data Collection System.


Deep facial features with identity generated from CelebA dataset using facenet network (128 real-valued features). Dataset contains:
- full dataset
- training dataset
- validation dataset
Link to CelebA dataset:


Music and animal's basic emotions associated with acoustic signals.

Files associated with animals’ sounds mainly were based on the records from  Volodins Bioacoustic Group Homepage

More details:


Giemsa-stained thin blood smear slides from 150 P. falciparum-infected and 50 healthy patients were collected and photographed at Chittagong Medical College Hospital, Bangladesh. The smartphone’s built-in camera acquired images of slides for each microscopic field of view.


Morse code is a system of communication using dots and dashes to represent numbers, letters and symbols. For example, the letter 'B' is represented as a dash followed by 3 dots, i.e. "–...". The dataset used in this competition is synthetically generated, and mimics a human writing dots and dashes on a piece of paper. In this sense, it is like a 1-dimensional version of an image represented by numeric pixel values. The challenge is to classify the resulting 1-dimensional input into 1 out of 64 classes which represent various letter, numbers and symbols.

Last Updated On: 
Tue, 07/14/2020 - 21:14

The dataset contains medical signs of the sign language including different modalities of color frames, depth frames, infrared frames, body index frames, mapped color body on depth scale, and 2D/3D skeleton information in color and depth scales and camera space. The language level of the signs is mostly Word and 55 signs are performed by 16 persons two times (55x16x2=1760 performance in total).


We build an original dataset of thermal videos and images that simulate illegal movements around the border and in protected areas and are designed for training machines and deep learning models. The videos are recorded in areas around the forest, at night, in different weather conditions – in the clear weather, in the rain, and in the fog, and with people in different body positions (upright, hunched) and movement speeds (regu- lar walking, running) at different ranges from the camera.


Mosquito bites result in the deaths of more than 1 million people every year.   Certain species of mosquitos like Aedes are the main vector of arboviruses that cause Dengue, Malaria and Yellow fever. Image based mosquito species classification can be helpful to implement strategies to prevent the spread of mosquito borne disease. Automated mosquito species classification can aid in laborious and time consuming task of entomologists besides enhancing accuracy.


Coventry-2018 is a human activity recognition dataset captured by three Panasonic® Grid-EYE (AMG8833) infrared sensors in March 2018. The Grid-EYE sensors represent a 60 field of view scene by an 8 × 8 array named frame. The data streams are synchronized to 10 frames per second and saved as *.csv recordings using the LabVIEW® software. Two layouts are considered in this dataset with different geometry sizes: 1) small layout; and 2) large layout.


This paper aims to improve the existing techniques on X-ray image inspection of aerial engine by using artificial intelligence (AI) based object detection model. This technique seeks to augment and improve existing automated non-destructive testing (NDT) diagnosis of metal structure of engine parts. Traditional jet-engine maintenance and overhaul processes are resorted to NDT to find defects in internal welds. An application of deep learning for NDT technology can effectively identify presence and location of up to eight types of defects, leading to enhanced work quality and efficiency.