Machine Learning

This dataset comes up as a benchmark dataset for machines to automatically recognizing the handwritten assamese digists (numerals) by extracting useful features by analyzing the structure. The Assamese language comprises of a total of 10 digits from 0 to 9. We have collected a total of 516 handwritten digits from 52 native assamese people irrespective of their age (12-86 years), gender, educational background etc. The digits are captured in .jpeg format using a paint mobile application developed by us which automatically saves the images in the internal storage of the mobile.


This dataset is part of my PhD research on malware detection and classification using Deep Learning. It contains static analysis data (PE Section Headers of the .text, .code and CODE sections) extracted from the 'pe_sections' elements of Cuckoo Sandbox reports. PE malware examples were downloaded from PE goodware examples were downloaded from and from Windows 7 x86 directories.


An accurate and reliable image-based quantification system for blueberries may be useful for the automation of harvest management. It may also serve as the basis for controlling robotic harvesting systems. Quantification of blueberries from images is a challenging task due to occlusions, differences in size, illumination conditions and the irregular amount of blueberries that can be present in an image. This paper proposes the quantification per image and per batch of blueberries in the wild, using high definition images captured using a mobile device.


ASNM datasets include records consisting of many features, that express various properties and characteristics of TCP communications. These features are called Advanced Security Network Metrics (ASNM) and were designed with the intention to discern legitimate and malicious connections (especially intrusions).


Dataset consists of various open GIS data from the Netherlands as Population Cores, Neighbhourhoods, Land Use, Neighbourhoods, Energy Atlas, OpenStreetMaps, openchargemap and charging stations. The data was transformed for buffers with 350m around each charging stations. The response variable is binary popularity of a charging pool.


The Contest: Goals and Organisation

 The 2019 Data Fusion Contest, organized by the Image Analysis and Data Fusion Technical Committee (IADF TC) of the IEEE Geoscience and Remote Sensing Society (GRSS), the Johns Hopkins University (JHU), and the Intelligence Advanced Research Projects Activity (IARPA), aimed to promote research in semantic 3D reconstruction and stereo using machine intelligence and deep learning applied to satellite images.


Attempts to prevent invasion of marine biofouling on marine vessels are demanding. By developing a system to detect marine fouling on vessels in an early stage of fouling is a viable solution. However, there is a  lack of database for fouling images for performing image processing and machine learning algorithm.


This is a reservoir dataset including a large number of figures. Reservoir simulation, an important part of the petroleum industry, a powerful tool helping oil companies understand the reservoir better.

In this dataset, there more than 10,000 figures are showing in different period oilfield development. From the beginning to the end, we keep some variables constant while some changes to make clear the influences of different parts.

Last Updated On: 
Wed, 10/30/2019 - 03:16

The Contest: Goals and Organization


The 2017 IEEE GRSS Data Fusion Contest, organized by the IEEE GRSS Image Analysis and Data Fusion Technical Committee, aimed at promoting progress on fusion and analysis methodologies for multisource remote sensing data.



This dataset includes the measurements of a simulated vehicle inside a Gazebo simulation using different sensors: a simulated UWB tag, a IMU and a PX4Flow.