Machine Learning

This dataset contains a collection of videos consisting of satellite imagery augmented with 3D ship models, accompanied by the ships' corresponding AIS data. The intention of this dataset is for detecting dark ships, which are sea vessels acting maliciously, often while spoofing their AIS data. Multiple datasets exist that consist of satellite imagery of ships, however this dataset has the advantage of including each ships' corresponding AIS data. The simulated ships include both normal and anomalous behavior, whether the anomalous behavior is benign or malicious.

Categories:
480 Views

The Contest: Goals and Organization

The 2022 IEEE GRSS Data Fusion Contest, organized by the Image Analysis and Data Fusion Technical Committee, aims to promote research on semi-supervised learning. The overall objective is to build models that are able to leverage a large amount of unlabelled data while only requiring a small number of annotated training samples. The 2022 Data Fusion Contest will consist of two challenge tracks:

Track SLM:Semi-supervised Land Cover Mapping

Last Updated On: 
Mon, 03/07/2022 - 04:41

 

There exist several commonly used datasets in relation to object detection that include COCO (with multiple versions) and ImageNet containing large annotations for 80 and 1000 objects (i.e. classes) respectively. However, very limited datasets are available comprising specific objects identified by visually imapeired people (VIP) such as wheel-bins, trash-Bags, e-Scooters, advertising boards, and bollard. Furthermore, the annotations for these objects are not available in existing sources.

Categories:
239 Views

The greatest challenge of machine learning problems is to select suitable techniques and resources such as tools and datasets. Despite the existence of millions of speakers around the globe and the rich literary history of more than a thousand years, it is expensive to find the computational linguistic work related to Punjabi Shahmukhi script, a member of the Perso-Arabic context-specific script low-resource language family. The selection of the best algorithm for a machine learning problem heavily depends on the availability of a dataset for that specific task.

Categories:
219 Views

This dataset is used in the experiment of the paper "A Data Embedding Scheme for Efficient Program Behavior Modeling with Neural Networks" accepted by IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI). System calsl and their relevant branch sequences are contained in the tar.gz file. For a detailed description, please refer to the paper.

Categories:
169 Views

The ability to estimate the probability of a drug to receive approval in clinical trials provides natural advantages to optimizing pharmaceutical research workflows. Success rates of a clinical trials have deep implications to costs, duration of development, and under pressure due to stringent regulatory approval processes. We propose a machine learning approach that can predict the outcome of trial with reliable accuracies, using biological activities, physico-chemical properties of the compounds, target related features and NLP-based compound representation.

Categories:
823 Views

Tweets related to 10 different types of disasters were monitored from 28 September 2021 till 6 October 2021. 67528 rows containing 16 fields were extracted using Artificial Intelligence and Natural Language Processing Services of Microsoft.

Categories:
1026 Views

Drought has become one of the main challenges facing global agricultural production and crop safety. Drought stress will lead to the termination of crop photosynthesis and metabolic disorders, which will seriously affect the growth and development of crops. We aimed to study a method for identificaton of the drought stress in tomato seedlings using chlorophyll fluorescence imaging. In this study, chlorophyll fluorescence parameters and there corresponding chlorophyll fluorescence images of 4 different drought stress levels were collected.

Categories:
293 Views

The dataset is the synthesized absorbance spectrum of a set of 9-gas mixtures following Beer-Lambert Law. We used the mid-infrared absorption cross-section spectra of C2H6, CH4, CO, H2O, HBr, HCl, HF, N2O, NO from the high-resolution transmission molecular absorption (HITRAN) database. Each sample contains 1,000 observations equally spaced between 1 µm and 7 µm wavelengths. The concentrations of the nine gases are mutually uncorrelated and follow uniform distribution between 0-10 µM.

Categories:
34 Views

This is an accurately labeled dataset designed to support foreign bodies detection research in industrial production scenarios. In order to facilitate the reproduction of the results of the original paper, we provide this dataset for further research.

Categories:
548 Views

Pages