Artificial Intelligence

Source code for essential protein discovery

The dataset is used to detect essential protein in uncertain PPI network.

Categories:: Artificial Intelligence

97 Views

The PPI datasets,GO dataset, subcellular localization information and essential protein dataset

The PPI datasets were collected from four different sources: DIP, MIPS, Gavin, and Krogan. All self-interactions and repeated interactions were filtered. The essential proteins were collected from the following four different databases: MIPS,SGD,DEGand SGDP (http://www.sequence.stanford.edu/group/). Gene expression data were downloaded from the Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo/) with accession number GSE3431.

Categories:: Artificial Intelligence

362 Views

WIKIBIOCN

A Chinese dataset for table-to-text generation named WIKIBIOCN which inculeds 33,244 biography sentences with related tables from Chinese Wikipedia (July 2018).

The dataset is divided into training set (30,000), verification set (1000) and test set (2,244).

Categories:: Artificial Intelligence
Machine Learning

170 Views

IRAN Datasets

We conduct experiments with the datasets of SemEval 2014 task4 to evaluate our model, the SemEval 2014 datasets consist of reviews in two categories: Restaurant and Laptop, and the reviews contains three labels of sentiment polarity: {positive, negative, neutral}

Categories:: Artificial Intelligence

254 Views

Biomedical Keyphrase Extraction Dataset

We crawled large amounts of biomedical articles from PubMed for the keyphrase extraction system evaluation.

The articles, that consist of title, abstract and keyphrases provided by the authors, are used for the experiments.

In our paper, cancer-related biomedical articles are selected.

Categories:: Artificial Intelligence
Biomedical and Health Sciences

606 Views

Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model

Since there is no image-based personality dataset, we used the ChaLearn dataset for creating a new dataset that met the characteristics we required for this work, i.e., selfie images where only one person appears and his face is visible, labeled with the person's apparent personality in the photo.

Categories:: Artificial Intelligence
Image Processing
Machine Learning

3601 Views

Intrusion Detection in CAN bus

These datasets are used to detect Intrusions in Controller Area Network (CAN) bus. Intrusions are detected using various Machine Learning and Deep Learning algorithms.

Categories:: Artificial Intelligence
Machine Learning

2615 Views

GAN based synthesized audio dataset

With the development of audio synthesis techniques, the most state-of-art synthesis methods based on Generative Adversarial Network(GAN) have been proposed. Whether the automatic speaker verification (ASV) systems are vulnerability to the GAN based synthesized audios is urgently needed to be verified. We present a publicly available set of GAN based synthesized audios generated by some open source schemes (WaveGAN,TifGAN,GANSynth,MelGAN), which allows researches to verify impact of the GAN-synthetic audio on security of ASV systems.

Categories:: Artificial Intelligence
Signal Processing

666 Views

Long-term 3DC Dataset

Long-term 3DC Dataset as described in

Virtual Reality to Study the Gap Between Offline and Real-Time EMG-based Gesture Recognition

Categories:: Artificial Intelligence

1063 Views

ArPC a corpus for paraphrase identification in Arabic text

ArPC is an Arabic paraphrase identification corpus. It consists of 1331 sentence pairs along with their binary score that indicates weather the pairs are paraphrase or not. The corpus has been manually annotated by three Arabic native speakers.

Categories:: Artificial Intelligence
Machine Learning

762 Views