Name: The PPI datasets, GO dataset, subcellular localization information and essential protein dataset
Creator: Wei Liu
License: https://creativecommons.org/licenses/by/4.0/
Keywords: Artificial Intelligence

Abstract

The PPI datasets were collected from four different sources: DIP, MIPS, Gavin, and Krogan. All self-interactions and repeated interactions were filtered out. The essential proteins were collected from four databases: MIPS, SGD, DEG, and SGDP (http://www.sequence.stanford.edu/group/). Gene expression data were downloaded from the Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo/) under accession number GSE3431. The gene expression dataset covers 9336 genes over three metabolic cycles with a total of 36 time points. The GO data applied in our method were extracted from the GO Consortium. The subcellular localization information was obtained from the COMPARTMENTS database.
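As a rough illustration of how a 36-point expression profile spanning three cycles might be handled, the sketch below splits one gene's profile into cycles and averages them point by point. It assumes the three cycles are consecutive and of equal length (12 time points each), which the description above does not state explicitly; the function names and the sample profile are hypothetical and not part of the dataset.

```python
def split_into_cycles(profile, n_cycles=3):
    """Split one gene's expression profile into consecutive, equal-length cycles.

    Assumption: the cycles are stored back to back and have the same length.
    """
    if len(profile) % n_cycles != 0:
        raise ValueError("profile length is not divisible by n_cycles")
    step = len(profile) // n_cycles
    return [profile[i * step:(i + 1) * step] for i in range(n_cycles)]


def mean_cycle(profile, n_cycles=3):
    """Average the cycles point by point, giving one value per within-cycle time point."""
    cycles = split_into_cycles(profile, n_cycles)
    return [sum(values) / n_cycles for values in zip(*cycles)]


# Hypothetical profile with 36 time points (placeholder values, not real data).
profile = [float(t) for t in range(36)]
cycles = split_into_cycles(profile)
averaged = mean_cycle(profile)
```

Whether averaging across cycles is appropriate depends on the downstream method; the sketch only shows the bookkeeping for the 3 x 12 layout assumed here.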

Instructions:

All data have been preprocessed and can be opened as .txt or .xlsx files.

Comments

The PPI datasets were collected from four different sources: DIP, MIPS, Gavin, and Krogan. All self-interactions and repeated interactions were filtered out. The PPI datasets are saved as .txt files. The gene expression data, GO annotation data, and standard essential protein data are saved as .xlsx files. The subcellular localization information is also saved as a .txt file.
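The filtering of self-interactions and repeated interactions can be sketched as follows. This is a minimal illustration, not the procedure actually used to build the files: it assumes interactions are undirected (so A-B and B-A count as the same interaction), and the function name and protein identifiers are placeholders.

```python
def filter_interactions(pairs):
    """Drop self-interactions and repeated interactions from a list of protein pairs.

    Assumption: interactions are undirected, so (a, b) and (b, a) are duplicates.
    The first occurrence of each interaction is kept, in input order.
    """
    seen = set()
    kept = []
    for a, b in pairs:
        if a == b:
            continue  # self-interaction
        key = frozenset((a, b))  # order-independent key for the pair
        if key in seen:
            continue  # repeated interaction
        seen.add(key)
        kept.append((a, b))
    return kept


# Placeholder identifiers, not taken from the dataset.
raw = [("A", "B"), ("B", "A"), ("A", "A"), ("A", "B"), ("C", "A")]
filtered = filter_interactions(raw)
```

Since the distributed files are described as already filtered, running such a check on them should leave the interaction lists unchanged.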

Submitted by Wei Liu on Mon, 01/13/2020 - 04:39

Dataset Files

data.zip (5.14 MB)

Documentation

Attachment	Size
Read me.txt	363 bytes
